(data stored in ACNUC7421 zone)

EMBL: M11047

ID   M11047; SV 1; linear; genomic DNA; STD; PRO; 4790 BP.
XX
AC   M11047; M11045-M11046;
XX
DT   02-JUL-1986 (Rel. 09, Created)
DT   17-APR-2005 (Rel. 83, Last updated, Version 5)
XX
DE   S.typhimurium araBAD operon: araB, araA, and araD genes coding for
DE   ribulokinase, L-arabinose isomerase, and L-ribulose-5-phosphate
DE   4-epimerase.
XX
KW   araA gene; araB gene; araBAD operon; araD gene; epimerase; isomerase;
KW   L-arabinose isomerase; L-ribulose-5-phosphate 4-epimerase; ribulokinase.
XX
OS   Salmonella enterica subsp. enterica serovar Typhimurium
OC   Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
OC   Enterobacteriaceae; Salmonella.
XX
RN   [1]
RP   1-1829
RX   DOI; 10.1016/0378-1119(85)90301-4.
RX   PUBMED; 2989100.
RA   Lin H.-C., Lei S.-P., Wilcox G.;
RT   "The araBAD operon of Salmonella typhimurium LT2. I. Nucleotide sequence of
RT   araB and primary structure of its product, ribulokinase";
RL   Gene 34(1):111-122(1985).
XX
RN   [2]
RP   1749-3342
RX   DOI; 10.1016/0378-1119(85)90302-6.
RX   PUBMED; 3891513.
RA   Lin H.-C., Lei S.-P., Wilcox G.;
RT   "The araBAD operon of Salmonella typhimurium LT2. II. Nucleotide sequence
RT   of araA and primary structure of its product, L-arabinose isomerase";
RL   Gene 34(1):123-128(1985).
XX
RN   [3]
RP   3271-4790
RX   DOI; 10.1016/0378-1119(85)90303-8.
RX   PUBMED; 3891514.
RA   Lin H.-C., Lei S.-P., Wilcox G.;
RT   "The araBAD operon of Salmonella typhimurium LT2. III. Nucleotide sequence
RT   of araD and its flanking regions, and primary structure of its product,
RT   L-ribulose-5-phosphate 4-epimerase";
RL   Gene 34(1):129-134(1985).
XX
DR   MD5; 4c99dde07ff10335c1554ac69baf2860.
DR   EuropePMC; PMC368370; 15006759.
XX
CC   The sequence preceding araB coding region is part of the
CC   controlling region between the araC gene and araBAD operon.  A
CC   potential ribosome binding site for the araB gene is located at
CC   positions 109-112. A 10-bp intercistronic region is located between
CC   the araB and araA genes. A potential ribosome binding site,
CC   'taagga', is located 7 bp distal from the start codon of araA. The
CC   site overlaps the stop codon of araB .
CC   A 143-bp intercistronic region exists between the araA and araD
CC   genes. The presumed ribosome binding site for araD is located at
CC   positions 3473-3475.  This region contains several short
CC   complementary repeated sequences which can form stable stem-loop
CC   secondary structures. There is also a stem-loop structure 80 bp
CC   beyond the stop codon of araD which is followed by an A+T-rich
CC   sequence.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..4790
FT                   /organism="Salmonella enterica subsp. enterica serovar
FT                   Typhimurium"
FT                   /mol_type="genomic DNA"
FT                   /db_xref="taxon:90371"
FT   mRNA            93..>4229
FT                   /note="araBAD operon mRNA"
FT   CDS_pept        120..1829
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /gene="araB"
FT                   /product="ribulokinase"
FT                   /EC_number="2.7.1.16"
FT                   /db_xref="GOA:P06188"
FT                   /db_xref="InterPro:IPR005929"
FT                   /db_xref="InterPro:IPR018485"
FT                   /db_xref="UniProtKB/Swiss-Prot:P06188"
FT                   /protein_id="AAA27023.1"
FT                   /translation="MAIAIGLDFGSDSVRALAVDCATGDEIATSVEWYPRWQEGRYCDG
FT                   PNNQFRHHPRDYMESMEAALKAVLAQLSAAQRANVVGIGVDSTGSTPAPIDADGNVLAL
FT                   RPEFAENPNAMFVLWKDHTAVEEADEITRLCHKPGKVDYSRYIGGIYSSEWFWAKILHV
FT                   TRQDSAVAQAAVSWIELCDWVPALLSGTTRPQDIRRGRCSAGHKTLWHESWGGLPPASF
FT                   FDELDPCINRHLRYPLFSETFTADLPVGTLCAEWAQRLDLPESVVISGGAFDCHMGAVG
FT                   AGAQPNTLVKVIGTSTCDILIADKQSVGDRAVKGICGQVDGSVVPNFIGLEAGQSAFGD
FT                   IYAWFSRVLSWPLEQLAAQHPELKPQINASQKQLLPALTDAWAKNPSLDHLPVVLDWFN
FT                   GRRTPNANQRLKGVITDLNLATDAPALFGGLVASTAFGARAIQECFTDQGIAVNNVMAL
FT                   GGIARKNQVIMQVCCDVLNRPLQIVASDQCCALGAAIFAAVAAKVHADIPAAQQSMASA
FT                   VERTLRPHPEQAQRFEQLYRRYQQWALSAEQHYLPTAAPAPTTPANQAILTH"
FT   CDS_pept        1840..3342
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /gene="araA"
FT                   /product="L-arabinose isomerase"
FT                   /EC_number="5.3.1.4"
FT                   /db_xref="GOA:P06189"
FT                   /db_xref="InterPro:IPR003762"
FT                   /db_xref="InterPro:IPR004216"
FT                   /db_xref="InterPro:IPR009015"
FT                   /db_xref="InterPro:IPR024664"
FT                   /db_xref="InterPro:IPR038583"
FT                   /db_xref="UniProtKB/Swiss-Prot:P06189"
FT                   /protein_id="AAA27024.1"
FT                   /translation="MTIFDNYEVWFVIGSQHLYGAETLRQVTQHAEHVVNALNTEAKLP
FT                   CKLVLKPLGTSPDEITAICRDANYDDRCAGLVVWLHTFSPAKMWINGLSILNKPLLQFH
FT                   TQFNAALPWDSIDMDFMNLNQTAHGGREFGFIGARMRQQHAVVTGHWQDKEAHTRIGAW
FT                   MRQAVSKQDTRQLKVCRFGDNMREVAVTDGDKVAAQIKFGFSVNTWAVGDLVQVVNSIG
FT                   DGDINALIDEYESSYTLTPATQIHGDKRQNVREAAGIELGMKRFLEQGGFHAFTTTFED
FT                   LHGLKQLPGLAVQRLMQQGYGFAGEGDWKTAALLRIMKVMSTGLQGGTSFMEDYTYHFE
FT                   KGNDLVLGSHMLEVCPSIAVEEKPILDVQHLGIGGKEDPARLIFNTQTGPAIVASLIDL
FT                   GDRYRLLVNCIDTVKTPHSLPKLPVRNALWKAQPDLPTASEAWILAGGAHHTVFSHALD
FT                   LNDMRQFAEIHDIEIAVIDNDTHLPAFKDALRWNEVYYGFKR"
FT   CDS_pept        3483..4229
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /gene="araD"
FT                   /product="L-ribulose-5-phosphate 4-epimerase"
FT                   /EC_number="5.1.3.4"
FT                   /db_xref="GOA:P06190"
FT                   /db_xref="InterPro:IPR001303"
FT                   /db_xref="InterPro:IPR004661"
FT                   /db_xref="InterPro:IPR033748"
FT                   /db_xref="InterPro:IPR036409"
FT                   /db_xref="UniProtKB/Swiss-Prot:P06190"
FT                   /protein_id="AAA27025.1"
FT                   /translation="MLEDLKRQVLEANLALPKHNLVTLTWGNVSAVDRERGVLVIKPSG
FT                   VDYSVMTADDMVVVSLESGEVVEGHKKPSSDTPTHRLLYQAFPTIGGIVHTHSRHATIW
FT                   AQAGQPIPATGTTHADYFYGTIPCTRKMTEAEINGEYEWETGNVIVETFEKQGIDAAQM
FT                   PGVLVHSHGPFAWGKNAEDAVHNAIVLEEVAYMGIFCRHLRRSCPTCSNPCWINTIYAN
FT                   TAQKPITGSNASKNASHGGRVDESGR"
XX
SQ   Sequence 4790 BP; 1096 A; 1345 C; 1305 G; 1044 T; 0 other;
     cgtcacactt tgcaaagcat tagcattttt gtccataaga ttagcggatc ctgcctgacg        60
     gtttttgccg cgactctcta ctgtttctcc atacctgttt ttctggatgg agtaagacga       120
     tggcaattgc aattggcctc gattttggca gtgattcagt gcgcgctctg gcagtggact       180
     gcgccaccgg cgacgagatc gccaccagcg tagagtggta tccgcgctgg caagaaggcc       240
     gttattgcga cggcccgaac aaccagttcc gtcatcatcc gcgcgactac atggagtcaa       300
     tggaggccgc gctgaaagcc gttctggcac aattaagcgc cgcgcaacgc gcaaatgtcg       360
     ttggcattgg cgttgacagc accggctcta cgccagcgcc gattgacgcc gacggtaacg       420
     tcctggcgct gcgtccagag ttcgccgaga acccgaatgc gatgtttgtg ctgtggaaag       480
     atcacaccgc cgtggaagag gccgacgaaa tcactcgtct gtgccataag ccaggcaagg       540
     tcgactactc ccgctatatt ggcggcattt actccagcga atggttctgg gcgaagattc       600
     tgcacgtcac ccggcaggat agcgccgtcg cgcaggccgc cgtctcgtgg attgagctgt       660
     gcgactgggt gccggcgctg ctttccggca ccactcgccc gcaggatatc cgccgtggcc       720
     gctgcagcgc cgggcacaaa acgctgtggc atgaaagctg gggcggtctg ccgcccgcga       780
     gcttctttga tgaactcgat ccgtgcatta accgtcatct gcgctacccg ttatttagcg       840
     aaaccttcac cgccgatctg cccgtgggca ccctgtgcgc cgaatgggcg cagcgcctcg       900
     acttgccgga aagcgtagtg atttccggcg gcgcgttcga ctgtcacatg ggcgcggtcg       960
     gcgcgggcgc acagcccaat acgctggtga aagtcatcgg cacgtctacc tgcgacattc      1020
     tgattgcgga taaacagagc gtcggggatc gcgccgtgaa aggcatttgc ggtcaggttg      1080
     acggcagcgt ggtgccgaac tttatcggtc tggaagcggg gcaatctgct ttcggcgata      1140
     tctacgcctg gtttagccgc gtgttgagct ggccgctgga gcaacttgcc gcgcagcacc      1200
     cggaactgaa accccagatt aacgccagcc agaagcagct actgccagcg ctcaccgacg      1260
     cctgggcgaa aaatccgtcc ctggatcacc tgccggtggt gctcgactgg tttaacggtc      1320
     gccgcacgcc aaacgctaat cagcgtctga aaggcgtcat taccgatctc aatctcgcca      1380
     ccgacgcgcc agcgctgttt ggcggtctgg tcgcttcgac cgccttcggc gcgcgcgcca      1440
     ttcaggagtg ttttaccgat cagggtatcg cggtcaataa cgtgatggcg cttggcggca      1500
     tcgcccgtaa aaatcaggtc attatgcagg tctgctgcga cgtactgaat cgtccgttgc      1560
     agatcgtcgc ttccgaccag tgttgcgcat taggcgccgc tatctttgcc gccgtcgctg      1620
     cgaaagtcca tgccgacatt ccagccgccc agcaaagcat ggcgagcgcg gtagaacgca      1680
     ctctgcgccc ccaccctgaa caggcgcaac gcttcgaaca gctttaccgc cgctaccagc      1740
     agtgggcgct aagcgcagaa caacattatc ttccgactgc cgcgccggcg ccaacgaccc      1800
     cggccaatca ggcaatcctg actcattaag gacacgacaa tgacgatttt tgataattat      1860
     gaagtatggt ttgtgattgg cagccagcat ttgtatggcg cagaaaccct gcgtcaggtc      1920
     acccaacatg ccgagcatgt ggtcaacgcg ctgaataccg aagccaaact gccatgtaaa      1980
     ctggtattaa aaccgctggg cacctcgccg gatgagatta ccgccatttg tcgtgacgcc      2040
     aattatgacg atcgctgcgc agggctggtg gtctggctgc acaccttctc cccggccaaa      2100
     atgtggatca acgggctgag tatccttaac aaaccactac tgcaattcca tacccaattt      2160
     aacgccgccc tgccgtggga cagcattgat atggacttta tgaacctgaa ccagactgcg      2220
     cacggcggtc gtgagttcgg ttttatcggc gcgcggatgc gccagcagca cgcggtcgtc      2280
     accggtcact ggcaggataa agaggcccat acgcgtatcg gtgcctggat gcgccaggcg      2340
     gtctctaaac aggatacccg ccagctaaaa gtctgccgct tcggcgacaa tatgcgtgaa      2400
     gtcgcagtga ctgacggtga taaagtggcc gcgcaaatca aatttggctt ttcggtcaat      2460
     acctgggcgg tcggcgatct ggtgcaggtg gtgaattcta tcggcgacgg cgatatcaac      2520
     gctctgattg acgagtatga aagcagctat accctgacgc ccgccaccca aatccacggc      2580
     gataaacgcc agaacgtgcg ggaggcggcg ggtattgaac tcggtatgaa gcgtttcctg      2640
     gaacagggcg gcttccacgc attcactact acctttgaag atttacacgg tctgaaacag      2700
     cttccgggtc tggccgtaca gcgtctgatg cagcaaggct acggctttgc gggcgaaggc      2760
     gactggaaaa ccgccgctct gcttcgcatt atgaaagtga tgtcaaccgg tctgcagggc      2820
     ggcacctcat ttatggagga ttacacctac cacttcgaga aaggcaacga tctggtgctc      2880
     ggctcgcaca tgctggaagt gtgtccgtcc atcgcggtgg aagagaaacc gatcctcgac      2940
     gtccagcacc tcggcattgg cggcaaggaa gatccggcgc gtttgatttt caatacccaa      3000
     accggcccgg cgatcgtcgc cagcctgatc gacctcggcg atcgttatcg cctgctggtc      3060
     aactgcattg acaccgtaaa aacgccgcac tccctgccga aactgccggt gcgtaacgcg      3120
     ctgtggaagg cgcagccgga tctgccgacc gcctccgaag cgtggattct ggctggcggc      3180
     gcgcaccata ccgtcttcag ccacgcgctg gatctgaacg atatgcgcca gtttgcagaa      3240
     atacacgata tcgaaatcgc ggtgattgat aacgataccc atctgccggc ctttaaggac      3300
     gcgctgcgct ggaacgaggt gtattacggg ttcaaacgtt aattggtgaa acggattgcc      3360
     cggtggcact gcgtttaccg ggcctacggt cctgtaggcc gaataaggca tttatgtcgc      3420
     catccggcac accgtcgctc gtaggccgga taagcgaagc gccatccggc agggagaaaa      3480
     caatgttaga agatctcaaa cgccaggtac tggaagctaa tctggcgctg ccaaaacaca      3540
     acctggtcac ccttacctgg ggtaacgtta gcgccgtcga tcgcgaacgc ggcgtactgg      3600
     tgattaagcc gtccggcgtc gattatagcg tcatgaccgc tgacgatatg gtggtggtca      3660
     gcctggagag cggtgaagtc gttgaaggtc ataagaaacc gtcgtccgat acgccaaccc      3720
     accgtctgtt gtaccaggca tttccgacta tcggcggcat cgtacacacc cattcgcgcc      3780
     acgcgactat ctgggcgcag gcgggtcagc caattccggc gacgggaacc acccatgccg      3840
     actatttcta cggtacgatt ccctgcactc gcaaaatgac cgaggcggaa attaatggcg      3900
     agtatgaatg ggaaacgggc aatgtcattg ttgaaacctt tgaaaaacaa ggcattgacg      3960
     ccgctcaaat gcccggcgtg cttgtccatt cgcacggccc gtttgcctgg ggtaaaaatg      4020
     ccgaggatgc agtgcataac gccatcgtgc tggaagaagt ggcctatatg gggatcttct      4080
     gccgccactt gcgccgcagt tgcccgacat gcagcaatcc ctgctggata aacactatct      4140
     acgcaaacac ggcgcaaaag cctattacgg gcagtaatgc ctctaaaaac gcgtcccatg      4200
     gggggcgcgt tgatgaatct ggtcggtgat atattcagca aatgcgcttt gatagacgta      4260
     atgatcagaa ctcacatatt caataatatt gtcataatgt ccctgccacg cttttccttc      4320
     cagcgcatgg aagaaaatat aatcttcgat tgttgactgc cagcgttgcc catttaacag      4380
     atagttaata atggtatccc gatgtccgtt ttttctgtcg tgtccttgcc agtgaaaaaa      4440
     agcattgccg ttttcaataa tctcggtacg ccaaatctgt tctgtccatg ttttatactc      4500
     aaaaaatcga ctcacggttt ttatggaagg gttagcgcgt tgagtattga cgaaaagata      4560
     acggtcgttc cctaccagac gcgcctgcat actcacattc ataaaagatc attcccgaat      4620
     accacaaatt ttgataaaaa cacccgcacc cgaaagtcaa aataaaatta tattctaaaa      4680
     taaaaattaa attatgcaga gagttcccga cgaattcgca ctgtaatcca tttttattta      4740
     accatagcgg ccaattggaa tattatattt ctacctgacg gtgcggatgt                 4790
//

If you have problems or comments...

PBIL Back to PBIL home page