EMBL: M23433

ID   M23433; SV 1; linear; genomic DNA; STD; INV; 3797 BP.
XX
AC   M23433;
XX
DT   06-JUL-1989 (Rel. 20, Created)
DT   14-NOV-2006 (Rel. 89, Last updated, Version 4)
XX
DE   C.elegans polyubiquitin (UbiA) gene, complete cds.
XX
KW   polyubiquitin; ubiquitin.
XX
OS   Caenorhabditis elegans
OC   Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC   Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC   Caenorhabditis.
XX
RN   [1]
RP   1-3797
RX   PUBMED; 2538720.
RA   Graham R.W., Jones D., Candido E.P.M.;
RT   "UbiA, the major polyubiquitin locus in Caenorhabditis elegans, has unusual
RT   structural features and is constitutively expressed";
RL   Mol. Cell. Biol. 9(1):268-277(1989).
XX
DR   MD5; ae27556b4c35537cf33254acea65b364.
DR   Ensembl-Gn; WBGene00006727; caenorhabditis_elegans.
DR   Ensembl-Tr; F25B5.4a.1; caenorhabditis_elegans.
DR   Ensembl-Tr; F25B5.4a.2; caenorhabditis_elegans.
XX
CC   Draft entry and computer-readable sequence for [1] kindly submitted
CC   by E.P.M.Candido, 07-APR-1989.  UbiA is transcribed as a
CC   polycistronic mRNA which contains 11 tandem repeats of ubiquitin
CC   sequence and possesses a 2-amino-acid carboxy-terminal extension on
CC   the final repeat.  Mature UbiA mRNA acquires a 22-nucleotide leader
CC   sequence via a trans-splicing reaction involving a 100-nucleotide
CC   splice leader RNA derived from a different chromosome.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3797
FT                   /organism="Caenorhabditis elegans"
FT                   /map="chromosome 3"
FT                   /mol_type="genomic DNA"
FT                   /db_xref="taxon:6239"
FT   repeat_region   45..56
FT                   /note="region of dyad symmetry"
FT   prim_transcript 412..3797
FT                   /note="polyubiquitin mRNA and introns"
FT   misc_feature    862..863
FT                   /note="splice site"
FT   CDS_pept        join(867..1005,1059..1742,1791..2474,2523..3206,3256..3581)
FT                   /codon_start=1
FT                   /gene="polyubiquitin"
FT                   /db_xref="GOA:P0CG71"
FT                   /db_xref="InterPro:IPR000626"
FT                   /db_xref="InterPro:IPR019954"
FT                   /db_xref="InterPro:IPR019956"
FT                   /db_xref="InterPro:IPR029071"
FT                   /db_xref="UniProtKB/Swiss-Prot:P0CG71"
FT                   /protein_id="AAA28154.1"
FT                   /translation="MQIFVKTLTGKTITLEVEASDTIENVKAKIQDKEGIPPDQQRLIF
FT                   AGKQLEDGRTLSDYNIQKESTLHLVLRLRGGMQIFVKTLTGKTITLEVEASDTIENVKA
FT                   KIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGMQIFVKTLTGK
FT                   TITLEVEASDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLV
FT                   LRLRGGMQIFVKTLTGKTITLEVEASDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDG
FT                   RTLSDYNIQKESTLHLVLRLRGGMQIFVKTLTGKTITLEVEASDTIENVKAKIQDKEGI
FT                   PPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGMQIFVKTLTGKTITLEVEA
FT                   SDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGMQ
FT                   IFVKTLTGKTITLEVEASDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNI
FT                   QKESTLHLVLRLRGGMQIFVKTLTGKTITLEVEASDTIENVKAKIQDKEGIPPDQQRLI
FT                   FAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGMQIFVKTLTGKTITLEVEASDTIENVK
FT                   AKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGMQIFVKTLTG
FT                   KTITLEVEASDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHL
FT                   VLRLRGGMQIFVKTLTGKTITLEVEASDTIENVKAKIQDKEGIPPDQQRLIFAGKQLED
FT                   GRTLSDYNIQKQSTLHLVLRLRGGDI"
FT   exon            <867..1005
FT                   /gene="polyubiquitin"
FT                   /number=1
FT   repeat_region   867..1147
FT                   /note="UbiA repeat unit 1"
FT   intron          1006..1058
FT                   /note="UbiA intron A"
FT   exon            1059..1742
FT                   /gene="polyubiquitin"
FT                   /number=2
FT   repeat_region   1148..1375
FT                   /note="UbiA repeat unit 2"
FT   repeat_region   1376..1603
FT                   /note="UbiA repeat unit 3"
FT   repeat_region   1604..1879
FT                   /note="UbiA repeat unit 4"
FT   intron          1743..1790
FT                   /note="UbiA intron B"
FT   exon            1791..2474
FT                   /gene="polyubiquitin"
FT                   /number=3
FT   repeat_region   1880..2107
FT                   /note="UbiA repeat unit 5"
FT   repeat_region   2108..2335
FT                   /note="UbiA repeat unit 6"
FT   repeat_region   2336..2611
FT                   /note="UbiA repeat unit 7"
FT   intron          2475..2522
FT                   /note="UbiA intron C"
FT   exon            2523..3206
FT                   /gene="polyubiquitin"
FT                   /number=4
FT   repeat_region   2612..2839
FT                   /note="UbiA repeat unit 8"
FT   repeat_region   2840..3067
FT                   /note="UbiA repeat unit 9"
FT   repeat_region   3068..3344
FT                   /note="UbiA repeat unit 10"
FT   intron          3207..3255
FT                   /note="UbiA intron D"
FT   exon            3256..>3581
FT                   /gene="polyubiquitin"
FT                   /number=5
FT   repeat_region   3345..3572
FT                   /note="UbiA repeat unit 11"
XX
SQ   Sequence 3797 BP; 1092 A; 908 C; 768 G; 1029 T; 0 other;
     ccccttcttc ctcatcattt ctttccctac acagcactct agaatgttct tcttgtgcag        60
     aaagagtgcc gtttgagtca gcgacccccc ccccccccct cctttctctt gctcttccta       120
     ctggttctcg taataggcga cttcttgcta acagaaagtg agcatagcaa cattttttac       180
     tttgtggcct tcaataatac gtgcgtcgtt taattagaat gtttgagtaa agttcaacgt       240
     gtagattcaa tattcacgtt ttgggcgctc tttaatttat tactgtcaag aatcagttta       300
     ccaaacggtg agtttctttt ttttttgtct aattgtaaga tttagcgggg taaaaccaac       360
     agaaatgtca tgcttttttg aataatctca atcagttgtt atatgaatta ttttcccatt       420
     ttagcaatac tgcttggtag tatttcggtc agagaaacga ggacatcagc tgaacatctg       480
     cgtctctaac aacactcggg aagcgagtca gtgtgcgcgt gcgttggggt tttatccgat       540
     cgttgagcgg gcatacagca gtcatacacc ccattcgacc agactccgct cgcgtgccac       600
     cttgtctcca ttctcatttc acttgtctct actcggacat tactcctcat cgatagctct       660
     ttactaccat tttacttttt atgcctttct ttttcgtttg acttgcctat acgagtgggg       720
     acaagtttgc tttgttagtc ttagctagtg tatcgatttt ttgggtaata tttcgcaact       780
     ttctaggact ttctttcata atcacctctt ctctcgcctc ctcattccag ttttattcgc       840
     actcattttc tattttttca gcaatcatgc aaatcttcgt caaaacgttg actggaaaaa       900
     ctatcaccct ggaggtggag gcttccgata ccatcgaaaa tgtcaaagcc aagatccaag       960
     acaaggaagg aattccacca gatcagcaga gacttatttt tgctggtacg ttggcaaaat      1020
     atctaatatt tgacctaaaa tttattatat attttcagga aagcaactcg aggatggccg      1080
     taccctttcg gattacaata tccagaagga atcaaccctc catttggtcc tccgcctaag      1140
     aggaggaatg cagatcttcg tcaagacttt gaccggaaag actattacac ttgaggttga      1200
     agcttctgac actatcgaga atgtgaaggc caagatccaa gacaaggaag gtatccctcc      1260
     ggatcaacag cgtttgatct ttgccggaaa gcaactcgag gatggccgta ctctctccga      1320
     ttacaacatc caaaaggagt ctactcttca tctggttctg cgtctccgag gaggaatgca      1380
     aatcttcgtc aagactctta ctggaaagac catcaccctc gaagtcgaag cctccgatac      1440
     catcgagaac gtgaaggcca agattcagga caaggaagga attccaccag atcagcagcg      1500
     tctcatcttc gccggaaagc agctcgagga cggccgcacc ctttctgact acaacatcca      1560
     gaaggaatct actcttcact tggttcttcg tttgagagga ggaatgcaga tctttgtcaa      1620
     gactttgact ggaaagacca tcacacttga agttgaagct tccgacacga tcgagaacgt      1680
     caaggccaag attcaagaca aggagggaat cccgccagat cagcagcgtc ttatctttgc      1740
     tggtatgtta catataacaa attttgttca tgagagacta attttttcag gaaagcaatt      1800
     ggaagatgga cgcacactct ctgattacaa tattcagaaa gagtctactc tccacttggt      1860
     gctccgtctc agaggaggta tgcagatctt cgtcaagaca ttgactggaa agaccatcac      1920
     acttgaagtc gaagcttccg acacgatcga aaatgtcaag gctaagattc aagataaaga      1980
     aggaatccca ccagatcagc aaagacttat cttcgccgga aagcagctcg aggacggccg      2040
     caccctttcg gactacaaca tccagaagga atcaactctt catttggttc tccgtttgag      2100
     aggaggtatg cagatcttcg tcaagacatt gaccggaaag accatcaccc tcgaagtcga      2160
     agcctccgac accatcgaaa atgtcaaggc caagatccaa gacaaggaag gaattcctcc      2220
     agatcagcaa cgtctcatct tcgctggaaa gcagctcgaa gacggccgca ccctttcgga      2280
     ctacaacatc cagaaggaat caactcttca tttggttctc cgtttgagag gaggtatgca      2340
     aatcttcgtg aagactttga ctggaaagac tatcaccctc gaagtcgaag cttctgatac      2400
     catcgaaaat gtgaaggcca agatccagga caaggaagga atcccaccag atcagcagcg      2460
     tcttatcttt gccggtagct tatatagata tacataactc aaatcaacta ttattatttc      2520
     aggaaagcaa ttggaagatg ggcgcacgct ctctgattac aacatccaga aggaatctac      2580
     tcttcacttg gttctccgtc tccgaggagg aatgcagatc ttcgtcaaga cattgactgg      2640
     aaagaccatc acacttgaag tcgaagcctc tgataccatc gagaatgtga aggccaagat      2700
     tcaagacaag gaaggaatcc caccagatca gcagagactc atcttcgccg gaaaacaact      2760
     cgaagacggt cgtaccctct ccgactacaa catccaaaag gagtctactc ttcatttggt      2820
     tctccgtctg agaggaggta tgcagatctt cgtcaagact cttactggaa agaccatcac      2880
     acttgaagtc gaagcctctg ataccatcga gaatgtgaag gccaagattc aagacaagga      2940
     aggaatccca ccagatcagc agcgcttgat cttcgccgga aaacaacttg aagacggtcg      3000
     taccctttcc gactacaaca ttcaaaagga gtctactctt catttggttc tccgtctgag      3060
     aggaggtatg cagatcttcg tcaagacatt gaccggaaag accatcaccc tcgaagtcga      3120
     agcctccgac accatcgaaa atgtcaaggc caagatccaa gacaaggaag gaattccacc      3180
     agatcagcag agacttattt tcgctggtga gttcatattg ttttagaatt aaaactaatt      3240
     tttattgttt ttcaggaaag caactcgagg atggccgtac cctttcggac tacaatatcc      3300
     agaaggagtc tactcttcat ttggtgctcc gtctcagagg aggtatgcag atcttcgtca      3360
     agactttgac tggaaaaacc atcactctcg aggtcgaagc ttcggacacc attgagaatg      3420
     tcaaagccaa aatccaggat aaggagggaa tcccaccaga tcagcaacgt ttgatctttg      3480
     ctggaaagca gctcgaggat ggacgcactc tatccgatta caacatccaa aagcagtcga      3540
     cacttcatct cgttcttcgt cttcgcggag gagacattta aatcgaaccc atcaattcac      3600
     tcgttattcc tcctcgagat ctccgttcaa gtaacaatta tttattcttt attcttcggg      3660
     aatttctgta ttttaatgaa cgagctctga ataaattcat tttcgtgtac tcaaacgatt      3720
     tatctttatc tttaacaata acaaacaaca aagataacta ctctatgaag tgtaaggttc      3780
     aactatattt atagatc                                                     3797
//
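
The CC lines above describe 11 tandem ubiquitin repeats with a 2-amino-acid
carboxy-terminal extension. The following is a minimal sketch, not part of the
database entry, that checks whether the CDS_pept location is arithmetically
consistent with that description. The location string is copied from the
feature table; the helper names (parse_join, cds_length) are hypothetical, and
the 76-residue ubiquitin unit length is read off the /translation qualifier
(MQIF...LRGG). The sketch also assumes the last codon of the CDS is the stop
codon, which matches the "taa" at positions 3579..3581 of the sequence.

```python
import re

# Location copied verbatim from the CDS_pept feature of entry M23433.
CDS_LOCATION = "join(867..1005,1059..1742,1791..2474,2523..3206,3256..3581)"

UBIQUITIN_LENGTH = 76      # residues per repeat, counted in /translation
N_REPEATS = 11             # from the CC lines
C_TERMINAL_EXTENSION = 2   # from the CC lines (the final "DI")


def parse_join(location):
    """Return (start, end) pairs (1-based, inclusive) from a simple join()."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def cds_length(segments):
    """Total number of nucleotides covered by the joined segments."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_join(CDS_LOCATION)
total_nt = cds_length(segments)
codons, remainder = divmod(total_nt, 3)

# Assumption: the final codon is the stop codon and is not translated.
protein_length = codons - 1
expected_length = N_REPEATS * UBIQUITIN_LENGTH + C_TERMINAL_EXTENSION

print("segments:", segments)
print("CDS length (nt):", total_nt, "remainder:", remainder)
print("protein length:", protein_length, "expected:", expected_length)
```

Under these assumptions the five exons sum to 2517 nucleotides, a whole number
of codons, and the 838 translated residues equal 11 x 76 + 2.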