EMBL: AK168840

ID   AK168840; SV 1; linear; mRNA; HTC; MUS; 2903 BP.
XX
AC   AK168840;
XX
DT   09-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 10)
XX
DE   Mus musculus 17 days pregnant adult female amnion cDNA, RIKEN full-length
DE   enriched library, clone:I920059K06 product:villin 2, full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2903
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (14-APR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; 75049db255506b6df4e9a0bdf40d3cd3.
DR   Ensembl-Gn; ENSMUSG00000052397; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0023117; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0023085; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0023052; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0023088; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0022849; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0023532; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0022364; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0022815; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0022952; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0022926; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0023020; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0022944; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0023555; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0022111; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0022418; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000064234; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0045456; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0045435; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0045399; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0045407; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0045153; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0045889; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0045269; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0045056; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0045207; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0045172; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0045285; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0045148; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0045954; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0044852; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0044534; mus_musculus_wsbeij.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=I920059K06
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2903
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /sex="female"
FT                   /dev_stage="17 days pregnant adult"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="I920059K06"
FT                   /tissue_type="amnion"
FT                   /db_xref="taxon:10090"
FT   CDS_pept        114..1874
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="putative"
FT                   /note="villin 2 (MGD|MGI:98931 GB|BC048181, evidence:
FT                   BLASTN, 99%, match=2889)"
FT                   /db_xref="GOA:Q4KML7"
FT                   /db_xref="InterPro:IPR000299"
FT                   /db_xref="InterPro:IPR000798"
FT                   /db_xref="InterPro:IPR008954"
FT                   /db_xref="InterPro:IPR011174"
FT                   /db_xref="InterPro:IPR011259"
FT                   /db_xref="InterPro:IPR011993"
FT                   /db_xref="InterPro:IPR014352"
FT                   /db_xref="InterPro:IPR018979"
FT                   /db_xref="InterPro:IPR018980"
FT                   /db_xref="InterPro:IPR019747"
FT                   /db_xref="InterPro:IPR019748"
FT                   /db_xref="InterPro:IPR019749"
FT                   /db_xref="InterPro:IPR029071"
FT                   /db_xref="InterPro:IPR035963"
FT                   /db_xref="InterPro:IPR041789"
FT                   /db_xref="MGI:MGI:98931"
FT                   /db_xref="UniProtKB/TrEMBL:Q4KML7"
FT                   /protein_id="BAE40663.1"
FT                   /translation="MPKPINVRVTTMDAELEFAIQPNTTGKQLFDQVVKTIGLREVWYF
FT                   GLQYVDNKGFPTWLKLDKKVSAQEVRKENPVQFKFRAKFYPEDVAEELIQDITQKLFFL
FT                   QVKDGILSDEIYCPPETAVLLGSYAVQAKFGDYNKEMHKSGYLSSERLIPQRVMDQHKL
FT                   SRDQWEDRIQVWHAEHRGMLKDSAMLEYLKIAQDLEMYGINYFEIKNKKGTDLWLGVDA
FT                   LGLNIYEKDDKLTPKIGFPWSEIRNISFNDKKFVIKPIDKKAPDFVFYAPRLRINKRIL
FT                   QLCMGNHELYMRRRKPDTIEVQQMKAQAREEKHQKQLERQQLETEKKRRETVEREKEQM
FT                   LREKEELMLRLQDYEQKTKRAEKELSEQIEKALQLEEERRRAQEEAERLEADRMAALRA
FT                   KEELERQAQDQIKSQEQLAAELAEYTAKIALLEEARRRKEDEVEEWQHRAKEAQDDLVK
FT                   TKEELHLVMTAPPPPPPPVYEPVNYHVQEGLQDEGAEPMGYSAELSSEGILDDRNEEKR
FT                   ITEAEKNERVQRQLLTLSNELSQARDENKRTHNDIIHNENMRQGRDKYKTLRQIRQGNT
FT                   KQRIDEFEAM"
FT   regulatory      2884..2889
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      2903
FT                   /note="putative"
XX
SQ   Sequence 2903 BP; 788 A; 732 C; 802 G; 581 T; 0 other;
     gagggcttcc ctccgcctgc aggtcccgac gcgtctccgc cgtcgccgtc gccgtcgcct        60
     ccgccgtaca gccgaatagc cgaggaccca cgcagagcca ccaaccagcc aagatgccca       120
     agccaatcaa cgtccgggtg accaccatgg atgccgagct ggagtttgcc atccagccaa       180
     acaccaccgg gaagcagctc tttgatcagg tggtaaagac gattggcctc cgggaagtgt       240
     ggtacttcgg cctccagtat gtagacaata aaggatttcc tacctggctg aaacttgata       300
     aaaaggtctc cgcacaggag gtccgaaagg agaaccctgt ccagtttaaa ttccgggcca       360
     agttctaccc cgaagacgtg gccgaggaac tcatccagga catcacacag aagctcttct       420
     tcctgcaagt caaggacggg atcctcagcg acgagatcta ctgcccccca gagacagccg       480
     tgctcctggg ctcctatgcc gttcaggcca agttcggaga ttataacaag gaaatgcaca       540
     agtctgggta cctcagctcg gagcggctga tcccccagag agtcatggac caacacaagc       600
     tcagcaggga ccagtgggag gaccggatcc aggtgtggca cgcggaacac cgagggatgc       660
     tcaaggacag tgctatgcta gaatacctga agattgccca ggacctggaa atgtatggga       720
     tcaactattt cgagatcaaa aacaagaaag gaacagacct ttggcttgga gtcgatgccc       780
     ttggacttaa catttatgag aaagatgaca agttgacccc aaagatcggc ttcccttgga       840
     gtgagatcag gaacatctct ttcaacgaca agaagtttgt cattaagccc atcgacaaga       900
     aggcacctga ctttgtgttc tacgccccgc gcctgagaat taacaagcgg atcctgcaac       960
     tctgcatggg gaaccatgag ctgtacatgc gccgcaggaa gcccgacacc atcgaggtgc      1020
     agcagatgaa ggcccaggct cgggaggaga agcaccagaa gcagctagag cgacagcagt      1080
     tggaaaccga gaagaagagg cgagagacgg tggagagaga aaaggagcag atgctccggg      1140
     agaaggagga gctgatgctt cggctgcagg actacgagca gaagaccaag agggcggaga      1200
     aagagctctc cgagcagatt gagaaggccc tccaactgga ggaagagagg aggcgagccc      1260
     aggaggaggc tgaacgtctg gaggccgacc gcatggccgc cctgcgggcc aaggaagaac      1320
     tcgagagaca ggcgcaggat cagataaaga gccaggagca gctggctgca gagctggcag      1380
     agtacacggc caagatcgca ctgctggagg aggcgcggag gcgcaaggag gacgaggtag      1440
     aagagtggca gcaccgggct aaagaagccc aggacgacct ggtgaagacc aaagaggagc      1500
     tgcacctggt gatgacggcc ccaccgcccc caccgccccc agtgtatgag cctgtgaatt      1560
     accacgtgca ggagggactg caggacgagg gagcagagcc tatgggctac agtgccgagc      1620
     tctccagtga gggcatcctg gatgaccgca acgaggagaa gcggatcaca gaggcagaga      1680
     agaatgagcg cgtgcagcgg cagctgctga ccctgagcaa tgagttgtcc caggcccggg      1740
     atgagaacaa gaggacccac aatgacatca tccacaacga gaacatgcgg caaggcaggg      1800
     acaagtataa gacgctgcgg caaatcaggc agggcaacac caagcaacgc attgacgagt      1860
     tcgaggccat gtagaggcca ggctgggacc aagggcagag ggcacctcac tgcaggcagg      1920
     tgtcacactt ggctctttag ttctcttaag tttagacacc cccttgctgt gttccagtcc      1980
     cttaaagagc agttacgggg cctgcattct gccccgagac ccagtgggct cctccttggt      2040
     tccttctaat tgtatcacat agtgccaaac aggtcagatt taatgacagt taccgaatca      2100
     cttcctgttt ggagcaggga ttcggagggc tggccctcat gaagcaagat tcattgtcac      2160
     tgggacagca ctgtggctca cgggtgccat acttttctct agttttacaa tgagctcaaa      2220
     tcgattttgt tcttgatttt tatgaaggat ccatctctgt gtattgaggg gtaaaaatga      2280
     ttttgaaatt tgagtctaaa gcatgccccc acagagtctc cttcctccag accgctggca      2340
     gagtctccag gggtccttca gagtgtacgg tacaggactc tccgatacaa aattctcatg      2400
     cttatccgtt agcatacatt gttggactta tatatctaat gatctgctat agaggatagt      2460
     atttatatat aagcgataat atgggtttgt aacattagtt ttaaaaaagg gaaagttttg      2520
     ttctgtatat tttgttacct tttacagaat aaaagaattc aacattaaga accatgtaac      2580
     cgagacactt gatctgacac aggggcagtc gggaaaccga tgactgcagt aatcaccact      2640
     gtacaaaaat gttagtgggt tttgtgcacg taaaatgcac acttccattt cctgtcagtt      2700
     tcttatttga aaaaaaaaaa atacacaccg agagcatcca gactccagcc ctgaagggac      2760
     gccggcctct ttattgtgag caacttgttc ccgttttcag tttgtcttcc ttcctttcgc      2820
     ggactagcct ccttggttgt gatgtctgtc ctgtgcgggt gactccgcac cagatttaca      2880
     cacaataaaa gctctccaag gcg                                              2903
//
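
The entry above states its length in three places (the ID line, the source feature and the SQ line) and gives the coding region as 114..1874. A minimal sketch of how those figures could be cross-checked is shown below. It assumes only the three lines copied into RECORD_EXCERPT; the function name summarize and the dictionary keys are hypothetical, and the location parser handles a plain "start..end" span only, not joins or complements.

```python
import re

# Excerpt copied from the entry above; only the lines this sketch reads.
RECORD_EXCERPT = """\
ID   AK168840; SV 1; linear; mRNA; HTC; MUS; 2903 BP.
FT   CDS_pept        114..1874
SQ   Sequence 2903 BP; 788 A; 732 C; 802 G; 581 T; 0 other;
"""


def summarize(text):
    """Collect the declared length, summed base counts and CDS size (hypothetical helper)."""
    summary = {}
    for line in text.splitlines():
        if line.startswith("ID"):
            # The ID line ends with "<length> BP."
            summary["id_length"] = int(re.search(r"(\d+) BP\.", line).group(1))
        elif line.startswith("SQ"):
            # Sum the per-base counts (A, C, G, T, other) that follow the length.
            counts = re.findall(r"(\d+) (?:[ACGT]|other);", line)
            summary["base_total"] = sum(int(n) for n in counts)
        elif line.startswith("FT") and "CDS" in line:
            # Simple "start..end" location only; coordinates are 1-based, inclusive.
            start, end = map(int, re.search(r"(\d+)\.\.(\d+)", line).groups())
            summary["cds_nt"] = end - start + 1
    return summary


summary = summarize(RECORD_EXCERPT)
# Residues encoded, assuming the final codon of the span is a stop codon.
residues = summary["cds_nt"] // 3 - 1
print(summary, residues)
```

For this entry the base counts sum to the declared 2903 BP, and the 1761-nucleotide coding span corresponds to 586 residues plus a stop codon, which matches the length of the /translation qualifier.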