(data stored in ACNUC9306 zone)

EMBL: AK159717

ID   AK159717; SV 1; linear; mRNA; HTC; MUS; 2904 BP.
XX
AC   AK159717;
XX
DT   06-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 11)
XX
DE   Mus musculus osteoclast-like cell cDNA, RIKEN full-length enriched library,
DE   clone:I420028O12 product:villin 2, full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2904
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (30-MAR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; aacf834827d24a9d02dcca03fb814ba0.
DR   Ensembl-Gn; ENSMUSG00000052397; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0023117; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0023085; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0023052; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0023088; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0022849; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0023532; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0022364; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0022815; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0022952; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0022926; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0023020; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0022944; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0023555; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0022111; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0022418; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000064234; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0045456; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0045435; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0045399; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0045407; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0045153; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0045889; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0045269; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0045056; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0045207; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0045172; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0045285; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0045148; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0045954; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0044852; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0044534; mus_musculus_wsbeij.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Tissues were provided by Takashi Ishikawa  ( Department of Surgery
CC   2 Yokohama City University 3-9 Fukuura,Kanazawa-ku,Yokohama
CC   236-0004 Japan ) whose assistance we gratefully acknowledge.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=I420028O12
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2904
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="I420028O12"
FT                   /cell_type="osteoclast-like cell"
FT                   /db_xref="taxon:10090"
FT   CDS_pept        116..1876
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="putative"
FT                   /note="villin 2 (MGD|MGI:98931 GB|BC048181, evidence:
FT                   BLASTN, 99%, match=2885)"
FT                   /db_xref="GOA:Q4KML7"
FT                   /db_xref="InterPro:IPR000299"
FT                   /db_xref="InterPro:IPR000798"
FT                   /db_xref="InterPro:IPR008954"
FT                   /db_xref="InterPro:IPR011174"
FT                   /db_xref="InterPro:IPR011259"
FT                   /db_xref="InterPro:IPR011993"
FT                   /db_xref="InterPro:IPR014352"
FT                   /db_xref="InterPro:IPR018979"
FT                   /db_xref="InterPro:IPR018980"
FT                   /db_xref="InterPro:IPR019747"
FT                   /db_xref="InterPro:IPR019748"
FT                   /db_xref="InterPro:IPR019749"
FT                   /db_xref="InterPro:IPR029071"
FT                   /db_xref="InterPro:IPR035963"
FT                   /db_xref="InterPro:IPR041789"
FT                   /db_xref="MGI:MGI:98931"
FT                   /db_xref="UniProtKB/TrEMBL:Q4KML7"
FT                   /protein_id="BAE35312.1"
FT                   /translation="MPKPINVRVTTMDAELEFAIQPNTTGKQLFDQVVKTIGLREVWYF
FT                   GLQYVDNKGFPTWLKLDKKVSAQEVRKENPVQFKFRAKFYPEDVAEELIQDITQKLFFL
FT                   QVKDGILSDEIYCPPETAVLLGSYAVQAKFGDYNKEMHKSGYLSSERLIPQRVMDQHKL
FT                   SRDQWEDRIQVWHAEHRGMLKDSAMLEYLKIAQDLEMYGINYFEIKNKKGTDLWLGVDA
FT                   LGLNIYEKDDKLTPKIGFPWSEIRNISFNDKKFVIKPIDKKAPDFVFYAPRLRINKRIL
FT                   QLCMGNHELYMRRRKPDTIEVQQMKAQAREEKHQKQLERQQLETEKKRRETVEREKEQM
FT                   LREKEELMLRLQDYEQKTKRAEKELSEQIEKALQLEEERRRAQEEAERLEADRMAALRA
FT                   KEELERQAQDQIKSQEQLAAELAEYTAKIALLEEARRRKEDEVEEWQHRAKEAQDDLVK
FT                   TKEELHLVMTAPPPPPPPVYEPVNYHVQEGLQDEGAEPMGYSAELSSEGILDDRNEEKR
FT                   ITEAEKNERVQRQLLTLSNELSQARDENKRTHNDIIHNENMRQGRDKYKTLRQIRQGNT
FT                   KQRIDEFEAM"
FT   regulatory      2886..2891
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      2904
FT                   /note="putative"
XX
SQ   Sequence 2904 BP; 786 A; 732 C; 803 G; 583 T; 0 other;
     cagagggctt ccctccgcct gcaggtcccg acgcgtctcc gccgtcgccg tcgccgtcgc        60
     ctccgccgta cagccgaata gccgaggacc cacgcagagc caccaaccag ccaagatgcc       120
     caagccaatc aacgtccggg tgaccaccat ggatgccgag ctggagtttg ccatccagcc       180
     aaacaccacc gggaagcagc tctttgatca ggtggtaaag acgattggcc tccgggaagt       240
     gtggtacttc ggcctccagt atgtagacaa taaaggattt cctacctggc tgaaacttga       300
     taaaaaggtc tccgcacagg aggtccgaaa ggagaaccct gtccagttta aattccgggc       360
     caagttctac cccgaagacg tggccgagga actcatccag gacatcacac agaagctctt       420
     cttcctgcaa gtcaaggacg ggatcctcag cgacgagatc tactgccccc cagagacagc       480
     cgtgctcctg ggctcctatg ccgttcaggc caagttcgga gattataaca aggaaatgca       540
     caagtctggg tacctcagct cggagcggct gatcccccag agagtcatgg accaacacaa       600
     gctcagcagg gaccagtggg aggaccggat ccaggtgtgg cacgcggaac accgagggat       660
     gctcaaggac agtgctatgc tggaatacct gaagattgcc caggacctgg aaatgtatgg       720
     gatcaactat ttcgagatca aaaacaagaa aggaacagac ctttggcttg gagtcgatgc       780
     ccttggactt aacatttatg agaaagatga caagttgacc ccaaagatcg gcttcccttg       840
     gagtgagatc aggaacatct ctttcaacga caagaagttt gtcattaagc ccatcgacaa       900
     gaaggcacct gactttgtgt tctacgcccc gcgcctgaga attaacaagc ggatcctgca       960
     gctctgcatg gggaaccatg agctgtacat gcgccgcagg aagcccgaca ccatcgaggt      1020
     gcagcagatg aaggcccagg ctcgggagga gaagcaccag aagcagctag agcgacagca      1080
     gttggaaacc gagaagaaga ggcgagagac ggtggagaga gaaaaggagc agatgctccg      1140
     ggagaaggag gagctgatgc ttcggctgca ggactacgag cagaagacca agagggcgga      1200
     gaaagagctc tccgagcaga ttgagaaggc cctccaactg gaggaagaga ggaggcgagc      1260
     ccaggaggag gctgaacgtc tggaggccga ccgcatggcc gccctgcggg ccaaggaaga      1320
     actcgagaga caggcgcagg atcagataaa gagccaggag cagctggctg cagagctggc      1380
     agagtacacg gccaagatcg cactgctgga ggaggcgcgg aggcgcaagg aggacgaggt      1440
     agaagagtgg cagcaccggg ctaaagaagc ccaggacgac ctggtgaaga ccaaagagga      1500
     gctgcacctg gtgatgacgg ccccaccgcc cccaccgccc ccagtgtatg agcctgtgaa      1560
     ttaccacgtg caggagggac tgcaggacga gggagcagag cctatgggct acagtgccga      1620
     gctctccagt gagggcatcc tggatgaccg caacgaggag aagcggatca cagaggcaga      1680
     gaagaatgag cgcgtgcagc ggcagctgct gaccctgagc aatgagttgt cccaggcccg      1740
     ggatgagaac aagaggaccc acaatgacat catccacaac gagaacatgc ggcaaggcag      1800
     ggacaagtat aagacgctgc ggcaaatcag gcagggcaac accaagcaac gcattgacga      1860
     gttcgaggcc atgtagaggc caggctggga ccaagggcag agggcacctc actgcaggca      1920
     ggtgtcacac ttggctcttt agttctctta agtttagaca cccccttgct gtgttccagt      1980
     cccttaaaga gcagttacgg ggcctgcatt ctgccccgag acccagtggg ctcctccttg      2040
     gttccttcta attgtatcac atagtgccaa acaggtcaga tttaatgaca gttaccgaat      2100
     cacttcctgt ttggagcagg gattcggagg gctggccctc atgaagcaag attcattgtc      2160
     actgggacag cactgtggct cacgggtgcc atacttttct ctagttttac aatgagctca      2220
     aatcgatttt gttcttgatt tttatgaagg atccatctct gtgtattgag gggtaaaaat      2280
     gattttgaaa tttgagtcta aagcatgccc ccacagagtc tccttcctcc agaccgctgg      2340
     cagagtctcc aggggtcctt cagagtgtac ggtacaggac tctccgatac aaaattctca      2400
     tgcttatctg ttagcataca ttgttggact tatatatcta atgatctgct atagaggata      2460
     gtatttatat ataagcgata atatgggttt gtaacattag ttttaaaaaa gggaaagttt      2520
     tgttctgtat attttgttac cttttacaga ataaaagaat tcaacattaa gaaccatgta      2580
     accgagacac ttgatctgac acaggggcag tcgggaaacc gatgactgca gtaatcacca      2640
     ctgtacaaaa atgttagtgg gttttgtgca cgtaaaatgc acacttccat ttcctgtcag      2700
     tttcttattt gaaaaaaaaa aaatacacac cgagagcatc cagactccag ccctgaaggg      2760
     acgccggcct ctttattgtg agcaacttgt tcccgttttc agtttgtctt ccttcctttc      2820
     gcggactagc ctccttggtt gtgatgtctg tcctgtgcgg gtgactccgc accagattta      2880
     cacacaataa aagctctcca tggc                                             2904
//

If you have problems or comments...

PBIL Back to PBIL home page