EMBL: AK172336

ID   AK172336; SV 1; linear; mRNA; HTC; MUS; 2573 BP.
XX
AC   AK172336;
XX
DT   09-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 10)
XX
DE   Mus musculus activated spleen cDNA, RIKEN full-length enriched library,
DE   clone:F830203B19 product:villin 2, full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2573
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (14-APR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; ec7bffbc6e66e2fe581719dd35b3f850.
DR   Ensembl-Gn; ENSMUSG00000052397; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0023117; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0023085; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0023052; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0023088; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0022849; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0023532; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0022364; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0022815; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0022952; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0022926; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0023020; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0022944; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0023555; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0022111; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0022418; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000064234; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0045456; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0045435; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0045399; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0045407; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0045153; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0045889; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0045269; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0045056; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0045207; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0045172; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0045285; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0045148; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0045954; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0044852; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0044534; mus_musculus_wsbeij.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Tissues were provided by Dr. John Todd (Dept. of Medical Genetics
CC   Wellcome Trust Centre for Molecular Mechanisms in Disease Wellcome
CC   Trust/MRC building Addenbrookes Hospital Cambridge) whose
CC   assistance we gratefully acknowledge.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=F830203B19
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2573
FT                   /organism="Mus musculus"
FT                   /strain="NOD"
FT                   /mol_type="mRNA"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="F830203B19"
FT                   /tissue_type="activated spleen"
FT                   /db_xref="taxon:10090"
FT   CDS_pept        121..1881
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="putative"
FT                   /note="villin 2 (MGD|MGI:98931 GB|AK002766, evidence:
FT                   BLASTN, 99%, match=2570)"
FT                   /db_xref="GOA:Q4KML7"
FT                   /db_xref="InterPro:IPR000299"
FT                   /db_xref="InterPro:IPR000798"
FT                   /db_xref="InterPro:IPR008954"
FT                   /db_xref="InterPro:IPR011174"
FT                   /db_xref="InterPro:IPR011259"
FT                   /db_xref="InterPro:IPR011993"
FT                   /db_xref="InterPro:IPR014352"
FT                   /db_xref="InterPro:IPR018979"
FT                   /db_xref="InterPro:IPR018980"
FT                   /db_xref="InterPro:IPR019747"
FT                   /db_xref="InterPro:IPR019748"
FT                   /db_xref="InterPro:IPR019749"
FT                   /db_xref="InterPro:IPR029071"
FT                   /db_xref="InterPro:IPR035963"
FT                   /db_xref="InterPro:IPR041789"
FT                   /db_xref="MGI:MGI:98931"
FT                   /db_xref="UniProtKB/TrEMBL:Q4KML7"
FT                   /protein_id="BAE42954.1"
FT                   /translation="MPKPINVRVTTMDAELEFAIQPNTTGKQLFDQVVKTIGLREVWYF
FT                   GLQYVDNKGFPTWLKLDKKVSAQEVRKENPVQFKFRAKFYPEDVAEELIQDITQKLFFL
FT                   QVKDGILSDEIYCPPETAVLLGSYAVQAKFGDYNKEMHKSGYLSSERLIPQRVMDQHKL
FT                   SRDQWEDRIQVWHAEHRGMLKDSAMLEYLKIAQDLEMYGINYFEIKNKKGTDLWLGVDA
FT                   LGLNIYEKDDKLTPKIGFPWSEIRNISFNDKKFVIKPIDKKAPDFVFYAPRLRINKRIL
FT                   QLCMGNHELYMRRRKPDTIEVQQMKAQAREEKHQKQLERQQLETEKKRRETVEREKEQM
FT                   LREKEELMLRLQDYEQKTKRAEKELSEQIEKALQLEEERRRAQEEAERLEADRMAALRA
FT                   KEELERQAQDQIKSQEQLAAELAEYTAKIALLEEARRRKEDEVEEWQHRAKEAQDDLVK
FT                   TKEELHLVMTAPPPPPPPVYEPVNYHVQEGLQDEGAEPMGYSAELSSEGILDDRNEEKR
FT                   ITEAEKNERVQRQLLTLSNELSQARDENKRTHNDIIHNENMRQGRDKYKTLRQIRQGNT
FT                   KQRIDEFEAM"
FT   regulatory      2555..2560
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      2573
FT                   /note="putative"
XX
SQ   Sequence 2573 BP; 698 A; 646 C; 734 G; 495 T; 0 other;
     gaggggcttc cctccgcctg caggtcccga cgcgtctccg ccgtcgccgt cgccgtcgcc        60
     gtcgtctccg ccgtacagcc gaatagccga ggacccacgc agagccacca accagccaag       120
     atgcccaagc caatcaacgt ccgggtgacc accatggatg ccgagctgga gtttgccatc       180
     cagccaaaca ccaccgggaa gcagctcttt gatcaggtgg taaagacgat tggcctccgg       240
     gaagtgtggt acttcggcct ccagtatgta gacaataaag gatttcctac ctggttgaaa       300
     cttgataaaa aggtctccgc acaggaggtc cgaaaggaga accctgtcca gtttaaattc       360
     cgggccaagt tctaccccga agacgtggcc gaggaactca tccaggacat cacacagaag       420
     ctcttcttcc tgcaagtcaa ggacgggatc ctcagcgacg agatctactg ccccccagag       480
     acagccgtgc tcctgggctc ctatgccgtt caggccaagt tcggagatta taacaaggaa       540
     atgcacaagt ctgggtacct cagctcggag cggctgatcc cccagagagt catggaccaa       600
     cacaagctca gcagggacca gtgggaggac cggatccagg tgtggcacgc ggaacaccga       660
     gggatgctca aggacagtgc tatgctagaa tacctgaaga ttgcccagga cctggaaatg       720
     tatgggatca actatttcga gatcaaaaac aagaaaggaa cagacctttg gcttggagtc       780
     gatgcccttg gacttaacat ttatgagaaa gatgacaagt tgaccccaaa gatcggcttc       840
     ccttggagtg agatcaggaa catctctttc aacgacaaga agtttgtcat taagcccatc       900
     gacaagaagg cacctgactt tgtgttctac gccccgcgcc tgagaattaa caagcggatc       960
     ctgcagctct gcatggggaa ccatgagctg tacatgcgcc gcaggaagcc cgacaccatc      1020
     gaggtgcagc agatgaaggc ccaggctcgg gaggagaagc accagaagca gctagagcga      1080
     cagcagttgg aaaccgagaa gaagaggcga gagacggtgg agagagaaaa ggagcagatg      1140
     ctccgggaga aggaggagct gatgcttcgg ctgcaggact acgagcagaa gaccaagagg      1200
     gcggagaaag agctctccga gcagattgag aaggccctcc aactggagga agagaggagg      1260
     cgagcccagg aggaggctga acgtctggag gccgaccgca tggccgccct gcgggccaag      1320
     gaagaactcg agagacaggc gcaggatcag ataaagagcc aggagcagct ggctgcagag      1380
     ctggcagagt acacggccaa gatcgcactg ctggaggagg cgcggaggcg caaggaggac      1440
     gaggtagaag agtggcagca ccgggctaaa gaagcccagg acgacctggt gaagaccaaa      1500
     gaggagctgc acctggtgat gacggcccca ccgcccccac cgcccccagt gtatgagcct      1560
     gtgaattacc acgtgcagga gggactgcag gacgagggag cagagcctat gggctacagt      1620
     gccgagctct ccagtgaggg catcctggat gaccgcaacg aggagaagcg gatcacagag      1680
     gcagagaaga atgagcgcgt gcagcggcag ctgctgaccc tgagcaatga gttgtcccag      1740
     gcccgggatg agaacaagag gacccacaat gacatcatcc acaacgagaa catgcggcaa      1800
     ggcagggaca agtataagac gctgcggcaa atcaggcagg gcaacaccaa gcaacgcatt      1860
     gacgagttcg aggccatgta gaggccaggc tgggaccaag ggcagagggc acctcactgc      1920
     aggcaggtgt cacacttggc tctttagttc tcttaagttt agacaccccc ttgctgtgtt      1980
     ccagtccctt aaagagcagt tacggggcct gcattctgcc ccgagaccca gtgggctcct      2040
     ccttggttcc ttctaattgt atcacatagt gccaaacagg tcagatttaa tgacagttac      2100
     cgaatcactt cctgtttgga gcagggattc ggagggctgg ccctcatgaa gcaagattca      2160
     ttgtcactgg gacagcactg tggctcacgg gtgccatact tttctctagt tttacaatga      2220
     gctcaaatcg attttgttct tgatttttat gaaggatcca tctctgtgta ttgaggggta      2280
     aaaatgattt tgaaatttga gtctaaagca tgcccccaca gagtctcctt cctccagacc      2340
     gctggcagag tctccagggg tccttcagag tgtacggtac aggactctcc gatacaaaat      2400
     tctcatgctt atctgttagc atacattgtt ggacttatat atctaatgat ctgctataga      2460
     ggatagtatt tatatataag cgataatatg ggtttgtaac attagtttta aaaaagggaa      2520
     agttttgttc tgtatatttt gttacctttt acagaataaa agaattcaac att             2573
//
