(data stored in ACNUC9435 zone)

EMBL: BC053867

ID   BC053867; SV 1; linear; mRNA; STD; HUM; 2448 BP.
XX
AC   BC053867;
XX
DT   19-JUN-2003 (Rel. 76, Created)
DT   15-OCT-2008 (Rel. 97, Last updated, Version 5)
XX
DE   Homo sapiens SET binding factor 2, mRNA (cDNA clone IMAGE:5198438), partial
DE   cds.
XX
KW   .
XX
OS   Homo sapiens (human)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Homo.
XX
RN   [1]
RP   1-2448
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-2448
RA   Director MGC Project;
RT   ;
RL   Submitted (13-JUN-2003) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Cancer
RL   Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03,
RL   Bethesda, MD 20892-2590, USA
XX
DR   MD5; 32d9b18d91767f77cdfba8d285a601e4.
DR   Ensembl-Gn; ENSG00000133812; homo_sapiens.
DR   Ensembl-Tr; ENST00000256190; homo_sapiens.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Life Technologies, Inc.
CC   cDNA Library Preparation: Life Technologies, Inc.
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: National Institutes of Health Intramural
CC   Sequencing Center (NISC),
CC   Gaithersburg, Maryland;
CC   Web site: http://www.nisc.nih.gov/
CC   Contact: nisc_mgc@nhgri.nih.gov
CC   Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
CC   Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
CC   Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
CC   Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
CC   Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
CC   McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
CC   Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
CC   Young,A., Zhang,L.-H. and Green,E.D.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 115 Row: d Column: 18
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: GenomeScan gene
CC   prediction.
CC   Differences found between this sequence and the human genome (build
CC   35) are described in misc_difference features below and these
CC   differences were also compared to chimpanzee genomic seqeunces
CC   available as of Sep 03, 03.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2448
FT                   /organism="Homo sapiens"
FT                   /lab_host="DH10B"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NIH_MGC_114"
FT                   /clone="IMAGE:5198438"
FT                   /tissue_type="Brain, adult, 6 pooled whole brains"
FT                   /note="Vector: pCMV-SPORT6"
FT                   /db_xref="taxon:9606"
FT   gene            <1..2448
FT                   /gene="SBF2"
FT                   /note="synonyms: MTMR13, KIAA1766, CMT4B2"
FT   CDS_pept        <1..766
FT                   /codon_start=2
FT                   /gene="SBF2"
FT                   /product="SBF2 protein"
FT                   /db_xref="GOA:Q86WG5"
FT                   /db_xref="H-InvDB:HIT000259356.13"
FT                   /db_xref="HGNC:HGNC:2135"
FT                   /db_xref="InterPro:IPR001194"
FT                   /db_xref="InterPro:IPR001849"
FT                   /db_xref="InterPro:IPR004182"
FT                   /db_xref="InterPro:IPR005112"
FT                   /db_xref="InterPro:IPR005113"
FT                   /db_xref="InterPro:IPR010569"
FT                   /db_xref="InterPro:IPR011993"
FT                   /db_xref="InterPro:IPR022096"
FT                   /db_xref="InterPro:IPR029021"
FT                   /db_xref="InterPro:IPR030564"
FT                   /db_xref="InterPro:IPR030567"
FT                   /db_xref="InterPro:IPR037516"
FT                   /db_xref="InterPro:IPR037823"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q86WG5"
FT                   /protein_id="AAH53867.1"
FT                   /translation="WMMLTPKHFPSEDSDLAGEAGPRSQRRTVWPCYDDVSCTQPDALT
FT                   SLFSEIEKLEHKLNQAPEKWQQLWERVTVDLKEEPRTDRSQRHLSRSPGIVSTNLPSYQ
FT                   KRSLLHLPDSSMGEEQNSSISPSNGVERRAATLYSQYTSKNDENRSFEGTLYKRGALLK
FT                   GWKPRWFVLDVTKHQLRYYDSGEDTSCKGHIDLAEVEMVIPAGPSMGAPKHTSDKAFFD
FT                   LKTSKRVYNFCAQDGQSAQQWMDKIQSCISDA"
FT   misc_difference 1280
FT                   /gene="SBF2"
FT                   /note="'T' in cDNA is 'C' in the human genome. The
FT                   chimpanzee genome agrees with the human genomic sequence
FT                   and not the cDNA."
FT   misc_difference 1910
FT                   /gene="SBF2"
FT                   /note="'G' in cDNA is 'A' in the human genome. The
FT                   chimpanzee genome agrees with the human genomic sequence
FT                   and not the cDNA."
FT   misc_difference 2130
FT                   /gene="SBF2"
FT                   /note="'T' in cDNA is 'C' in the human genome."
FT   misc_difference 2281
FT                   /gene="SBF2"
FT                   /note="'A' in cDNA is 'G' in the human genome. The
FT                   chimpanzee genome agrees with the human genomic sequence
FT                   and not the cDNA."
FT   misc_difference 2392..2448
FT                   /gene="SBF2"
FT                   /note="polyA tail: 57 bases do not align to the human
FT                   genome."
XX
SQ   Sequence 2448 BP; 783 A; 522 C; 485 G; 658 T; 0 other;
     ctggatgatg ctaaccccca agcacttccc ctccgaagac tctgacctgg ctggagaagc        60
     tgggccacgg agccagagga gaacagtgtg gccatgctat gatgatgtca gctgtactca       120
     gcctgatgct ctcaccagcc ttttcagtga aattgaaaaa ttggagcaca aattgaacca       180
     agcccctgag aagtggcagc agctgtggga aagggtaacc gtggacctta aagaagaacc       240
     aagaacagat cgctcccaaa gacacctgtc gagatcccca ggaattgtgt ctaccaacct       300
     accttcctat cagaagaggt ctctgctaca tctcccagac agcagcatgg gggaggaaca       360
     gaattccagc atctccccat ccaatggagt ggagcgaaga gcagccacgc tctatagcca       420
     gtatacatcc aagaatgatg aaaacaggtc ctttgaggga acactttata aaagaggggc       480
     tttgctgaaa ggttggaagc cccgttggtt tgttttggat gtaacaaaac atcagctgcg       540
     ctactatgac tcaggtgagg acacaagctg taaaggccac attgatctgg ctgaagtaga       600
     aatggtcatc cctgctggcc ccagcatggg agccccaaag cacacaagtg acaaggcttt       660
     ctttgatctc aagaccagca aacgtgtgta taacttctgc gcccaggatg gacagagtgc       720
     ccagcaatgg atggacaaga tccagagttg tatctctgat gcctgatgcc catggtcaac       780
     ccacgcagaa gaaacagaag aactcatgct gccagataga tagaaaaaga agcatggatc       840
     cttgaggagc tgacaacaag ttatcccagg gcctgaggtt ctcctgccca gtcccctctt       900
     gcaggggttg ctatatctac ttaacctgaa taggtgtttc acacaggtct ggtcaacagc       960
     cccatgcact ccctgtatct tgcactaaat ttttctaaca gggtcttagt ggttaatgat      1020
     cagaagatgt ctcctgagcc aactgtgaac ctcacccagg caaaatggct accacctact      1080
     tgggtccttc ttcatgaaag ctatagatcc ttttttgttc tctgaggtca taatttcctc      1140
     ggagacctgt ttaacaagca aaaatcaaaa ccctccaaga ttgtctcata ttctacctgg      1200
     actaggtttc ctatgagaga catctacttg taatgcctga cctttgagat gctcagttct      1260
     ctggtgctgc caaaagatgt ttccatggtc cgtgctctgc cagtggggtt cacaacaaga      1320
     gacgtcattg ttcagtagca ggcaaagagg gagcacacag cattattctg atggaaaaag      1380
     attatccagg gaatggtaca acaatgacca gcccaatgca ggaaaacact acttccaaaa      1440
     cactgaattc tctagaccag aggtgctctg aggatccagg gccttgtgtt cttatgtatc      1500
     ttctgcttcc tgacagcttc tttttcaaaa taacatgcaa aaaaagctga atgcactaac      1560
     tcacaaaaca aacacttgca ctgaattccc aatgaagtga agatgttgga aagacagagg      1620
     ccagctattt aggaccatac gcacctgtga caagggctgt gttgaccaca gtcacactgt      1680
     ggcatgactg gatacccaaa ctacacttct acacatgaaa agtaagaact gtctttagat      1740
     tttctttact ttgataactt gtgattgttt agcttaagac ccaagaaatg ctgtttgctc      1800
     atggtaaaca gaaacagcat cttcgctaca accactgaca ccagctggcg tcataggtag      1860
     ctagatcatt gcatttgttt tgaaatgtta atatgttaaa tactaaactg atatttcaaa      1920
     aatgtgtata tatgatttct atatccttgt ttttcagata gcctgcttat aatttaatat      1980
     aaattaactg atgcattcat aagatttcaa taatgaaatg gttccctttt aaaaaataaa      2040
     aatactttgt agattaaaaa taaatcagaa tttcaaattt aaaattgtcc acacactagg      2100
     aaatagaact gtgttaatat ataagaaatt tggggataat taagaatgaa ggacttttct      2160
     atcatttcca ttttataaat tgccacctgt gaaaatggtt tttgcacatt atttgtattt      2220
     ttctttgtat atgaaataat tttttgtact ttgtaaaata tggagcccat tgtaccttca      2280
     actatttgag actatacaca gtgcttcttt tgtaactgga ttactttaac tttcgtgaag      2340
     gcattacatt gcctcacatt cactaaccac cttgaattaa atttatttct taaaaaaaaa      2400
     aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa                   2448
//

If you have problems or comments...

PBIL Back to PBIL home page