(data stored in ACNUC24361 zone)

EMBL: BC030790

ID   BC030790; SV 2; linear; mRNA; STD; HUM; 3126 BP.
XX
AC   BC030790;
XX
DT   05-JUN-2002 (Rel. 72, Created)
DT   15-OCT-2008 (Rel. 97, Last updated, Version 7)
XX
DE   Homo sapiens zinc finger with KRAB and SCAN domains 5, mRNA (cDNA clone
DE   MGC:33710 IMAGE:4827870), complete cds.
XX
KW   MGC.
XX
OS   Homo sapiens (human)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Homo.
XX
RN   [1]
RP   1-3126
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-3126
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (31-MAY-2002) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; e6ce43fe8024dade9b2f858f083b471e.
XX
CC   On Aug 25, 2003 this sequence version replaced gi:21314978.
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
CC   cDNA Library Preparation: Michael J. Brownstein (NHGRI) &  Shiraki
CC   Toshiyuki and Piero Carninci (RIKEN)
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Institute for Systems Biology
CC   http://www.systemsbiology.org
CC   contact: amadan@systemsbiology.org
CC   Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
CC   Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 47 Row: a Column: 9
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 21483181.
CC   Differences found between this sequence and the human reference
CC   genome (build 36) are described in misc_difference features below
CC   and these differences were also compared to chimpanzee genome
CC   (build 1).
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3126
FT                   /organism="Homo sapiens"
FT                   /lab_host="DH10B"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NIH_MGC_97"
FT                   /clone="MGC:33710 IMAGE:4827870"
FT                   /tissue_type="Testis"
FT                   /note="Vector: pBluescriptR"
FT                   /db_xref="taxon:9606"
FT   gene            1..3126
FT                   /gene="ZKSCAN5"
FT   CDS_pept        162..2681
FT                   /codon_start=1
FT                   /gene="ZKSCAN5"
FT                   /product="ZKSCAN5 protein"
FT                   /db_xref="GOA:Q8N718"
FT                   /db_xref="H-InvDB:HIT000041135.17"
FT                   /db_xref="InterPro:IPR001909"
FT                   /db_xref="InterPro:IPR003309"
FT                   /db_xref="InterPro:IPR013087"
FT                   /db_xref="InterPro:IPR036051"
FT                   /db_xref="InterPro:IPR036236"
FT                   /db_xref="InterPro:IPR038269"
FT                   /db_xref="UniProtKB/TrEMBL:Q8N718"
FT                   /protein_id="AAH30790.1"
FT                   /translation="MIMTESREVIDLDPPAETSQEQEDLFIVKVEEEDCTWMQEYNPPT
FT                   FETFYQRFRHFQYHEASGPREALSQLRVLCCEWLRPELHTKEQILELLVLEQFLTILPE
FT                   EFQPWVREHHPESGEEAVAVIENIQRELEERRQQIVACPDVLPRKMATPGAVQESCSPH
FT                   PLTVDTQPEQAPQKPRLLEENALPVLQVPSLPLKDSQELTASLLSTGSQKLVKIEEVAD
FT                   VAVSFILEEWGHLDQSQKSLYRDDRKENYGSITSMGYESRDNMELIVKQISDDSESHWV
FT                   APEHTERSVPQDPDFAEVSDLKGMVQRWQVNPTVGKSRQNPSQKRDLDAITDISPKQST
FT                   HGERGHRCSDCGKFFLQASNFIQHRRIHTGEKPFKCGECGKSYNQRVHLTQHQRVHTGE
FT                   KPYKCQVCGKAFRVSSHLVQHHSVHSGERPYGCNECGKNFGRHSHLIEHLKRHFREKSQ
FT                   RCSDKRSKNTKLSVKKKISEYSEADMELSGKTQRNVSQVQDFGEGCEFQGKLDRKQGIP
FT                   MKEILGQPSSKRMNYSEVPYVHKKSSTGERPHKCNECGKSFIQSAHLIQHQRIHTGEKP
FT                   FRCEECGKSYNQRVHLTQHQRVHTGEKPYTCPLCGKAFRVRSHLVQHQSVHSGERPFKC
FT                   NECGKGFGRRSHLAGHLRLHSREKSHQCRECGEIFFQYVSLIEHQVLHMGQKNEKNGIC
FT                   EEAYSWNLTVIEDKKIELQEQPYQCDICGKAFGYSSDLIQHYRTHTAEKPYQCDICREN
FT                   VGQCSHTKQHQKIYSSTKSHQCHECGRGFTLKSHLNQHQGIHTGEKPFQCKECGMNFSW
FT                   SCSLFKHLRSHERTDPINTLSVEGSLL"
FT   misc_difference 2538
FT                   /gene="ZKSCAN5"
FT                   /note="'G' in cDNA is 'A' in the human genome; amino acid
FT                   difference: 'G' in cDNA, 'R' in the human genome. The
FT                   chimpanzee genome agrees with the human genomic sequence
FT                   and not the cDNA."
FT   misc_difference 3112..3126
FT                   /gene="ZKSCAN5"
FT                   /note="polyA tail: 15 bases do not align to the human
FT                   genome."
XX
SQ   Sequence 3126 BP; 954 A; 711 C; 773 G; 688 T; 0 other;
     gactcgcggg tgtgacgttg aagatgtcgg ccttctgagc cgactgcggt ggtcaagagg        60
     ttgcatgcta ctgaaagtgt ccttcagaag atattaaaga gcagaaaaac aattgtttca       120
     gtgtaacaca gccagcctcg aagacttccc tctgagttgg aatgataatg accgaatccc       180
     gagaagttat agacttagac cccccagctg agacttccca ggagcaggaa gaccttttca       240
     tagtgaaggt ggaagaagaa gactgcacct ggatgcagga gtacaacccg ccaacgtttg       300
     agacttttta ccagcgcttc aggcacttcc agtaccatga ggcttcagga ccccgggagg       360
     ctctcagcca actccgggtg ctctgctgtg agtggctgag gcccgagctg cacacgaagg       420
     agcagatcct ggagctgctg gtgctggagc agttcctgac catcctgcct gaagagttcc       480
     agccctgggt gagggaacat caccctgaaa gtggagaaga ggcggtggcc gtgatagaaa       540
     atatacagcg agaacttgag gaacgcagac agcagattgt tgcctgccct gatgtgcttc       600
     ctcggaagat ggcaacacct ggagcagtgc aggagtcctg cagcccccat cccctgaccg       660
     tggacaccca gcctgagcaa gcgccacaga agcctcgtct cctggaggaa aatgcccttc       720
     ctgttctcca agttccttcc cttcccctga aggacagcca ggagctgaca gcttcacttc       780
     tctcaactgg gtcccagaag ttggtgaaaa ttgaagaggt ggctgatgtg gctgtatcct       840
     tcatcctgga ggaatggggg catttggacc agtcccagaa gtccctttat agggatgaca       900
     ggaaggagaa ctatgggagt attacttcca tgggttatga gtccagggac aatatggagc       960
     tcatagtgaa gcagatttct gatgactctg aatcacactg ggtggcgcca gaacacaccg      1020
     aaaggagcgt tcctcaggat ccagactttg cagaagtcag tgaccttaaa ggcatggtac      1080
     aaaggtggca ggtcaacccc actgtgggga aatcaaggca gaatccttcc cagaaaaggg      1140
     atctggatgc aatcacagac atcagcccta agcaaagcac acatggcgag agagggcaca      1200
     gatgcagcga ttgtggcaaa ttcttcctcc aagcctcaaa ctttattcag catcggcgca      1260
     tccacactgg agaaaaaccg tttaagtgcg gagaatgtgg gaagagctac aatcagcggg      1320
     tgcacctcac ccagcaccag cgcgtccaca caggggagaa accctacaaa tgtcaggtgt      1380
     gcggaaaggc tttccgggtg agttcccacc tggttcagca ccacagtgtc cacagcggag      1440
     agaggcccta tggctgcaat gagtgtggga agaacttcgg tcgccattcg catctgatcg      1500
     aacacctaaa acgccacttc agggagaaat cccagagatg cagtgacaaa agaagtaaga      1560
     acacaaaatt aagtgttaag aagaaaattt cagaatattc agaagcagac atggaactat      1620
     ctggaaaaac ccaaagaaat gtttctcaag ttcaagattt tggagaaggc tgtgagtttc      1680
     aaggcaagct ggatagaaag cagggaattc ccatgaaaga gatactagga caaccatctt      1740
     caaagaggat gaactacagt gaagtcccat atgtccacaa aaaatcctcc actggagaga      1800
     gaccacataa atgtaacgag tgtgggaaaa gcttcattca gagtgcacat cttattcaac      1860
     atcaaagaat acacactggg gagaaaccat tcaggtgtga ggaatgtggg aaaagctaca      1920
     accaacgcgt gcacctaact cagcatcagc gcgtccacac aggtgagaag ccctacacct      1980
     gtcccttatg tgggaaagcc ttcagagtga ggtcccacct tgttcagcat cagagcgtgc      2040
     acagtgggga gagacccttc aagtgtaacg aatgtgggaa aggctttggg aggcgttccc      2100
     acctggctgg acatcttcga ctccactccc gagagaaatc ccatcagtgt cgtgaatgtg      2160
     gggaaatctt ttttcagtac gttagcctaa ttgaacatca ggtgctccac atgggtcaga      2220
     aaaatgaaaa aaatggcatc tgtgaggaag catatagttg gaacttgaca gtgattgaag      2280
     acaagaagat tgagttacaa gagcagcctt atcagtgtga tatctgtgga aaagcctttg      2340
     gttatagctc agacctcatt cagcattaca gaactcatac agcagagaag ccctatcaat      2400
     gtgatatatg tagagaaaat gttggccagt gttcccacac caaacaacat caaaaaatct      2460
     actccagcac aaaatcccat caatgtcatg aatgtggcag aggcttcact ctgaagtcac      2520
     atcttaatca acatcaggga atccatactg gtgagaaacc ttttcaatgt aaagaatgtg      2580
     gaatgaattt cagctggagt tgtagcctct ttaaacacct gagaagccat gagaggacag      2640
     atcccataaa taccttaagt gtagaggggt ctctgttgta gaatagctct taattttaga      2700
     gaaaccttcc tggagggaaa ccatactcct ataatgagca aagtaacaac ttcaagcatt      2760
     tttccagcgt taccatcaaa ctcacaaata ggttgaaatc ctttagttat aactcagcct      2820
     ttaggaacac cggagaaccc acaataatag aaatcttttc gtgttcccca ttgagaaatg      2880
     ctttagttag catcttcatg cttggaaatc tagacaagaa gagaatccat ggatggacat      2940
     ggtcgaggaa ttcggaaagc ctgcagttga cattcagtct tcacttgaaa ctcaaaactg      3000
     acactaggaa cagcttcatg agttcagtag aagtaagctt tatttgtagc ttctgccttg      3060
     tttgacggcg tatctattca gggaagcgca cagtaaaaga attccttagc aaaaaaaaaa      3120
     aaaaaa                                                                 3126
//

If you have problems or comments...

PBIL Back to PBIL home page