(data stored in SCRATCH zone)

EMBL: BC062539

ID   BC062539; SV 1; linear; mRNA; STD; HUM; 2891 BP.
XX
AC   BC062539;
XX
DT   26-NOV-2003 (Rel. 77, Created)
DT   15-OCT-2008 (Rel. 97, Last updated, Version 13)
XX
DE   Homo sapiens Sp1 transcription factor, mRNA (cDNA clone MGC:71333
DE   IMAGE:5928633), complete cds.
XX
KW   MGC.
XX
OS   Homo sapiens (human)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Homo.
XX
RN   [1]
RP   1-2891
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-2891
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (24-NOV-2003) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; b719f3cbb9d717b29be2f9ff453161a1.
DR   Ensembl-Gn; ENSG00000185591; homo_sapiens.
DR   Ensembl-Tr; ENST00000327443; homo_sapiens.
DR   Ensembl-Tr; ENST00000426431; homo_sapiens.
DR   EuropePMC; PMC4139505; 24881871.
DR   EuropePMC; PMC4202103; 24948597.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: ATCC
CC   cDNA Library Preparation: Rubin Laboratory
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: National Institutes of Health Intramural
CC   Sequencing Center (NISC),
CC   Gaithersburg, Maryland;
CC   Web site: http://www.nisc.nih.gov/
CC   Contact: nisc_mgc@nhgri.nih.gov
CC   Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
CC   Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
CC   Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
CC   Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
CC   Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
CC   McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
CC   Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
CC   Young,A., Zhang,L.-H. and Green,E.D.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAL Plate: 50 Row: j Column: 2
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 38372900.
CC   Differences found between this sequence and the human reference
CC   genome (build 36) are described in misc_difference features below.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2891
FT                   /organism="Homo sapiens"
FT                   /lab_host="DH10B-R"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NIH_MGC_47"
FT                   /clone="MGC:71333 IMAGE:5928633"
FT                   /tissue_type="Brain, neuroblastoma"
FT                   /note="Vector: pOTB7"
FT                   /db_xref="taxon:9606"
FT   gene            1..2891
FT                   /gene="SP1"
FT   CDS_pept        99..2456
FT                   /codon_start=1
FT                   /gene="SP1"
FT                   /product="Sp1 transcription factor"
FT                   /db_xref="GOA:P08047"
FT                   /db_xref="H-InvDB:HIT000260452.14"
FT                   /db_xref="HGNC:HGNC:11205"
FT                   /db_xref="InterPro:IPR013087"
FT                   /db_xref="InterPro:IPR036236"
FT                   /db_xref="PDB:1SP1"
FT                   /db_xref="PDB:1SP2"
FT                   /db_xref="PDB:1VA1"
FT                   /db_xref="PDB:1VA2"
FT                   /db_xref="PDB:1VA3"
FT                   /db_xref="UniProtKB/Swiss-Prot:P08047"
FT                   /protein_id="AAH62539.1"
FT                   /translation="MSDQDHSMDEMTAVVKIEKGVGGNNGGNGNGGGAFSQARSSSTGS
FT                   SSSTGGGGQESQPSPLALLAATCSRIESPNENSNNSQGPSQSGGTGELDLTATQLSQGA
FT                   NGWQIISSSSGATPTSKEQSGSSTNGSNGSESSKNRTVSGGQYVVAAAPNLQNQQVLTG
FT                   LPGVMPNIQYQVIPQFQTVDGQQLQFAATGAQVQQDGSGQIQIIPGANQQIITNRGSGG
FT                   NIIAAMPNLLQQAVPLQGLANNVLSGQTQYVTNVPVALNGNITLLPVNSVSAATLTPSS
FT                   QAVTISSSGSQESGSQPVTSGTTISSASLVSSQASSSSFFTNANSYSTTTTTSNMGIMN
FT                   FTTSGSSGTNSQGQTPQRVSGLQGSDALNIQQNQTSGGSLQAGQQKEGEQNQQTQQQQI
FT                   LIQPQLVQGGQALQALQAAPLSGQTFTTQAISQETLQNLQLQAVPNSGPIIIRTPTVGP
FT                   NGQVSWQTLQLQNLQVQNPQAQTITLAPMQGVSLGQTSSSNTTLTPIASAASIPAGTVT
FT                   VNAAQLSSMPGLQTINLSALGTSGIQVHPIQGLPLAIANAPGDHGAQLGLHGAGGDGIH
FT                   DDTAGGEEGENSPDAQPQAGRRTRREACTCPYCKDSEGRGSGDPGKKKQHICHIQGCGK
FT                   VYGKTSHLRAHLRWHTGERPFMCTWSYCGKRFTRSDELQRHKRTHTGEKKFACPECPKR
FT                   FMRSDHLSKHIKTHQNKKGGPGVALSVGTLPLDSGAGSEGSGTATPSALITTNMVAMEA
FT                   ICPEGIARLANSGINVMQVADLQSINISGNGF"
FT   misc_difference 2794^2795
FT                   /gene="SP1"
FT                   /note="1 base in the human genome, T, is not found in
FT                   cDNA."
FT   misc_difference 2855..2891
FT                   /gene="SP1"
FT                   /note="polyA tail: 37 bases do not align to the human
FT                   genome."
XX
SQ   Sequence 2891 BP; 790 A; 805 C; 709 G; 587 T; 0 other;
     ggtccgggtt cgcttgcctc gtcagcgtcc gcgtttttcc cggccccccc caaccccccc        60
     ggacaggacc cccttgagct tgtccctcag ctgccaccat gagcgaccaa gatcactcca       120
     tggatgaaat gacagctgtg gtgaaaattg aaaaaggagt tggtggcaat aatgggggca       180
     atggtaatgg tggtggtgcc ttttcacagg ctcgaagtag cagcacaggc agtagcagca       240
     gcactggagg aggagggcag gagtcccagc catccccttt ggctctgctg gcagcaactt       300
     gcagcagaat tgagtcaccc aatgagaaca gcaacaactc ccagggcccg agtcagtcag       360
     ggggaacagg tgagcttgac ctcacagcca cacaactttc acagggtgcc aatggctggc       420
     agatcatctc ttcctcctct ggggctaccc ctacctcaaa ggaacagagt ggcagcagta       480
     ccaatggcag caatggcagt gagtcttcca agaatcgcac agtctctggt gggcagtatg       540
     ttgtggctgc cgctcccaac ttacagaacc agcaagttct gacaggacta cctggagtga       600
     tgcctaatat tcagtatcaa gtaatcccac agttccagac cgttgatggg caacagctgc       660
     agtttgctgc cactggggcc caagtgcagc aggatggttc tggtcaaata cagatcatac       720
     caggtgcaaa ccaacagatt atcacaaatc gaggaagtgg aggcaacatc attgctgcta       780
     tgccaaacct actccagcag gctgtccccc tccaaggcct ggctaataat gtactctcag       840
     gacagactca gtatgtgacc aatgtaccag tggccctgaa tgggaacatc accttgctac       900
     ctgtcaacag cgtttctgca gctaccttga ctcccagctc tcaggcagtc acgatcagca       960
     gctctgggtc ccaggagagt ggctcacagc ctgtcacctc agggactacc atcagttctg      1020
     ccagcttggt atcatcacaa gccagttcca gctccttttt caccaatgcc aatagctact      1080
     caactactac taccaccagc aacatgggaa ttatgaactt tactaccagt ggatcatcag      1140
     ggaccaactc tcaaggccag acaccccaga gggtcagtgg gctacagggg tctgatgctc      1200
     tgaacatcca gcaaaaccag acatctggag gctcattgca agcaggccag caaaaagaag      1260
     gagagcaaaa ccagcagaca cagcagcaac aaattcttat ccagcctcag ctagttcaag      1320
     ggggacaggc cctccaggcc ctccaagcag caccattgtc agggcagacc tttacaactc      1380
     aagccatctc ccaggaaacc ctccagaacc tccagcttca ggctgttcca aactctggtc      1440
     ccatcatcat ccggacacca acagtggggc ccaatggaca ggtcagttgg cagactctac      1500
     agctgcagaa cctccaagtt cagaacccac aagcccaaac aatcacctta gccccaatgc      1560
     agggtgtttc cttggggcag accagcagca gcaacaccac tctcacaccc attgcctcag      1620
     ctgcttccat tcctgctggc acagtcactg tgaatgctgc tcaactctcc tccatgccag      1680
     gcctccagac cattaacctc agtgcattgg gtacttcagg aatccaggtg cacccaattc      1740
     aaggcctgcc gttggctata gcaaatgccc caggtgatca tggagctcag cttggtctcc      1800
     atggggctgg tggtgatgga atacatgatg acacagcagg tggagaggaa ggagaaaaca      1860
     gcccagatgc ccaaccccaa gccggtcgga ggacccggcg ggaagcatgc acctgcccct      1920
     actgtaaaga cagtgaagga aggggctcgg gggatcctgg caaaaagaaa cagcatattt      1980
     gccacatcca aggctgtggg aaagtgtatg gcaagacctc tcacctgcgg gcacacttgc      2040
     gctggcatac aggcgagagg ccatttatgt gtacctggtc atactgtggg aaacgcttca      2100
     cacgttcgga tgagctacag aggcacaaac gtacacacac aggtgagaag aaatttgcct      2160
     gccctgagtg tcctaagcgc ttcatgagga gtgaccacct gtcaaaacat atcaagaccc      2220
     accagaataa gaagggaggc ccaggtgtag ctctgagtgt gggcactttg cccctggaca      2280
     gtggggcagg ttcagaaggc agtggcactg ccactccttc agcccttatt accaccaata      2340
     tggtagccat ggaggccatc tgtccagagg gcattgcccg tcttgccaac agtggcatca      2400
     acgtcatgca ggtggcagat ctgcagtcca ttaatatcag tggcaatggc ttctgagatc      2460
     aggcacccgg ggccagagac atatgggcca taccccttaa ccccgggatg caaggtagca      2520
     tgggtccaag agacatggaa gagagagcca tgaagcatta aaatgcatgg tgttgagaag      2580
     aatcaggaga gggatacaag agaggagatg gggtcccggc acccatctgt atcatcagtg      2640
     cctctttgaa ggtgggaaac attagtgaaa attctgttgg tgccacgctt tgatgagcat      2700
     ttgtttgacc ccagtttctt cttacacttc ttaccccagc ctacccttcc tgcatttctc      2760
     ttctcagctc ttccatgatg gattcccccc ccttcctaaa gccatcatgc cttgataaat      2820
     atatatgatc attgaaatac tttttaacaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa      2880
     aaaaaaaaaa a                                                           2891
//

If you have problems or comments...

PBIL Back to PBIL home page