(data stored in ACNUC21429 zone)

EMBL: BC039294

ID   BC039294; SV 1; linear; mRNA; STD; HUM; 3885 BP.
XX
AC   BC039294;
XX
DT   07-NOV-2002 (Rel. 73, Created)
DT   15-OCT-2008 (Rel. 97, Last updated, Version 13)
XX
DE   Homo sapiens ubiquilin 1, mRNA (cDNA clone MGC:39500 IMAGE:5263680),
DE   complete cds.
XX
KW   MGC.
XX
OS   Homo sapiens (human)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Homo.
XX
RN   [1]
RP   1-3885
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-3885
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (01-NOV-2002) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; 9e82f2e87027fab48a7a3aec0d06ba03.
DR   Ensembl-Gn; ENSG00000135018; homo_sapiens.
DR   Ensembl-Tr; ENST00000257468; homo_sapiens.
DR   Ensembl-Tr; ENST00000376395; homo_sapiens.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
CC   cDNA Library Preparation: Michael J. Brownstein (NHGRI) &  Shiraki
CC   Toshiyuki and Piero Carninci (RIKEN)
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Sequencing Group at the Stanford Human Genome
CC   Center, Stanford University School of Medicine, Stanford, CA  94305
CC   Web site:       http://www-shgc.stanford.edu
CC   Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
CC   Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
CC   R. M.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 63 Row: e Column: 3
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 16753204.
CC   Differences found between this sequence and the human reference
CC   genome (build 36) are described in misc_difference features below
CC   and these differences were also compared to chimpanzee genomic
CC   sequences available as of 09/15/2004.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3885
FT                   /organism="Homo sapiens"
FT                   /lab_host="DH10B"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NIH_MGC_95"
FT                   /clone="MGC:39500 IMAGE:5263680"
FT                   /tissue_type="Brain, hippocampus"
FT                   /note="Vector: pBluescriptR"
FT                   /db_xref="taxon:9606"
FT   gene            1..3885
FT                   /gene="UBQLN1"
FT                   /note="synonyms: DSK2, PLIC-1, XDRP1, DA41"
FT   CDS_pept        266..2035
FT                   /codon_start=1
FT                   /gene="UBQLN1"
FT                   /product="ubiquilin 1"
FT                   /db_xref="GOA:Q9UMX0"
FT                   /db_xref="H-InvDB:HIT000052187.17"
FT                   /db_xref="HGNC:HGNC:12508"
FT                   /db_xref="InterPro:IPR000626"
FT                   /db_xref="InterPro:IPR006636"
FT                   /db_xref="InterPro:IPR009060"
FT                   /db_xref="InterPro:IPR015496"
FT                   /db_xref="InterPro:IPR015940"
FT                   /db_xref="InterPro:IPR028799"
FT                   /db_xref="InterPro:IPR029071"
FT                   /db_xref="PDB:2JY5"
FT                   /db_xref="PDB:2JY6"
FT                   /db_xref="PDB:2KLC"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9UMX0"
FT                   /protein_id="AAH39294.1"
FT                   /translation="MAESGESGGPPGSQDSAAGAEGAGTPAAAASAEPKIMKVTVKTPK
FT                   EKEEFAVPENSSVQQFKEEISKRFKSHTDQLVLIFAGKILKDQDTLSQHGIHDGLTVHL
FT                   VIKTQNRPQDHSAQQTNTAGSNVTTSSTPNSNSTSGSATSNPFGLGGLGGLAGLSSLGL
FT                   NTTNFSELQSQMQRQLLSNPEMMVQIMENPFVQSMLSNHDLMRQLIMANPQMQQLIQRN
FT                   PEISHMLNNPDIMRQTLELARNPAMMQEMMRNQDRALSNLESIPGGYNALRRMYTDIQE
FT                   PMLSAAQEQFGGNPFASLVSNTSSGEGSQPSRTENRDPLPNPWAPQTSQSSSASSGTAS
FT                   TVGGTTGSTASGTSGQSTTAPNLVPGVGASMFNTPGMQSLLQQITENPQLMQNMLSAPY
FT                   MRSMMQSLSQNPDLAAQMMLNNPLFAGNPQLQEQMRQQLPTFLQQMQNPDTLSAMSNPR
FT                   AMQALLQIQQGLQTLATEAPGLIPGFTPGLGALGSTGGSSGTNGSNATPSENTSPTAGT
FT                   TEPGHQQFIQQMLQALAGVNPQLQNPEVRFQQQLEQLSAMGFLNREANLQALIATGGDI
FT                   NAAIERLLGSQPS"
FT   misc_difference 338
FT                   /gene="UBQLN1"
FT                   /note="'A' in cDNA is 'G' in the human genome; amino acid
FT                   difference: 'T' in cDNA, 'A' in the human genome. The
FT                   chimpanzee genome agrees with the human genomic sequence
FT                   and not the cDNA."
FT   misc_difference 870
FT                   /gene="UBQLN1"
FT                   /note="'A' in cDNA is 'C' in the human genome; amino acid
FT                   difference: 'H' in cDNA, 'P' in the human genome. The
FT                   chimpanzee genome agrees with the human genomic sequence
FT                   and not the cDNA."
FT   misc_difference 1486
FT                   /gene="UBQLN1"
FT                   /note="'G' in cDNA is 'A' in the human genome; no amino
FT                   acid change. The chimpanzee genome agrees with the human
FT                   genomic sequence and not the cDNA."
FT   misc_difference 3859..3885
FT                   /gene="UBQLN1"
FT                   /note="polyA tail: 27 bases do not align to the human
FT                   genome."
XX
SQ   Sequence 3885 BP; 1135 A; 825 C; 845 G; 1080 T; 0 other;
     ggtggctgct gcggatgtcg gtgtgagcga gcggcgcctg aacacacggc ggctgccgag        60
     cgcctgaccc gggcctgcgc cagagcctgc accgagctcc ggggccccac acccgctacg       120
     gtggccctgc gcccgttgct actgaggcgg cgtgctctgc attcttcgct gtccaggcct       180
     gccggctctg gtgtctgctg gctcctcctt gctcgcctgc tccctcctgc ttgcctgagt       240
     caccgccgcc gccgccgcca cagccatggc cgagagtggt gaaagcggcg gtcctccggg       300
     ctcccaggat agcgccgccg gagccgaagg tgctggcacc cccgcggccg ctgcctccgc       360
     ggagcccaaa atcatgaaag tcaccgtgaa gaccccgaag gaaaaggagg aattcgccgt       420
     gcccgagaat agctccgtcc agcagtttaa ggaagaaatc tctaaacgtt ttaaatcaca       480
     tactgaccaa cttgtgttga tatttgctgg aaaaattttg aaagatcaag ataccttgag       540
     tcagcatgga attcatgatg gacttactgt tcaccttgtc attaaaacac aaaacaggcc       600
     tcaggatcat tcagctcagc aaacaaatac agctggaagc aatgttacta catcatcaac       660
     tcctaatagt aactctacat ctggttctgc tactagcaac ccttttggtt taggtggcct       720
     tgggggactt gcaggtctga gtagcttggg tttgaatact accaacttct ctgaactaca       780
     gagtcagatg cagcgacaac ttttgtctaa ccctgaaatg atggtccaga tcatggaaaa       840
     tccctttgtt cagagcatgc tctcaaatca tgacctgatg agacagttaa ttatggccaa       900
     tccacaaatg cagcagttga tacagagaaa tccagaaatt agtcatatgt tgaataatcc       960
     agatataatg agacaaacgt tggaacttgc caggaatcca gcaatgatgc aggagatgat      1020
     gaggaaccag gaccgagctt tgagcaacct agaaagcatc ccagggggat ataatgcttt      1080
     aaggcgcatg tacacagata ttcaggaacc aatgctgagt gctgcacaag agcagtttgg      1140
     tggtaatcca tttgcttcct tggtgagcaa tacatcctct ggtgaaggta gtcaaccttc      1200
     ccgtacagaa aatagagatc cactacccaa tccatgggct ccacagactt cccagagttc      1260
     atcagcttcc agcggcactg ccagcactgt gggtggcact actggtagta ctgccagtgg      1320
     cacttctggg cagagtacta ctgcgccaaa tttggtgcct ggagtaggag ctagtatgtt      1380
     caacacacca ggaatgcaga gcttgttgca acaaataact gaaaacccac aactgatgca      1440
     aaacatgttg tctgccccct acatgagaag catgatgcag tcactgagcc agaatcctga      1500
     ccttgctgca cagatgatgc tgaataatcc cctatttgct ggaaatcctc agcttcaaga      1560
     acaaatgaga caacagctcc caactttcct ccaacaaatg cagaatcctg atacactatc      1620
     agcaatgtca aaccctagag caatgcaggc cttgttacag attcagcagg gtttacagac      1680
     attagcaacg gaagccccgg gcctcatccc agggtttact cctggcttgg gggcattagg      1740
     aagcactgga ggctcttcgg gaactaatgg atctaacgcc acacctagtg aaaacacaag      1800
     tcccacagca ggaaccactg aacctggaca tcagcagttt attcagcaga tgctgcaggc      1860
     tcttgctgga gtaaatcctc agctacagaa tccagaagtc agatttcagc aacaactgga      1920
     acaactcagt gcaatgggat ttttgaaccg tgaagcaaac ttgcaagctc taatagcaac      1980
     aggaggtgat atcaatgcag ctattgaaag gttactgggc tcccagccat catagcagca      2040
     tttctgtatc ttgaaaaaat gtaatttatt tttgataacg gctcttaaac tttaaaatac      2100
     ctgctttatt tcattttgac tcttggaatt ctgtgctgtt ataaacaaac ccaatatgat      2160
     gcattttaag gtggagtaca gtaagatgtg tgggtttttc tgtatttttc ttttctggaa      2220
     cagtgggaat taaggctact gcatgcatca cttctgcatt tattgtaatt ttttaaaaac      2280
     atcacctttt atagttgggt gaccagattt tgtcctgcat ctgtccagtt tatttgcttt      2340
     ttaaacatta gcctatggta gtaatttatg tagaataaaa gcattaaaaa gaagcaaatc      2400
     atttgcactc tataatttgt ggtacagtat tgcttattgt gactttggca tgcatttttg      2460
     caaacaatgc tgtaagattt atactactga taattttgtt ttatttgtat acaatataga      2520
     gtatgcacat ttgggactgc atttctggaa acatactgca ataggctctc tgagcaaaac      2580
     acctgtaact aaaaaagtga agataagaaa atactcttaa agctgagtat ttcctaattg      2640
     tatagaatct tacagcatct ttgacaaaca tctcccagca aaagtgccgg ttagtcaggt      2700
     ttgttgaaaa tacagtagaa aagctgattc tggttatctc tttaaggaca attaattgta      2760
     cagacacata atgtaacatt gtctcaacat tcattcacag attgactgta aattacctta      2820
     atctttgtgc agactgaagg aacactgtag tataccccaa agtgcatttg cctaggactt      2880
     ctcagcttct cccataggta gtttaacagg cattaaaatt tgtaattgaa atgttgcttt      2940
     cactgaaaaa gtgtcttgat gtttcagtta tttttaatcg ccataaaaaa atagaactat      3000
     cttttgggtt tatctgtttt ctcatgcaca ggcaatacac aaatttaaaa tgagttgtga      3060
     gccaattgtt tctgaagtgt tttggtagtt ctattaagaa atagttaaat attgtgcttt      3120
     tcagagcctc agagaaaggg ggacggggtg ggggggtggg gcagcggaat ctgtcctgga      3180
     tggggccagc ttaaataata ctggcaacca agattctgtt aggatttctg tgcatatagt      3240
     gtagtaaaga agtatcattc aggggtgaaa aacaaagagc cgttttaatg atgttgagta      3300
     catttggctg ttttatagcc tttttcttcc ctcccccaaa gaattctgtt tgcctaactc      3360
     ccaaactgtt ggggtggtac attcctttag gaccaattaa aacataattg agggtcagtg      3420
     atacatttgg ctgactctgg ttcagtattc tcttaggtga ttatattctc tcatgtacag      3480
     ttacaggaaa ttaaaatgtt aaagtaacct aaaatgaatt cagaccaata aaatcaaggg      3540
     aaatacaagt tgattgcatt acttctgtat gttgcttgct attaaaaagg ttaagaggcc      3600
     aggttaccca ccagtccttg cactgttctg acactttccc caggaggaaa acaagtacaa      3660
     aggttacggt ggaggcataa gtagaagaga ttgttaagaa gggtattcat gtgtctttgc      3720
     tctttctgct ttatgcctca gtttggttta aaaacttctg tactggcaaa tggtggtatt      3780
     cagtgtggga tagtgtcata actaatttga caatttatta atcataaaat aacaataaat      3840
     ctctagcttt tacacttgaa aaaaaaaaga aaaaaaaaaa aaaaa                      3885
//

If you have problems or comments...

PBIL Back to PBIL home page