EMBL: BC069237

ID   BC069237; SV 1; linear; mRNA; STD; HUM; 3244 BP.
XX
AC   BC069237;
XX
DT   28-APR-2004 (Rel. 79, Created)
DT   27-MAY-2005 (Rel. 83, Last updated, Version 2)
XX
DE   Homo sapiens ubiquilin 2, mRNA (cDNA clone MGC:78469 IMAGE:4543266),
DE   complete cds.
XX
KW   MGC.
XX
OS   Homo sapiens (human)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Homo.
XX
RN   [1]
RP   1-3244
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-3244
RA   Strausberg R.;
RT   ;
RL   Submitted (26-APR-2004) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Cancer
RL   Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03,
RL   Bethesda, MD 20892-2590, USA
XX
DR   MD5; dae85affcd4d88974323b4ec17ae7a12.
DR   Ensembl-Gn; ENSG00000188021; homo_sapiens.
DR   Ensembl-Tr; ENST00000338222; homo_sapiens.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: ATCC
CC   cDNA Library Preparation: Rubin Laboratory
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Genome Sequence Centre,
CC   BC Cancer Agency, Vancouver, BC, Canada
CC   info@bcgsc.bc.ca
CC   Steve Jones, Sarah Barber, Mabel Brown-John, Yaron Butterfield,
CC   Andy Chan, Steve S. Chand, William Chow, Alison Cloutier, Ruth
CC   Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
CC   Kim MacDonald, Amara Masson, Mike R. Mayo, Josh Moran, Ryan Morin,
CC   Teika Olson, Diana Palmquist, Anca Petrescu, Anna Liisa Prahbu,
CC   Parvaneh Saeedi, JR Santos, Angelique Schnerch, Ursula Skalska,
CC   Duane Smailus, Jeff Stott, Miranda Tsai, George Yang, Jacquie
CC   Schein, Asim Siddiqui, Rob Holt, Marco Marra.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAL Plate: 54 Row: c Column: 4
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 16753206.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3244
FT                   /organism="Homo sapiens"
FT                   /lab_host="DH10B-R"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NIH_MGC_21"
FT                   /clone="MGC:78469 IMAGE:4543266"
FT                   /tissue_type="Placenta, choriocarcinoma"
FT                   /note="Vector: pOTB7"
FT                   /db_xref="taxon:9606"
FT   gene            1..3244
FT                   /gene="UBQLN2"
FT                   /note="synonyms: CHAP1/DSK2, HRIHFB2157, PLIC-2, N4BP4,
FT                   Dsk2, LIC-2, PLIC2, CHAP1"
FT   CDS_pept        153..2027
FT                   /codon_start=1
FT                   /gene="UBQLN2"
FT                   /product="UBQLN2 protein"
FT                   /db_xref="GOA:Q9UHD9"
FT                   /db_xref="H-InvDB:HIT000263391.14"
FT                   /db_xref="HGNC:HGNC:12509"
FT                   /db_xref="InterPro:IPR000626"
FT                   /db_xref="InterPro:IPR006636"
FT                   /db_xref="InterPro:IPR009060"
FT                   /db_xref="InterPro:IPR015496"
FT                   /db_xref="InterPro:IPR015940"
FT                   /db_xref="InterPro:IPR016024"
FT                   /db_xref="InterPro:IPR028430"
FT                   /db_xref="InterPro:IPR029071"
FT                   /db_xref="InterPro:IPR041243"
FT                   /db_xref="PDB:1J8C"
FT                   /db_xref="PDB:2NBV"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9UHD9"
FT                   /protein_id="AAH69237.1"
FT                   /translation="MAENGESSGPPRPSRGPAAAQGSAAAPAEPKIIKVTVKTPKEKEE
FT                   FAVPENSSVQQFKEAISKRFKSQTDQLVLIFAGKILKDQDTLIQHGIHDGLTVHLVIKS
FT                   QNRPQGQSTQPSNAAGTNTTSASTPRSNSTPISTNSNPFGLGSLGGLAGLSSLGLSSTN
FT                   FSELQSQMQQQLMASPEMMIQIMENPFVQSMLSNPDLMRQLIMANPQMQQLIQRNPEIS
FT                   HLLNNPDIMRQTLEIARNPAMMQEMMRNQDLALSNLESIPGGYNALRRMYTDIQEPMLN
FT                   AAQEQFGGNPFASVGSSSSSGEGTQPSRTENRDPLPNPWAPPPATQSSATTSTTTSTGS
FT                   GSGNSSSNATGNTVAAANYVASIFSTPGMQSLLQQITENPQLIQNMLSAPYMRSMMQSL
FT                   SQNPDLAAQMMLNSPLFTANPQLQEQMRPQLPAFLQQMQNPDTLSAMSNPRAMQALMQI
FT                   QQGLQTLATEAPGLIPSFTPGVGVGVLGTAIGPVGPVTPIGPIGPIVPFTPIGPIGPIG
FT                   PTGPAAPPGSTGSGGPTGPTVSSAAPSETTSPTSESGPNQQFIQQMVQALAGANAPQLP
FT                   NPEVRFQQQLEQLNAMGFLNREANLQALIATGGDINAAIERLLGSQPS"
FT   misc_feature    285..461
FT                   /gene="UBQLN2"
FT                   /note="ubiquitin; Region: Ubiquitin family. This family
FT                   contains a number of ubiquitin-like proteins: SUMO (smt3
FT                   homologue), Nedd8, Elongin B, Rub1"
FT   misc_feature    1899..2012
FT                   /gene="UBQLN2"
FT                   /note="UBA; Region: Ubiquitin Associated domain. The UBA
FT                   domain is a commonly occurring sequence motif in some
FT                   members of the ubiquitination pathway, UV excision repair
FT                   proteins, and certain protein kinases. Although its
FT                   specific role is so far unknown, it has been suggested that
FT                   UBA domains are involved in conferring protein target
FT                   specificity. The domain, a compact three helix bundle, has
FT                   a conserved GFP-loop and the proline is thought to be
FT                   critical for binding. The UBA domain is distinct from the
FT                   conserved three helical domain seen in the N-terminus of
FT                   EF-TS and eukaryotic NAC proteins"
XX
SQ   Sequence 3244 BP; 879 A; 831 C; 698 G; 836 T; 0 other;
     agagttgctg ggagtgcgcg cggtcggatc acaaggcggc ggcggaggag gcccagcccg        60
     ctgcggcggt gcctccttcc ttcctccttc cctcgcgctc tctctttcgc ccgcccgcgc       120
     cttccctgcc cgcctgcgtc accgcggccg ccatggctga gaatggcgag agcagcggcc       180
     ccccgcgccc ctcccgcggc cctgctgcgg cccaaggctc ggctgctgcc ccggctgagc       240
     ctaaaatcat caaagtcacg gtgaagactc ccaaagagaa agaggagttc gcggtgcccg       300
     agaacagctc ggttcagcag tttaaggaag cgatttcgaa acgcttcaaa tcccaaaccg       360
     atcagctagt gctgattttt gccggaaaaa tcttaaaaga tcaagatacc ttgatccagc       420
     atggcatcca tgatgggctg actgttcacc ttgtcatcaa aagccagaac cgacctcagg       480
     gccagtccac gcagcctagc aatgccgcgg gaactaacac tacctcggcg tcgactccca       540
     ggagtaactc cacacctatt tccacaaata gcaacccgtt tgggttgggg agcctgggag       600
     gacttgcagg ccttagcagc ctgggcttga gctcgaccaa cttctctgag ctccagagcc       660
     agatgcagca gcagcttatg gccagccctg agatgatgat ccaaataatg gaaaatccct       720
     ttgttcagag catgctttcg aatcccgatc tgatgaggca gctcattatg gctaatccac       780
     agatgcagca attgattcag agaaacccag aaatcagtca cctgctcaac aacccagaca       840
     taatgaggca gacactcgaa attgccagga atccagccat gatgcaagag atgatgagaa       900
     atcaagacct ggctcttagc aatctagaaa gcatcccagg tggctataat gctttacggc       960
     gcatgtacac tgacattcaa gagccgatgc tgaatgccgc acaagagcag tttgggggta      1020
     atccatttgc ctccgtgggg agtagttcct cctctgggga aggtacgcag ccttcccgca      1080
     cagaaaatcg cgatccacta cccaatccat gggcaccacc gccagctacc cagagttctg      1140
     caactaccag cacgaccaca agcactggta gtgggtctgg caatagttcc agcaatgcta      1200
     ctgggaacac cgttgctgcc gctaattatg tcgccagcat ctttagtacc ccaggcatgc      1260
     agagcctgct gcaacagata actgaaaacc cccagctgat tcagaatatg ctgtcggcgc      1320
     cctacatgag aagcatgatg cagtcgctga gccagaatcc agatttggct gcacagatga      1380
     tgctgaatag cccgctgttt actgcaaatc ctcagctgca ggagcagatg cggccacagc      1440
     tcccagcctt cctgcagcag atgcagaatc cagacacact atcagccatg tcaaacccaa      1500
     gagcaatgca ggctttaatg cagatccagc aggggctaca gacattagcc actgaagcac      1560
     ctggcctgat tccgagcttc actccaggtg tgggggtggg ggtgctggga accgctatag      1620
     gccctgtagg cccagtcacc cccataggcc ccataggccc tatagtccct tttaccccca      1680
     taggccccat tgggcccata ggacccactg gccctgcagc cccccctggc tccaccggct      1740
     ctggtggccc cacggggcct actgtgtcca gcgctgcacc tagtgaaacc acgagtccta      1800
     catcagaatc tggacccaac cagcagttca ttcagcaaat ggtgcaggcc ctggctggag      1860
     caaatgctcc acagctgccg aatccagaag tcagatttca gcaacaactg gaacagctca      1920
     acgcaatggg gttcttaaac cgtgaagcaa acttgcaggc cctaatagca acaggaggcg      1980
     acatcaatgc agccattgaa aggctgctgg gctcccagcc atcgtaatca catttctgta      2040
     cctggaaaaa aaatgtatct tatttttgat aatggctctt aaatctttaa acacacacac      2100
     aaaatcgttc tttactttca ttttgattct tttaaatctg tctagttgta agtctaatat      2160
     gatgcatttt aagatggagt ccctccctcc tacttccctc actccctttc tcctttgctt      2220
     atttttccta ccttcccttc ctcttgtctc cccactccct ccctctttgt ttccttcctt      2280
     ccttatttcc tttagtttcc ttccttagcc gttttgagtg gtgggaatca atgctgtttc      2340
     actcaaaagt gttgcatgca aacacttctc tttattctgc atttattgtg atttttggaa      2400
     acaggtatca accttcacag ttgggtgaac aagtgttgtc ctacagatgt ccaatttatt      2460
     tgcattttta aacattagcc tatgatagta atttaatgta gaatgaagat attaaaaaca      2520
     gaagcaaatt atttgaagct ctctaatttg tggtacgata ttgcttattg tgactttggc      2580
     atgtattttt gctagcaaaa tgctgtaaga tttataccat tgatcttttt tgctatattt      2640
     gtatacagta cagtaagcac aattggcact gtacatctaa aaatattaca gtagaatctg      2700
     agtgtaatat gtgtaaccaa aatgagaaag aatacaagaa atgtttctgg agctagttat      2760
     gtctcacaat tttgtagaat cttacagcat ctttgataaa cttctcagtg aaaatgttgg      2820
     ctaggcaagt tcagttaaaa tatagtagaa atgtttatcc tggtatctct aagtatacat      2880
     ttaattgtac agaaaattta cagtgtaaca ttgtgtcaac atttgcagat tgactgtata      2940
     tgaccttaat ctttgtgcag cctgaaggat cagtgtagta atgccaggaa agtgcttttt      3000
     acctaagact tccttctcag cttctcccat aaagagaccc taatatgcat tttgatttgt      3060
     aattggaaat gtaactttca ctgaaagtgt catgtgatgt ttgcattact tttaactgct      3120
     atgtataaag gaaagtgtgt cttttgactt catcagttat ttctcttgtg cacagagaaa      3180
     aatgcattaa aaatgactaa aaaaaataaa aaattaaaaa atggaaaaaa aaaaaaaaaa      3240
     aaaa                                                                   3244
//
