
EMBL: BC020744

ID   BC020744; SV 1; linear; mRNA; STD; HUM; 1219 BP.
XX
AC   BC020744;
XX
DT   09-JAN-2002 (Rel. 70, Created)
DT   15-OCT-2008 (Rel. 97, Last updated, Version 12)
XX
DE   Homo sapiens aldo-keto reductase family 1, member C4 (chlordecone
DE   reductase; 3-alpha hydroxysteroid dehydrogenase, type I; dihydrodiol
DE   dehydrogenase 4), mRNA (cDNA clone MGC:22581 IMAGE:4734943), complete cds.
XX
KW   MGC.
XX
OS   Homo sapiens (human)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Homo.
XX
RN   [1]
RP   1-1219
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-1219
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (03-JAN-2002) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; 9cef8fd7db9aad2e55bb9fa72ae6c0dc.
DR   Ensembl-Gn; ENSG00000198610; homo_sapiens.
DR   Ensembl-Tr; ENST00000263126; homo_sapiens.
DR   Ensembl-Tr; ENST00000380448; homo_sapiens.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: CLONTECH
CC   cDNA Library Preparation: CLONTECH Laboratories, Inc.
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Sequencing Group at the Stanford Human Genome
CC   Center, Stanford University School of Medicine, Stanford, CA  94305
CC   Web site:       http://www-shgc.stanford.edu
CC   Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
CC   Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
CC   R. M.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAL Plate: 37 Row: j Column: 16
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 24497584.
CC   Differences found between this sequence and the human reference
CC   genome (build 36) are described in misc_difference features below
CC   and these differences were also compared to chimpanzee genomic
CC   sequences available as of 09/15/2004.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1219
FT                   /organism="Homo sapiens"
FT                   /lab_host="DH10B"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NIH_MGC_76"
FT                   /clone="MGC:22581 IMAGE:4734943"
FT                   /tissue_type="Liver"
FT                   /note="Vector: pDNR-LIB"
FT                   /db_xref="taxon:9606"
FT   gene            1..1219
FT                   /gene="AKR1C4"
FT                   /note="synonyms: DD4, HAKRA, C11, CDR, MGC22581,
FT                   3-alpha-HSD"
FT   CDS_pept        34..1005
FT                   /codon_start=1
FT                   /gene="AKR1C4"
FT                   /product="aldo-keto reductase family 1, member C4
FT                   (chlordecone reductase; 3-alpha hydroxysteroid
FT                   dehydrogenase, type I; dihydrodiol dehydrogenase 4)"
FT                   /db_xref="GOA:P17516"
FT                   /db_xref="H-InvDB:HIT000038930.15"
FT                   /db_xref="HGNC:HGNC:387"
FT                   /db_xref="InterPro:IPR018170"
FT                   /db_xref="InterPro:IPR020471"
FT                   /db_xref="InterPro:IPR023210"
FT                   /db_xref="InterPro:IPR036812"
FT                   /db_xref="PDB:2FVL"
FT                   /db_xref="UniProtKB/Swiss-Prot:P17516"
FT                   /protein_id="AAH20744.1"
FT                   /translation="MDPKYQRVELNDGHFMPVLGFGTYAPPEVPRNRAVEVTKLAIEAG
FT                   FRHIDSAYLYNNEEQVGLAIRSKIADGSVKREDIFYTSKLWCTFFQPQMVQPALESSLK
FT                   KLQLDYVDLYLLHFPMALKPGETPLPKDENGKVIFDTVDLSATWEVMEKCKDAGLAKSI
FT                   GVSNFNYRQLEMILNKPGLKYKPVCNQVECHPYLNQSKLLDFCKSKDIVLVAHSALGTQ
FT                   RHKLWVDPNSPVLLEDPVLCALAKKHKRTPALIALRYQLQRGVVVLAKSYNEQRIRENI
FT                   QVFEFQLTSEDMKVLDGLNRNYRYVVMDFLMDHPDYPFSDEY"
FT   misc_difference 542
FT                   /gene="AKR1C4"
FT                   /note="'A' in cDNA is 'G' in the human genome; amino acid
FT                   difference: 'Y' in cDNA, 'C' in the human genome."
FT   misc_difference 782
FT                   /gene="AKR1C4"
FT                   /note="'G' in cDNA is 'A' in the human genome; amino acid
FT                   difference: 'R' in cDNA, 'Q' in the human genome. The
FT                   chimpanzee genome agrees with the cDNA sequence, suggesting
FT                   that this difference is unlikely to be due to an artifact."
FT   misc_difference 1094
FT                   /gene="AKR1C4"
FT                   /note="'G' in cDNA is 'A' in the human genome. The
FT                   chimpanzee genome agrees with the cDNA sequence, suggesting
FT                   that this difference is unlikely to be due to an artifact."
FT   misc_difference 1191
FT                   /gene="AKR1C4"
FT                   /note="'C' in cDNA is 'G' in the human genome. The
FT                   chimpanzee genome agrees with the cDNA sequence, suggesting
FT                   that this difference is unlikely to be due to an artifact."
FT   misc_difference 1195..1219
FT                   /gene="AKR1C4"
FT                   /note="polyA tail: 25 bases do not align to the human
FT                   genome."
XX
SQ   Sequence 1219 BP; 371 A; 271 C; 279 G; 298 T; 0 other;
     acaggatctg cttagtgaaa gaagtggcaa gcaatggatc ccaaatatca gcgtgtagag        60
     ctaaatgatg gtcacttcat gcccgtattg ggatttggca cctatgcacc tccagaggtt       120
     ccgaggaaca gagctgtaga ggtcaccaaa ttagcaatag aagctggctt ccgccatatt       180
     gattctgctt atttatacaa taatgaggag caggttggac tggccatccg aagcaagatt       240
     gcagatggca gtgtgaagag agaagacata ttctacactt caaagctttg gtgcactttc       300
     tttcaaccac agatggtcca accagccttg gaaagctcac tgaaaaaact tcaactggac       360
     tatgttgacc tctatcttct tcatttccca atggctctca agccaggtga gacgccacta       420
     ccaaaagatg aaaatggaaa agtaatattc gacacagtgg atctctctgc cacatgggag       480
     gtcatggaga agtgtaagga tgcaggattg gccaagtcca tcggggtgtc aaacttcaac       540
     tacaggcagc tggagatgat cctcaacaag ccaggactca agtacaagcc tgtctgcaac       600
     caggtagaat gtcatcctta cctcaaccag agcaaactgc tggatttctg caagtcaaaa       660
     gacattgttc tggttgccca cagtgctctg ggaacccaac gacataaact atgggtggac       720
     ccaaactccc cagttctttt ggaggaccca gttctttgtg ccttagcaaa gaaacacaaa       780
     cgaaccccag ccctgattgc cctgcgctac cagctgcagc gtggggttgt ggtcctggcc       840
     aagagctaca atgagcagcg gatcagagag aacatccagg tttttgaatt ccagttgaca       900
     tcagaggata tgaaagttct agatggtcta aacagaaatt atcgatatgt tgtcatggat       960
     tttcttatgg accatcctga ttatccattt tcagatgaat attagcatag agggtgttgc      1020
     acgacatcta gcagaaggcc ctgtgtgtgg atggtgatgc agaggatgtc tctatgctgg      1080
     tgactggaca cacggcctct ggttaaatcc ctcccctcct gcttggcaac ttcagctagc      1140
     tagatatatc catggtccag aaagcaaaca taataaattt ttatcttgaa ctaaaaaaaa      1200
     aaaaaaaaaa aaaaaaaaa                                                   1219
//
