SWISSPROT: A4ATG0_MARSH

ID   A4ATG0_MARSH            Unreviewed;      1232 AA.
AC   A4ATG0;
DT   03-APR-2007, integrated into UniProtKB/TrEMBL.
DT   03-APR-2007, sequence version 1.
DT   08-MAY-2019, entry version 47.
DE   SubName: Full=Hypothetical glycine-rich protein {ECO:0000313|EMBL:EAR00730.1};
GN   OrderedLocusNames=FB2170_16636 {ECO:0000313|EMBL:EAR00730.1};
OS   Maribacter sp. (strain HTCC2170 / KCCM 42371).
OC   Bacteria; Bacteroidetes; Flavobacteriia; Flavobacteriales;
OC   Flavobacteriaceae; Maribacter.
OX   NCBI_TaxID=313603 {ECO:0000313|EMBL:EAR00730.1, ECO:0000313|Proteomes:UP000001602};
RN   [1] {ECO:0000313|EMBL:EAR00730.1, ECO:0000313|Proteomes:UP000001602}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=HTCC2170 / KCCM 42371 {ECO:0000313|Proteomes:UP000001602};
RX   PubMed=21037013; DOI=10.1128/JB.01207-10;
RA   Oh H.M., Kang I., Yang S.J., Jang Y., Vergin K.L., Giovannoni S.J.,
RA   Cho J.C.;
RT   "Complete genome sequence of strain HTCC2170, a novel member of the
RT   genus Maribacter in the family Flavobacteriaceae.";
RL   J. Bacteriol. 193:303-304(2011).
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   -----------------------------------------------------------------------
DR   EMBL; CP002157; EAR00730.1; -; Genomic_DNA.
DR   RefSeq; WP_013304533.1; NC_014472.1.
DR   STRING; 313603.FB2170_16636; -.
DR   EnsemblBacteria; EAR00730; EAR00730; FB2170_16636.
DR   KEGG; fbc:FB2170_16636; -.
DR   eggNOG; ENOG4107G9F; Bacteria.
DR   eggNOG; ENOG410ZNMK; LUCA.
DR   HOGENOM; HOG000085654; -.
DR   OMA; DIICDEQ; -.
DR   OrthoDB; 1303930at2; -.
DR   BioCyc; MSP313603:G1GNS-105-MONOMER; -.
DR   Proteomes; UP000001602; Chromosome.
DR   InterPro; IPR008160; Collagen.
DR   Pfam; PF01391; Collagen; 3.
PE   4: Predicted;
DR   PRODOM; A4ATG0.
DR   SWISS-2DPAGE; A4ATG0.
KW   Complete proteome {ECO:0000313|Proteomes:UP000001602};
KW   Reference proteome {ECO:0000313|Proteomes:UP000001602};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL        1     20       {ECO:0000256|SAM:SignalP}.
FT   CHAIN        21   1232       {ECO:0000256|SAM:SignalP}.
FT                                /FTId=PRO_5002666204.
SQ   SEQUENCE   1232 AA;  120706 MW;  E17EBD51A25DD766 CRC64;
     MNIKKLTILL LLLCANNVIA QLKVGDSPDK IDKSSLLELE SRDKALVLSR VNTSLMQKIK
     PLAGALIYNT DTSCVYHYDG KNWNSMCNSS NGQGLSFVDN GNGTFTIRNS DGTSYTSNSP
     TEQNGEKGDT GETGDQGIQG ETGLTGPNGE QGIQGEKGDT GETGAQGIQG ETGAQGEKGD
     TGETGAQGIQ GETGPQGEKG DTGETGAQGI QGETGAQGIQ GETGPQGEKG DTGETGPQGE
     KGDTGETGAQ GIQGETGPQG EKGDTGETGP QGEKGDTGET GPQGEKGDTG ETGPQGDTGE
     TGAQGIQGKV GPQGEKGDTG ETGSQGIQGE TGPQGETGPQ GDTGKTGPQG DTGETGAQGI
     QGETGLTGPN GEQGIQGEKG ETGETGDQGI QGETGPQGET GVQGIQGETG AQGIQGETGP
     QGEKGETGAQ GIQGETGPQG DTGETGAQGI QGETGAQGIQ GETGPQGEKG ETGETGDQGI
     QGETGAQGIQ GETGAQGIQG EVGPLGEKGD TGETGDQGIQ GETGLTGPNG EQGIQGEKGE
     TGETGAQGIQ GETGPQGEKG DTGETGDQGI QGETGAQGIQ GETGPQGEKG DTGETGPQGE
     KGDTGETGAQ GIQGETGPQG EKGDTGETGD QGIQGETGPQ GEKGDTGETG AQGIQGETGA
     QGIQGETGPQ GEKGETGETG DQGIQGETGP QGEKGDTGET GAQGIQGETG LTGETGAQGI
     QGETGAQGIQ GEVGPQGETG PLGEKGDTGE TGDQGIQGET GPQGEKGDTG ETGVQGIQGE
     TGPQGEKGDT GETGAQGIQG ETGLTGPNGE QGIQGEKGET GETGAQGIQG ETGAQGIQGE
     TGPQGEKGDT GETGDQGIQG ETGPQGEKGD TGETGAQGIQ GETGAQGIQG ETGPQGEKGD
     TGETGDQGIQ GEVGPQGEKG DTGETGAQGI QGETGAQGIQ GEVGPQGDTG ETGATGPQGE
     TGAQGEKGDT GETGAQGIQG ETGPQGEKGD TGETGAQGIQ GETGPQGEKG DTGETGPQGI
     QGETGLTGET GAQGIQGETG PQGEKGDTGE TGAQGIQGEV GLQGEKGDTG ETGAQGDPAT
     NISTNLVQNT STGTITYTNE TSTEQVANVV GAETNNSLTL GTNGGAFYES PIKAFGKIAS
     NGSVTRATSG VTAMRISTGR YQVTLPPGTV SDSNYIIQLT QPGRGGAGND DPGISYSNQT
     STNFEVIIGD NDNGGTDRAR FNSEFMFTIL DL
//
