Annotation Modifications

The annotations of the SWISS-PROT + TrEMBL and EMBL subsets used in HOINVGEN are slightly modified to include complementary data related to families.

SWISS-PROT + TrEMBL annotations

First we add for each entry a line in the CC field that gives the number of the family the sequence belongs to:
CC   -!- GENE_FAMILY: INV009948.
This number is incorporated in the keywords associated to the corresponding entry in the ACNUC database structure. Due to that fact it is possible to retrieve all the sequences associated to a family with this number when using the retrieval system Query or the on-line version WWW-Query.

Note: An other type of annotation is present in HOINVGEN. This annotation is relative to the use of a duplicated SWISSPROT-TrEMBL database:

CC   -!- modified from A70A_DROMA.
CC   -!- modified to A70A_DROSI.

EMBL annotations

In this subset we add for each coding sequence a qualifier that gives the number of the family the gene belongs to:
FT                   /gene_family="INV009948"

If you have problems or comments...

Back to PBIL home page