CHOIX_LIST : To choose the list to work with

Click on one of the lists appearing in panel Current lists,



SAVE_LIST : To save in a file the names or accession numbers of the members of a list.

Modify the proposed filename if desired.
For a sequence list, choose between 2 options:
  1. seq. names : to save sequence names
  2. accession #s : to save accession numbers (safer when names change)



EXTRACT : To copy in a file the sequences contained in a list.

Modify the proposed filename if desired. Progress of the extraction procedure is graphically displayed and may be interrupted by pressing the interrupt button that appears then.
Outfile format.

Five types of outputs are proposed. Choose that you want.

  1. File texte
  2. File format GCG
  3. File format FASTA
  4. File format ANALSEQ
  5. File format EMBL
All but gcgput all sequences in a single file.

Extraction type.

Five types of extractions are proposed.

  • Simple : Just extract the sequences or subsequences of list.
  • Translate to protein : Useful for protein-coding subsequences only. Will translate them in protein on the fly using the adequate reading frame and genetic code. Does nothing on non-protein coding sequences.
  • Extract fragment : Allows to extract any part of the sequence(s) in list. Such part is specified in the dialog box appearing later following the syntax suggested by these examples:
    
    132,1600        to extract from nucl. 132 to nucl 1600 of the sequence
    -10,10          to extract from 10 nucl. BEFORE the 5' end of the sequence
                    to nucl. 10 of it. Useful only for subsequences, and produces
                    a fragment extracted from its parent sequence.
    e-20,e+10       to extract from 20 nucl. BEFORE the 3' end of the sequence
                    to 10 nucl. AFTER its 3' end. Useful only for subsequences, and 
                    produces a fragment extracted from its parent sequence.
    -20,e+5         to extract from 20 nucl. BEFORE the 5' end of the sequence
                    to 5 nucl. AFTER its 3' end.
    
  • Extract feature : Allows to extract any feature listed in the feature table of sequences in list. Select the desired feature in the scrolling list appearing later. Meaningful only for parent sequences (subsequences have no feature table) and to access those features that do not correspond to a subsequence (e.g., EXON, mRNA, PRIM_TRANSCRIPT, REP_ORIGIN).
  • Extract feature region : Allows to extract a region surrounding a feature of sequences in list. Select next the desired feature as explained above and the desired fragment around this feature as explained for option extract fragment above.


EDIT_LIST : Edit the content of a list by removing or retaining specified elements.

On the window labelled list edit appearing next choose
  1. remove to exclude members of the list
  2. retain to keep members of the list
  • The list content appears in a scroller, and desired elements can be selectively removed/retained by clicking on them.
  • After clicking on button done the edited list appears as a new list in panel Current lists



SCAN_LIST : To scan annotations of sequences in a list for presence of a string

Enter the desired string (or word) in the dialog window that appears next.
Any sequence having this string in its annotations will be put in a new list



ADD_PARENTS : Replace subsequences of a list by their parent sequences

The list is scanned, and each subsequence is replaced by its parent sequences. Parent sequences are transmitted unaltered. The resulting list appears as a new list in panel Current list.


ADD_SUBSEQS : Add to a list all subsequences of parent sequences therein

The list is scanned, and all subsequences of each parent sequence are added to the list. Subsequences are transmitted unaltered.
The resulting list appears as a new list in panel Current list.


DELETE_LIST : Deletes the list selected in panel Current lists

This releases the memory occupied by a list and may be necessary before creating new lists when many lists have been previously built.


SEL_BY_LENGTH : Select sequences from a list according to their length

Type in the dialog box appearing next the length choice desired as:
  • >1000 to retain sequences of length >= 1000
  • <5000 to retain sequences of length <= 5000
Retained sequences appear in a new list in panel Current list.


SEL_BY_DATE : Select sequences from a list according to the date of their last update

Type in the dialog box appearing next the date choice desired as:
  • 1/juin/93 to retain sequences updated after June 1st, 1993
  • 31/dec/89 to retain sequences updated before Dec 31st, 1989
Valid months are: jan feb mar apr may jun jul aug sep oct nov dec
Retained sequences appear in a new list in panel Current list.


TOTAL_BASES : To compute the total length of sequences in a list

The total appears in panel Status.