| extractseqs {seqinr} | R Documentation |
The function allows to extract large amount of data as whole genome sequences,using different output formats and types of extraction. This function is not yet available for windows.
extractseqs(listname,socket = autosocket(), format="fasta",operation="simple",feature="xx", bounds="xx", minbounds="xx", verbose = FALSE, nzlines=1000) exseq(listname,socket = autosocket(), format="fasta",operation="simple", feature="xx", bounds="xx", minbounds="xx", verbose = FALSE, nzlines=1000)
listname |
the name of list on server (may be a virtual list) |
socket |
an object of class sockconn connecting to a remote ACNUC
database (default is a socket to the last opened database). |
format |
the format of output.Can be acnuc, fasta,flat or coordinates |
operation |
the type of extraction. Can be simple, translate, fragment, feature or region |
feature |
-optional- the feature to be extracted (for operations "feature" or "region"): a feature table item (CDS, mRNA,...) |
bounds |
-optional- the bounds for extraction (for operations "fragment" or "region") |
minbounds |
-optional- the minimal bounds for extraction (for operations "fragment" or "region") |
verbose |
if TRUE, verbose mode is on |
nzlines |
number of line in zlib mode |
To extract a list of sequences (lrank argument) or a single sequence (seqnum argument) using different output formats and types of extraction. All formats except "coordinates" extract sequence data. Format "coordinates" extract coordinate data; start > end indicates the complementary strand.
choosebank.
Default value (auto) means that the socket will be set to to the socket component of the banknameSocket variable. acnuc, fasta, flat or coordinatessimple, translate, fragment, feature or region| 132,1600 | to extract from nucl. 132 to nucl 1600 of the sequence. If applied to a subsequence, coordinates are in the parent seq relatively to the subsequence start point. | |
| -10,10 | to extract from 10 nucl. BEFORE the 5' end of the sequence to nucl. 10 of it. Useful only for subsequences, and produces a fragment extracted from its parent sequence. | |
| e-20,e+10 | to extract from 20 nucl. BEFORE the 3' end of the sequence to 10 nucl. AFTER its 3' end. Useful only for subsequences, and produces a fragment extracted from its parent sequence. | |
| -20,e+5 | to extract from 20 nucl. BEFORE the 5' end of the sequence to 5 nucl. AFTER its 3' end. |
Sequence data.
S.Penel
citation("seqinr")
## Not run:
# Need internet connection
choosebank("swissprotTP")
query("mylist", "k=globin", virtual = TRUE)
mylist.fasta <- exseq("mylist", verbose = TRUE)
# 103 lines of FASTA
stopifnot(length(mylist.fasta) == 103)
closebank()
## End(Not run)