GFFStrandLoc: Strand-Wise Gene and Protein Extraction from GFF3 Files
Facilitates the extraction and organization of strand-specific genomic features from GFF3 files. In many species and variants, high quality genome annotations are not always available, necessitating de novo annotation using tools such as AUGUSTUS (Stanke et al., 2006; <doi:10.1093/nar/gkl200>). However, downstream processing of such annotations to obtain structured information, such as strand-wise gene locations, transcript regions, and associated protein identifiers—can be computationally intensive and complex.
'GFFStrandLoc' provides a streamlined framework to parse GFF3 files and generate structured outputs containing strand-wise and region-wise genomic coordinates for each transcript, along with their associated protein information. Additionally, it enables users to define custom promoter lengths and extract corresponding promoter region coordinates for genes in a strand-aware manner. By simplifying post-annotation processing, it enhances the usability of de novo annotated genomic datasets for downstream analysis and interpretation.
| Version: |
0.1.0 |
| Published: |
2026-04-15 |
| DOI: |
10.32614/CRAN.package.GFFStrandLoc (may not be active yet) |
| Author: |
Subham Ghosh [aut, cre],
Monendra Grover [aut],
Dipro Sinha [aut],
Dwijesh Mishra [aut],
Sneha Murmu [aut],
Sayanti Guha Majumdar [aut],
UB Angadi [aut],
Md Yeasin [aut] |
| Maintainer: |
Subham Ghosh <search4aghosh at gmail.com> |
| License: |
GPL-3 |
| NeedsCompilation: |
no |
| CRAN checks: |
GFFStrandLoc results |
Documentation:
Downloads:
Linking:
Please use the canonical form
https://CRAN.R-project.org/package=GFFStrandLoc
to link to this page.