A framework for developing n-gram models for text prediction. It provides data cleaning, data sampling, extracting tokens from text, model generation, model evaluation and word prediction. For information on how n-gram models work we referred to: "Speech and Language Processing" <https://web.archive.org/web/20240919222934/https%3A%2F%2Fweb.stanford.edu%2F~jurafsky%2Fslp3%2F3.pdf>. For optimizing R code and using R6 classes we referred to "Advanced R" <https://adv-r.hadley.nz/r6.html>. For writing R extensions we referred to "R Packages", <https://r-pkgs.org/index.html>.
Version: | 0.0.5 |
Imports: | digest, ggplot2, patchwork, stringr, dplyr, SnowballC |
Suggests: | testthat, covr, knitr, rmarkdown, markdown |
Published: | 2024-10-08 |
Author: | Nadir Latif [aut, cre] |
Maintainer: | Nadir Latif <pakjiddat at gmail.com> |
BugReports: | https://github.com/pakjiddat/word-predictor/issues |
License: | MIT + file LICENSE |
URL: | https://github.com/pakjiddat/word-predictor, https://pakjiddat.github.io/word-predictor/ |
NeedsCompilation: | no |
Language: | en-US |
Citation: | wordpredictor citation info |
Materials: | README NEWS |
CRAN checks: | wordpredictor results [issues need fixing before 2024-10-11] |
Reference manual: | wordpredictor.pdf |
Vignettes: |
Features (source, R code) Overview (source, R code) |
Package source: | wordpredictor_0.0.5.tar.gz |
Windows binaries: | r-devel: wordpredictor_0.0.3.zip, r-release: wordpredictor_0.0.3.zip, r-oldrel: wordpredictor_0.0.3.zip |
macOS binaries: | r-release (arm64): wordpredictor_0.0.3.tgz, r-oldrel (arm64): wordpredictor_0.0.3.tgz, r-release (x86_64): wordpredictor_0.0.3.tgz, r-oldrel (x86_64): wordpredictor_0.0.3.tgz |
Old sources: | wordpredictor archive |
Please use the canonical form https://CRAN.R-project.org/package=wordpredictor to link to this page.