wordpredictor: Develop Text Prediction Models Based on N-Grams

A framework for developing n-gram models for text prediction. It provides data cleaning, data sampling, extracting tokens from text, model generation, model evaluation and word prediction. For information on how n-gram models work we referred to: "Speech and Language Processing" <https://web.archive.org/web/20240919222934/https%3A%2F%2Fweb.stanford.edu%2F~jurafsky%2Fslp3%2F3.pdf>. For optimizing R code and using R6 classes we referred to "Advanced R" <https://adv-r.hadley.nz/r6.html>. For writing R extensions we referred to "R Packages", <https://r-pkgs.org/index.html>.

Version: 0.0.5
Imports: digest, ggplot2, patchwork, stringr, dplyr, SnowballC
Suggests: testthat, covr, knitr, rmarkdown, markdown
Published: 2024-10-08
Author: Nadir Latif ORCID iD [aut, cre]
Maintainer: Nadir Latif <pakjiddat at gmail.com>
BugReports: https://github.com/pakjiddat/word-predictor/issues
License: MIT + file LICENSE
URL: https://github.com/pakjiddat/word-predictor, https://pakjiddat.github.io/word-predictor/
NeedsCompilation: no
Language: en-US
Citation: wordpredictor citation info
Materials: README NEWS
CRAN checks: wordpredictor results [issues need fixing before 2024-10-11]

Documentation:

Reference manual: wordpredictor.pdf
Vignettes: Features (source, R code)
Overview (source, R code)

Downloads:

Package source: wordpredictor_0.0.5.tar.gz
Windows binaries: r-devel: wordpredictor_0.0.3.zip, r-release: wordpredictor_0.0.3.zip, r-oldrel: wordpredictor_0.0.3.zip
macOS binaries: r-release (arm64): wordpredictor_0.0.3.tgz, r-oldrel (arm64): wordpredictor_0.0.3.tgz, r-release (x86_64): wordpredictor_0.0.3.tgz, r-oldrel (x86_64): wordpredictor_0.0.3.tgz
Old sources: wordpredictor archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=wordpredictor to link to this page.