whisper: Native R 'torch' Implementation of 'OpenAI' 'Whisper'

Speech-to-text transcription using a native R 'torch' implementation of the 'OpenAI' 'Whisper' model <https://github.com/openai/whisper>. Supports multiple model sizes, from tiny (39M parameters) to large-v3 (1.5B parameters), with integrated model download from 'HuggingFace' <https://huggingface.co/> via the 'hfhub' package. Provides automatic speech recognition with optional language detection and translation to English. Audio preprocessing, mel spectrogram computation, and transformer-based encoder-decoder inference are all implemented in R using the 'torch' package.

Version: 0.1.0
Imports: torch, av, jsonlite, hfhub, safetensors, stats, utils
Suggests: tinytest
Published: 2026-02-06
DOI: 10.32614/CRAN.package.whisper (may not be active yet)
Author: Troy Hernandez [aut, cre], cornball.ai [cph], OpenAI [cph] (Whisper model architecture and mel filterbank data (MIT license))
Maintainer: Troy Hernandez <troy at cornball.ai>
BugReports: https://github.com/cornball-ai/whisper/issues
License: MIT + file LICENSE
URL: https://github.com/cornball-ai/whisper
NeedsCompilation: no
Materials: README, NEWS
CRAN checks: whisper results

Documentation:

Reference manual: whisper.html, whisper.pdf

Downloads:

Package source: whisper_0.1.0.tar.gz
Windows binaries: r-devel: not available, r-release: not available, r-oldrel: not available
macOS binaries: r-release (arm64): not available, r-oldrel (arm64): not available, r-release (x86_64): not available, r-oldrel (x86_64): whisper_0.1.0.tgz

Linking:

Please use the canonical form https://CRAN.R-project.org/package=whisper to link to this page.