Native R 'torch' Implementation of 'OpenAI' 'Whisper' [R package whisper version 0.4.0]

Troy Hernandez

whisper: Native R 'torch' Implementation of 'OpenAI' 'Whisper'

Speech-to-text transcription using a native R 'torch' implementation of 'OpenAI' 'Whisper' model <https://github.com/openai/whisper>. Supports multiple model sizes from tiny (39M parameters) to large-v3 (1.5B parameters) with integrated download from 'HuggingFace' <https://huggingface.co/> via the 'hfhub' package. Provides automatic speech recognition with optional language detection and translation to English. Audio preprocessing, mel spectrogram computation, and transformer-based encoder-decoder inference are all implemented in R using the 'torch' package.

Version:	0.4.0
Imports:	torch (≥ 0.17.0), av, jsonlite, hfhub, safetensors, stats, utils
Suggests:	tinytest
Published:	2026-06-20
DOI:	10.32614/CRAN.package.whisper
Author:	Troy Hernandez [aut, cre], cornball.ai [cph], OpenAI [cph] (Whisper model architecture and mel filterbank data (MIT license))
Maintainer:	Troy Hernandez <troy at cornball.ai>
BugReports:	https://github.com/cornball-ai/whisper/issues
License:	MIT + file LICENSE
URL:	https://github.com/cornball-ai/whisper
NeedsCompilation:	no
Materials:	README, NEWS
CRAN checks:	whisper results

Documentation:

Reference manual:

whisper.html , whisper.pdf

Downloads:

Package source:	whisper_0.4.0.tar.gz
Windows binaries:	r-devel: whisper_0.4.0.zip, r-release: whisper_0.4.0.zip, r-oldrel: whisper_0.4.0.zip
macOS binaries:	r-release (arm64): whisper_0.4.0.tgz, r-oldrel (arm64): whisper_0.4.0.tgz, r-release (x86_64): whisper_0.4.0.tgz, r-oldrel (x86_64): whisper_0.4.0.tgz
Old sources:	whisper archive

Reverse dependencies:

Reverse suggests:

stt.api

Linking:

Please use the canonical form https://CRAN.R-project.org/package=whisper to link to this page.