RMVL: Mappable Vector Library for Handling Large Datasets

Mappable vector library provides convenient way to access large datasets. Use all of your data at once, with few limits. Memory mapped data can be shared between multiple R processes. Access speed depends on storage medium, so solid state drive is recommended, preferably with PCI Express (or M.2 nvme) interface or a fast network file system. The data is memory mapped into R and then accessed using usual R list and array subscription operators. Convenience functions are provided for merging, grouping and indexing large vectors and data.frames. The layout of underlying MVL files is optimized for large datasets. The vectors are stored to guarantee alignment for vector intrinsics after memory map. The package is built on top of libMVL, which can be used as a standalone C library. libMVL has simple C API making it easy to interchange datasets with outside programs. Large MVL datasets are distributed via Academic Torrents <https://academictorrents.com/collection/mvl-datasets>.

Version: 1.1.0.0
Depends: R (≥ 3.5.0)
Published: 2024-05-08
DOI: 10.32614/CRAN.package.RMVL
Author: Vladimir Dergachev ORCID iD [aut, cre]
Maintainer: Vladimir Dergachev <support at altumrete.com>
License: LGPL-2.1
URL: https://academictorrents.com/collection/mvl-datasets, https://github.com/volodya31415/RMVL, https://github.com/volodya31415/libMVL
NeedsCompilation: yes
CRAN checks: RMVL results

Documentation:

Reference manual: RMVL.pdf

Downloads:

Package source: RMVL_1.1.0.0.tar.gz
Windows binaries: r-devel: RMVL_1.1.0.0.zip, r-release: RMVL_1.1.0.0.zip, r-oldrel: RMVL_1.1.0.0.zip
macOS binaries: r-release (arm64): RMVL_1.1.0.0.tgz, r-oldrel (arm64): RMVL_1.1.0.0.tgz, r-release (x86_64): RMVL_1.1.0.0.tgz, r-oldrel (x86_64): RMVL_1.1.0.0.tgz
Old sources: RMVL archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=RMVL to link to this page.