fastMatMR

fastMatMR provides R bindings for reading and writing to
Matrix Market
files using the high-performance fast_matrix_market
C++ library (version 1.7.6).
Matrix
Market files are crucial to much of the data-science ecosystem. The
fastMatMR package focuses on high-performance read and
write operations for Matrix Market files, serving as a key tool for data
extraction in computational and data science pipelines.
The target audience and scientific applications primarily include data scientists or researchers developing numerical methods who may wish to either test standard NIST (National Institute of Standards and Technology) which include:
comparative studies of algorithms for numerical linear algebra, featuring nearly 500 sparse matrices from a variety of applications, as well as matrix generation tools and services.
Additionally, being able to use the matrix market file format, means
it is easier to interface R analysis with those in
Python (e.g. SciPy uses the same underlying
C++ library). These files can also be used with the Tensor Algebra
Compiler (TACO).
Extended Support: fastMatMR
supports standard R vectors, matrices, Matrix sparse
objects, spam sparse matrices, and SparseM
matrix.csr objects. Gzip-compressed .mtx.gz
files are handled transparently.
Performance: The package is a thin wrapper
around one of the fastest C++ libraries for reading and writing
.mtx files.
Correctness: Unlike Matrix,
roundtripping with NA and NaN values works by
coercing to NaN instead of to arbitrarily high
numbers.
We have vignettes for both read and write operations to demonstrate the performance claims.
Matrix package allows reading and writing sparse
matrices in the .mtx (matrix market) format.
.mtx files, it can only handles sparse
matrices for writing and reading.NA and NaN values produces arbitrarily high
numbers instead of preserving NaN / handling
NAFor the latest CRAN version:
install.packages("fastMatMR")For the latest development version of fastMatMR:
install.packages("fastMatMR",
repos = "https://ropensci.r-universe.dev")For the latest commit, one can use:
# install.packages("devtools")
devtools::install_github("ropensci/fastMatMR")library(fastMatMR)
spmat <- Matrix::Matrix(c(1, 0, 3, 2), nrow = 2, sparse = TRUE)
write_fmm(spmat, "sparse.mtx")
fmm_to_sparse_Matrix("sparse.mtx")The resulting .mtx file is language agnostic, and can
even be read back in python as an example:
pip install fast_matrix_market
python -c 'import fast_matrix_market as fmm; print(fmm.read_array_or_coo("sparse.mtx"))'
((array([1., 3., 2.]), (array([0, 0, 1], dtype=int32), array([0, 1, 1], dtype=int32))), (2, 2))
python -c 'import fast_matrix_market as fmm; print(fmm.read_array("sparse.mtx"))'
array([[1., 3.],
[0., 2.]])Similarly, fastMatMR supports writing and reading from
other R objects (e.g. standard R vectors and matrices), as
seen in the getting
started vignette.
Performance is tracked across PRs using ASV (Airspeed Velocity) with asv-perch for CI comment
integration. ASV’s track_* interface runs R benchmarks via
Rscript subprocess calls, making the framework
language-agnostic while timing actual R code with
system.time().
Benchmarks cover sparse and dense Matrix Market read/write operations at varying matrix sizes (100x100 to 1000x1000).
# Install the R package into the bench environment
pixi run -e bench R CMD INSTALL --no-docs .
# Validate benchmark definitions
pixi run -e bench asv check
# Run benchmarks against the current commit
pixi run -e bench bash -c "asv machine --yes"
pixi run -e bench asv run \
-E "existing:$(pixi run -e bench which python)" \
--set-commit-hash $(git rev-parse HEAD) \
--record-samples --quickTwo GitHub Actions workflows handle PR benchmarks:
ci_benchmark.yml runs on
pull_request events with read-only permissions. It
benchmarks both the base and PR commits using a matrix strategy, then
combines results via asv-spyglass.ci_bench_commenter.yml triggers on
workflow_run completion and posts a comparison table as a
PR comment using asv-perch.This split ensures fork PRs cannot access write permissions.
Contributions are very welcome. Please see the Contribution Guide and our Code of Conduct.
This project is licensed under the MIT License.
The logo was generated via a non-commercial use prompt on hotpot.ai, both blue, and green, as a riff on the NIST Matrix Market logo. The text was added in a presentation software (WPS Presentation). Hexagonal cropping was accomplished in a hexb compatible design using hexsticker.