fastMatMR

CRAN_Status_Badge Status at rOpenSci Software Peer Review Lifecycle: stable runiverse-name runiverse-package DOI R-CMD-check pkgcheck Benchmarks

About

fastMatMR provides R bindings for reading and writing to Matrix Market files using the high-performance fast_matrix_market C++ library (version 1.7.6).

Why?

Matrix Market files are crucial to much of the data-science ecosystem. The fastMatMR package focuses on high-performance read and write operations for Matrix Market files, serving as a key tool for data extraction in computational and data science pipelines.

The target audience and scientific applications primarily include data scientists or researchers developing numerical methods who may wish to either test standard NIST (National Institute of Standards and Technology) which include:

comparative studies of algorithms for numerical linear algebra, featuring nearly 500 sparse matrices from a variety of applications, as well as matrix generation tools and services.

Additionally, being able to use the matrix market file format, means it is easier to interface R analysis with those in Python (e.g. SciPy uses the same underlying C++ library). These files can also be used with the Tensor Algebra Compiler (TACO).

Features

We have vignettes for both read and write operations to demonstrate the performance claims.

Alternatives and statement of need

Installation

CRAN

For the latest CRAN version:

install.packages("fastMatMR")

R-Universe

For the latest development version of fastMatMR:

install.packages("fastMatMR",
                 repos = "https://ropensci.r-universe.dev")

Development Git

For the latest commit, one can use:

# install.packages("devtools")
devtools::install_github("ropensci/fastMatMR")

Quick Example

library(fastMatMR)
spmat <- Matrix::Matrix(c(1, 0, 3, 2), nrow = 2, sparse = TRUE)
write_fmm(spmat, "sparse.mtx")
fmm_to_sparse_Matrix("sparse.mtx")

The resulting .mtx file is language agnostic, and can even be read back in python as an example:

pip install fast_matrix_market
python -c 'import fast_matrix_market as fmm; print(fmm.read_array_or_coo("sparse.mtx"))'
((array([1., 3., 2.]), (array([0, 0, 1], dtype=int32), array([0, 1, 1], dtype=int32))), (2, 2))
python -c 'import fast_matrix_market as fmm; print(fmm.read_array("sparse.mtx"))'
array([[1., 3.],
       [0., 2.]])

Similarly, fastMatMR supports writing and reading from other R objects (e.g. standard R vectors and matrices), as seen in the getting started vignette.

Benchmarks

Performance is tracked across PRs using ASV (Airspeed Velocity) with asv-perch for CI comment integration. ASV’s track_* interface runs R benchmarks via Rscript subprocess calls, making the framework language-agnostic while timing actual R code with system.time().

Benchmarks cover sparse and dense Matrix Market read/write operations at varying matrix sizes (100x100 to 1000x1000).

Running locally

# Install the R package into the bench environment
pixi run -e bench R CMD INSTALL --no-docs .

# Validate benchmark definitions
pixi run -e bench asv check

# Run benchmarks against the current commit
pixi run -e bench bash -c "asv machine --yes"
pixi run -e bench asv run \
  -E "existing:$(pixi run -e bench which python)" \
  --set-commit-hash $(git rev-parse HEAD) \
  --record-samples --quick

CI architecture

Two GitHub Actions workflows handle PR benchmarks:

This split ensures fork PRs cannot access write permissions.

Contributing

Contributions are very welcome. Please see the Contribution Guide and our Code of Conduct.

License

This project is licensed under the MIT License.

The logo was generated via a non-commercial use prompt on hotpot.ai, both blue, and green, as a riff on the NIST Matrix Market logo. The text was added in a presentation software (WPS Presentation). Hexagonal cropping was accomplished in a hexb compatible design using hexsticker.

mirror server hosted at Truenetwork, Russian Federation.