tmfast: Fast Topic Models Using Varimax

Fits topic models using varimax-rotated principal component analysis (PCA), following the "vintage factor analysis" approach of Rohe & Zheng (2020) <doi:10.48550/arXiv.2004.05387>. Leverages truncated PCA via 'irlba' for sparse matrices, enabling fast model fitting on large corpora. Includes an information-theoretic approach to vocabulary selection, 'broom'-compatible tidiers for extracting word-topic and topic-document matrices into a tidy data workflow, and samplers for constructing simulated corpora for benchmarking and method evaluation.

Version: 0.1.1
Depends: R (≥ 4.1.0)
Imports: assertthat, purrr, dplyr, tidyr, magrittr, rlang, stringr, tibble, tidyselect, irlba, tidytext, glue, Matrix, generics, psych, cli
Suggests: testthat (≥ 3.0.0), knitr, rmarkdown, ggbeeswarm, ggplot2, Rtsne, umap, lpSolve, janeaustenr, stm, tictoc, furrr, reshape2, tmfast.realbooks
Published: 2026-05-30
DOI: 10.32614/CRAN.package.tmfast (may not be active yet)
Author: D. Hicks ORCID iD [aut, cre, cph]
Maintainer: D. Hicks <hicks.daniel.j at gmail.com>
BugReports: https://github.com/dhicks/tmfast/issues
License: GPL (≥ 3)
URL: https://dhicks.github.io/tmfast/, https://github.com/dhicks/tmfast
NeedsCompilation: no
Additional_repositories: https://dhicks.github.io/drat/
Materials: README, NEWS
CRAN checks: tmfast results

Documentation:

Reference manual: tmfast.html , tmfast.pdf
Vignettes: Fast topic modeling with real books (source, R code)
Fitting topic models with tmfast (source, R code)

Downloads:

Package source: tmfast_0.1.1.tar.gz
Windows binaries: r-devel: not available, r-release: not available, r-oldrel: not available
macOS binaries: r-release (arm64): not available, r-oldrel (arm64): not available, r-release (x86_64): not available, r-oldrel (x86_64): not available

Linking:

Please use the canonical form https://CRAN.R-project.org/package=tmfast to link to this page.

mirror server hosted at Truenetwork, Russian Federation.