tidyEmoji: Discover, Count and Score Emoji in Text

A tidy toolkit for working with the emoji in any text column, such as social-media posts, product reviews, chat logs or survey responses. Unicode is awkward to handle and not every code point is an emoji, which makes emoji statistics fiddly to obtain. 'tidyEmoji' extracts, counts, categorises and sentiment-scores emoji with grapheme-aware detection (so skin-tone and multi-person sequences stay intact), returning tidy data frames that slot straight into a 'tidyverse' workflow. The bundled emoji sentiment lexicon is from the Emoji Sentiment Ranking of Kralj Novak et al. (2015) <doi:10.1371/journal.pone.0144296>, released under CC BY-SA 4.0.

Version: 0.2.0
Depends: R (≥ 3.5.0)
Imports: dplyr (≥ 1.1.0), emoji, lifecycle, stats, tibble, tidyr, utils
Suggests: rmarkdown, knitr, testthat (≥ 3.0.0), ggplot2, readr, forcats, stringr
Published: 2026-06-17
DOI: 10.32614/CRAN.package.tidyEmoji
Author: Youzhi Yu [aut, cre]
Maintainer: Youzhi Yu <yuyouzhi666 at icloud.com>
BugReports: https://github.com/PursuitOfDataScience/tidyEmoji/issues
License: GPL (≥ 3)
URL: https://pursuitofdatascience.github.io/tidyEmoji/
NeedsCompilation: no
Materials: README, NEWS
CRAN checks: tidyEmoji results

Documentation:

Reference manual: tidyEmoji.html , tidyEmoji.pdf
Vignettes: Introduction to tidyEmoji (source, R code)

Downloads:

Package source: tidyEmoji_0.2.0.tar.gz
Windows binaries: r-devel: tidyEmoji_0.1.1.zip, r-release: tidyEmoji_0.1.1.zip, r-oldrel: tidyEmoji_0.1.1.zip
macOS binaries: r-release (arm64): tidyEmoji_0.2.0.tgz, r-oldrel (arm64): tidyEmoji_0.2.0.tgz, r-release (x86_64): tidyEmoji_0.2.0.tgz, r-oldrel (x86_64): tidyEmoji_0.2.0.tgz
Old sources: tidyEmoji archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=tidyEmoji to link to this page.

mirror server hosted at Truenetwork, Russian Federation.