These functions take a character vector as input, identify and cluster similar values, and then merge clusters together so their values become identical. The functions are an implementation of the key collision and ngram fingerprint algorithms from the open source tool Open Refine <https://openrefine.org/>. More info on key collision and ngram fingerprint can be found here <https://openrefine.org/docs/technical-reference/clustering-in-depth>.
Version: | 0.3.3 |
Depends: | R (≥ 3.0.2) |
Imports: | Rcpp, stringdist (≥ 0.9.5.1), stringi |
LinkingTo: | Rcpp, stringdist (≥ 0.9.5.1) |
Suggests: | testthat, knitr, rmarkdown, dplyr |
Published: | 2023-11-12 |
DOI: | 10.32614/CRAN.package.refinr |
Author: | Chris Muir [aut, cre] |
Maintainer: | Chris Muir <chrismuirRVA at gmail.com> |
BugReports: | https://github.com/ChrisMuir/refinr/issues |
License: | GPL-3 |
URL: | https://github.com/ChrisMuir/refinr |
NeedsCompilation: | yes |
Materials: | README NEWS |
CRAN checks: | refinr results |
Reference manual: | refinr.pdf |
Vignettes: |
Intro to refinr |
Package source: | refinr_0.3.3.tar.gz |
Windows binaries: | r-devel: refinr_0.3.3.zip, r-release: refinr_0.3.3.zip, r-oldrel: refinr_0.3.3.zip |
macOS binaries: | r-release (arm64): refinr_0.3.3.tgz, r-oldrel (arm64): refinr_0.3.3.tgz, r-release (x86_64): refinr_0.3.3.tgz, r-oldrel (x86_64): refinr_0.3.3.tgz |
Old sources: | refinr archive |
Please use the canonical form https://CRAN.R-project.org/package=refinr to link to this page.