Loads and processes huge text corpora preprocessed with the sally toolbox (<http://www.mlsec.org/sally/>). sally is a very fast preprocessor that splits text files into tokens or n-grams. The PRISMA package reads these output files, applies testing-based token selection, and provides replicate-aware, highly tuned implementations of non-negative matrix factorization and principal component analysis, which allow very large data sets to be processed even on desktop machines.
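To make the idea of n-gram splitting and replicate-aware processing concrete, here is a minimal conceptual sketch in Python. It is not PRISMA's or sally's code or API: the function names `char_ngrams` and `collapse_replicates` are hypothetical, and it assumes that each document is represented by the set of character n-grams it contains and that documents with identical sets can be collapsed into one row with a weight.

```python
from collections import Counter


def char_ngrams(text, n=3):
    """Return the set of character n-grams in text (presence only, no counts)."""
    return frozenset(text[i:i + n] for i in range(len(text) - n + 1))


def collapse_replicates(docs, n=3):
    """Group documents that share an identical n-gram set.

    Returns (unique_feature_sets, weights), where weights[i] is the number
    of input documents that map to unique_feature_sets[i].
    Illustrative assumption: this is only one simple way to be
    "replicate-aware", not the package's actual method.
    """
    counts = Counter(char_ngrams(d, n) for d in docs)
    unique = list(counts)  # first-seen order is preserved
    weights = [counts[u] for u in unique]
    return unique, weights


# Made-up example input: four requests, three of them identical.
docs = ["GET /index.html", "GET /index.html", "GET /admin.php", "GET /index.html"]
unique, weights = collapse_replicates(docs)
print(len(unique), weights)
```

Working on the collapsed rows plus their weights, rather than on every original document, is one way a factorization can stay tractable when a corpus contains many duplicates.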
Version: | 0.2-7 |
Depends: | R (≥ 2.10), Matrix, gplots, methods, ggplot2 |
Suggests: | tm (≥ 0.6) |
Published: | 2018-05-26 |
DOI: | 10.32614/CRAN.package.PRISMA |
Author: | Tammo Krueger, Nicole Kraemer |
Maintainer: | Tammo Krueger <tammokrueger at googlemail.com> |
License: | GPL-2 | GPL-3 [expanded from: GPL (≥ 2.0)] |
NeedsCompilation: | no |
Materials: | README |
CRAN checks: | PRISMA results |
Reference manual: | PRISMA.pdf |
Vignettes: | Quick introduction |
Package source: | PRISMA_0.2-7.tar.gz |
Windows binaries: | r-devel: PRISMA_0.2-7.zip, r-release: PRISMA_0.2-7.zip, r-oldrel: PRISMA_0.2-7.zip |
macOS binaries: | r-release (arm64): PRISMA_0.2-7.tgz, r-oldrel (arm64): PRISMA_0.2-7.tgz, r-release (x86_64): PRISMA_0.2-7.tgz, r-oldrel (x86_64): PRISMA_0.2-7.tgz |
Old sources: | PRISMA archive |
Please use the canonical form https://CRAN.R-project.org/package=PRISMA to link to this page.