corpus: Text Corpus Analysis

Text corpus data analysis with full Unicode support. Provides functions for reading data from newline-delimited JSON files, normalizing and tokenizing text, and computing term occurrence frequencies.
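
The workflow described above (read newline-delimited JSON, normalize, tokenize, count terms) can be sketched in a language-neutral way. The following is a minimal illustration in Python using only the standard library. It is not the package's R interface. The function names, the "text" field name, and the choice of NFKC normalization with case folding are all assumptions made for this example, and the whitespace-based tokenizer is much simpler than proper Unicode word segmentation.

```python
import io
import json
import unicodedata
from collections import Counter


def read_ndjson(stream):
    """Yield one parsed JSON object per non-blank line of the stream."""
    for line in stream:
        line = line.strip()
        if line:
            yield json.loads(line)


def normalize(text):
    """Apply NFKC normalization, then Unicode-aware case folding (assumed choices)."""
    return unicodedata.normalize("NFKC", text).casefold()


def _is_punct(ch):
    """True if the character belongs to a Unicode punctuation category (P*)."""
    return unicodedata.category(ch).startswith("P")


def tokenize(text):
    """Split normalized text on whitespace and trim punctuation from token edges."""
    tokens = []
    for raw in normalize(text).split():
        start, end = 0, len(raw)
        while start < end and _is_punct(raw[start]):
            start += 1
        while end > start and _is_punct(raw[end - 1]):
            end -= 1
        if start < end:
            tokens.append(raw[start:end])
    return tokens


def term_frequencies(records, field="text"):
    """Count token occurrences across all records; 'field' is a hypothetical key."""
    counts = Counter()
    for record in records:
        counts.update(tokenize(record.get(field, "")))
    return counts


# Two in-memory records standing in for a newline-delimited JSON file.
sample = io.StringIO(
    '{"text": "Caf\\u00e9 society, caf\\u00e9 culture."}\n'
    '{"text": "The CAF\\u00c9 was closed."}\n'
)
counts = term_frequencies(read_ndjson(sample))
print(counts.most_common(3))
```

Because every token is normalized before counting, "Café", "café", and "CAFÉ" are all tallied as the same term.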

Version: 0.5.1
Imports: Matrix
Suggests: testthat
Published: 2017-05-25
Author: Patrick O. Perry [aut, cre], Martin Porter and Richard Boulton [ctb, cph] (Snowball), Unicode, Inc. [ctb, cph] (Unicode Character Database)
Maintainer: Patrick O. Perry <pperry at stern.nyu.edu>
BugReports: https://github.com/patperry/r-corpus/issues
License: Apache License (== 2.0) | file LICENSE
URL: https://github.com/patperry/r-corpus
NeedsCompilation: yes
Materials: NEWS
CRAN checks: corpus results

Downloads:

Reference manual: corpus.pdf
Package source: corpus_0.5.1.tar.gz
Windows binaries: r-devel: corpus_0.5.1.zip, r-release: corpus_0.5.1.zip, r-oldrel: corpus_0.5.1.zip
OS X El Capitan binaries: r-release: corpus_0.5.1.tgz
OS X Mavericks binaries: r-oldrel: corpus_0.5.1.tgz
Old sources: corpus archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=corpus to link to this page.