datasauRus: Datasets from the Datasaurus Dozen

The Datasaurus Dozen is a set of datasets with the same summary statistics. They retain the same summary statistics despite having radically different distributions. The datasets represent a larger and quirkier object lesson that is typically taught via Anscombe's Quartet (available in the 'datasets' package). Anscombe's Quartet contains four very different distributions with the same summary statistics and as such highlights the value of visualisation in understanding data, over and above summary statistics. As well as being an engaging variant on the Quartet, the data is generated in a novel way. The simulated annealing process used to derive datasets from the original Datasaurus is detailed in "Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing" <>.

Version: 0.1.2
Depends: R (≥ 3.0.0)
Suggests: covr, testthat, knitr, rmarkdown, ggplot2
Published: 2017-05-08
Author: Steph Locke [cre, aut], Alberto Cairo [dtc], Justin Matejka [dtc], George Fitzmaurice [dtc], Lucy D'Agostino McGowan [aut]
Maintainer: Steph Locke <stephanie.g.locke at>
License: MIT + file LICENSE
NeedsCompilation: no
Materials: README NEWS
CRAN checks: datasauRus results


Reference manual: datasauRus.pdf
Vignettes: The Datasaurus data package
Package source: datasauRus_0.1.2.tar.gz
Windows binaries: r-devel:, r-release:, r-oldrel:
OS X El Capitan binaries: r-release: datasauRus_0.1.2.tgz
OS X Mavericks binaries: r-oldrel: datasauRus_0.1.2.tgz


Please use the canonical form to link to this page.