
Since the quartet's publication, several methods to generate similar data sets with identical statistics and dissimilar graphics have been developed. One of these, the Datasaurus Dozen, consists of points tracing out the outline of a dinosaur, plus twelve other data sets that have the same summary statistics.

The quartet is still often used to illustrate the importance of looking at a set of data graphically before starting to analyze it according to a particular type of relationship, and the inadequacy of basic statistical properties for describing realistic datasets. The x values are the same for the first three datasets. It is not known how Anscombe created his datasets.

The first scatter plot (top left) appears to be a simple linear relationship, corresponding to two correlated variables, where y could be modelled as Gaussian with mean linearly dependent on x. In the second graph (top right), a relationship between the two variables is obvious, but it is not linear, and the Pearson correlation coefficient is not relevant.

Coefficient of determination of the linear regression: R²
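The point that the four datasets share nearly identical summary statistics can be checked directly. The following is a minimal sketch in plain Python, assuming the data values as published by Anscombe in 1973; the names `QUARTET` and `summarize` are illustrative and not taken from any library. It computes the means, sample variances, Pearson correlation, least-squares regression line and R² for each dataset.

```python
from math import sqrt

# Anscombe's quartet (values assumed to match the 1973 publication).
# Datasets I-III share the same x values.
X123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
QUARTET = {
    "I":   (X123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    "II":  (X123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    "III": (X123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    "IV":  ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
            [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def summarize(xs, ys):
    """Return basic summary statistics and the least-squares line y = a + b*x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of squared deviations and cross-products.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    r = sxy / sqrt(sxx * syy)          # Pearson correlation coefficient
    slope = sxy / sxx                  # least-squares slope b
    intercept = mean_y - slope * mean_x  # least-squares intercept a
    return {
        "mean_x": mean_x,
        "mean_y": mean_y,
        "var_x": sxx / (n - 1),        # sample variance of x
        "var_y": syy / (n - 1),        # sample variance of y
        "r": r,
        "slope": slope,
        "intercept": intercept,
        "r_squared": r * r,            # equals R^2 for simple linear regression
    }


for name, (xs, ys) in QUARTET.items():
    stats = summarize(xs, ys)
    print(name, {key: round(value, 2) for key, value in stats.items()})
```

Rounded to two decimal places, the four rows should agree almost exactly, even though scatter plots of the same data look nothing alike. That is why the summary figures alone cannot distinguish, for example, the linear first dataset from the curved second one.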
