Skip to main content
Metaist

Anscombe's Quartet

Summary #

Anscombe's quartet was created by Francis Anscombe to demonstrate the effect of outliers. All four datasets have the same average, variance, correlation, and linear regression, but look completely different when graphed.

Graphs of Anscombe's quartet

Lies, damned lies, and statistics!

(Image: Wikimedia)

Commentary #

Anscombe's quartet is a neat example of how even large differences between datasets can be hidden by outliers. There is a false sense that the sets are comparable in some way just because they share many statistical properties.

Luckily, a simple graph of the sets reveals the effect. However, graphs can deceive in several other ways:

While I'm interested in the best techniques for presenting data, it's important to understand that there is more to the experience of reading graphs than just presenting the information—our choice of presentation can divert the reader away from the underlying truth about the data.

See Also #