Why DataViz?

People don't read papers any more ...

... they just look at the graphs

How many 3s?

Data is hard to Understand

How many 3s?

Slide from Heer, Stasko

Better than Tables

https://www.autodeskresearch.com/publications/samestats

Even Stats Can Fail

1, 2, 3, 4, ...
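
A minimal sketch of how summary statistics can hide very different data. It uses Anscombe's quartet as a stand-in, which is an assumption: the slide only numbers its panels 1, 2, 3, 4, and the page linked above uses a different collection of datasets. The helper names (`pearson_r`, `summarise`) are illustrative only.

```python
from math import sqrt
from statistics import mean

# Anscombe's quartet: four small x/y datasets.
# Assumption: used here for illustration; the slides do not name their data.
X123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
QUARTET = {
    "1": (X123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    "2": (X123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    "3": (X123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    "4": ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
          [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def summarise(xs, ys):
    """Means and correlation, rounded to two decimals as a table would show them."""
    return {
        "mean_x": round(mean(xs), 2),
        "mean_y": round(mean(ys), 2),
        "r": round(pearson_r(xs, ys), 2),
    }


SUMMARIES = {name: summarise(xs, ys) for name, (xs, ys) in QUARTET.items()}

if __name__ == "__main__":
    for name, stats in SUMMARIES.items():
        print(name, stats)
```

At this rounding the four summaries are indistinguishable, yet a scatter plot of each dataset looks nothing like the others.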

DataViz works because Vision is Powerful

Numbers -> 1 Dimensional

Vision -> 5-8 Dimensions

Why This Talk?

“Perception is a fantasy that (tries to) coincide with reality”

Straight or Bent lines?

... they're straight

See the Triangle?

... it's not really there

DataViz is Hard
is Design

(Good)

No Formulae

No Rules

Many Requirements

Design

Creative

Iterative

M Bostock, https://youtu.be/fThhbt23SGM

Be Minimalistic

Be Clear

Use as little ink, or as few pixels, as possible for every bit of data.

Keep the ink/data ratio low.

Attention Bottleneck

"You can pay attention to only one aspect of an image at a time ... neural networks in your brain constantly compete for limited attentional resources."

Demonstration of Attention Bottleneck

Red Circle

Left | Right

Attention Bottleneck ...

Purple Circle

Left | Right

Purple Circle

Left | Right

Simplified Information is Surprisingly Effective

Simple lines and edges are actually what your brain is looking for.

And so vision works quite well without all of the visual detail.

This also eases the attentional bottleneck.

https://speakerdeck.com/cherdarchuk/remove-to-improve-the-data-ink-ratio

Removing "Chart Junk"

https://speakerdeck.com/cherdarchuk/remove-to-improve-the-data-ink-ratio

Rethinking your Graph

Make Comparisons Accurately

Different visual qualities have different accuracies.

Use the one appropriate to your data.

Visual Qualities

Position

Length

Angle

Area

Colour / Brightness

Visual Qualities

Table ... Counts too

Visual Qualities

Sometimes a visualisation doesn't do any better than a table.

Tables are the baseline for assessing the quality of a visualisation.

Colour

A B C D E F G H I J K

.... which is the FOURTH Darkest?

Colour

Colour can suffer from illusions

Which bars are the same and which are different?

Area

A B C D E F G H I J K

.... which is the FOURTH Smallest?

Unfortunately, currently popular ... for example

Angles

Which is second smallest?

Third Smallest?

Angles

Slope Comparisons

Can be relatively accurate when optimised

Slope Comparisons

Differences are easier to see when the angle is ~45 degrees.
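
One way to act on this is to choose the plot's aspect ratio so that typical line segments are drawn near 45 degrees. The sketch below uses a median-absolute-slope heuristic, which is an assumption on my part: the slides do not say how the slopes were optimised, and other banking methods exist. The function names and the sample series are hypothetical.

```python
from math import atan, degrees
from statistics import median


def segment_slopes(xs, ys):
    """Absolute data-space slope of each segment between consecutive points."""
    pairs = list(zip(xs, ys))
    return [
        abs((y1 - y0) / (x1 - x0))
        for (x0, y0), (x1, y1) in zip(pairs, pairs[1:])
        if x1 != x0  # skip vertical segments, whose slope is undefined
    ]


def bank_to_45(xs, ys):
    """Height/width ratio that draws the median absolute slope at 45 degrees."""
    med = median(segment_slopes(xs, ys))
    if med == 0:
        raise ValueError("median slope is zero; no finite aspect ratio exists")
    x_range = max(xs) - min(xs)
    y_range = max(ys) - min(ys)
    return y_range / (x_range * med)


def screen_angles(xs, ys, aspect):
    """Angle in degrees of each segment in a plot of width 1 and height `aspect`."""
    x_range = max(xs) - min(xs)
    y_range = max(ys) - min(ys)
    scale = aspect * x_range / y_range  # converts data slope to on-screen slope
    return [degrees(atan(s * scale)) for s in segment_slopes(xs, ys)]


# Hypothetical series, chosen only for illustration.
xs = [0, 1, 2, 3, 4, 5]
ys = [0, 1, 1.5, 4, 10, 11]
aspect = bank_to_45(xs, ys)
angles = screen_angles(xs, ys, aspect)
```

With the returned aspect ratio, the median of `angles` sits at 45 degrees. Steeper and shallower segments fall on either side of it, where small differences in slope remain visible.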

Slope Comparisons

Is the slope steeper for the increase or for the decrease?

Length

Of the blue ... which is the second smallest?

Position

Grey v Blue is easy & clear

Comparing Blue with Blue is harder ...

... Distance makes position accuracy worse

Position

Accurate

Inaccurate

Impressionistic

Less accurate doesn't mean bad

Use less accurate visual forms when the differences being compared are large, or there is a large structure in the data to be shown.

Sometimes you have to order your data for the structure to be apparent.

Sometimes an inaccurate comparison is useful: when the aim is simply to show that a measurement varies substantially, not to allow direct one-to-one comparisons.

Here area encodes the population of various nations of the world. The point is that populations differ greatly, and that population size doesn't affect the correlation.
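
A sketch of the arithmetic behind an area encoding, assuming circles as the mark (the slides do not specify the shape). For area to be proportional to the value, the radius must scale with the square root of the value. Scaling the radius linearly would make area grow with the square of the value and exaggerate differences. The function name and the sample values are hypothetical.

```python
from math import pi, sqrt


def radius_for_area(value, max_value, max_radius):
    """Radius of a circle whose area is proportional to `value`.

    The largest value (`max_value`) is drawn with radius `max_radius`.
    """
    if value < 0 or max_value <= 0:
        raise ValueError("value must be >= 0 and max_value must be > 0")
    return max_radius * sqrt(value / max_value)


# Hypothetical values in arbitrary units (not real population figures).
values = {"A": 100, "B": 25, "C": 1}
largest = max(values.values())
radii = {k: radius_for_area(v, largest, max_radius=10.0) for k, v in values.items()}
areas = {k: pi * r ** 2 for k, r in radii.items()}
```

Here "B" is a quarter of "A" in value, so it gets half the radius and a quarter of the area.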