How many 3s?
How many 3s?
Slide from Heer, Stasko
Even Stats Can Fail
1, 2, 3, 4, ...
DataViz works because Vision is Powerful
Numbers —> 1 Dimensional
Vision —> 5-8 Dimensions
No Formulae
No Rules
Many Requirements
Creative
Iterative
M Bostock, https://youtu.be/fThhbt23SGM
"Through the graph ... we should see something that would have been harder to see otherwise."
Use as little ink or as few pixels for every bit of data as possible.
Keep the ink/data ratio low.
"You can pay attention to only one aspect of an image at a time ... neural networks in your brain constantly compete for limited attentional resources."
Simple lines and edges are actually what your brain is looking for.
And so vision works quite well without all of the visual detail.
Which also frees up the attentional bottleneck
Different visual qualities have different accuracies.
Use the one appropriate to your data.
Position
Length
Angle
Area
Colour / Brightness
Table ... Counts too
Sometimes a visualisation doesn't do any better than a table.
Tables are the baseline for assessing the quality of a visualisation.
A B C D E F G H I J K
.... which is the FOURTH Darkest?
Colour can suffer from illusions
Which bars are the same and which are different?
A B C D E F G H I J K
.... which is the FOURTH Smallest?
Unfortunately, currently popular ... for example
Which is second smallest?
Third Smallest?
Can be relatively accurate when optimised
Differences easier to see when angle is ~45 degrees.
Slope is greater or lesser for the increase or decrease?
Of the blue ... which is the second smallest?
Grey v Blue is easy & clear
Comparing Blue with Blue is harder ...
... Distance makes position accuracy worse
Accurate
Inaccurate
Impressionistic
Less accurate doesn't mean bad
Use less accurate visual forms when the differences being compared are large, or there is a large structure in the data to be shown.
Sometimes you have order your data for the structure to be apparent
Sometimes it is useful to make inaccurate comparisons when the intention is to simply make it evident that there is substantial variation in a measurement, not to allow direct one-to-one comparisons
Here area encodes population for various nations of the world. The idea is that there are large differences in population, and that it doesn't affect the correlation.
The eye is looking for shapes and curves.
Use that.
What are the differences between these two data sets?
How predictable was the differences curve?
Often, graphing the differences is better or necessary
As, when curves are close and of similar slopes, we begin to see new shapes, not the axial distance between them.
The eye can focus on and ignore visual elements depending on shared visual qualities.
Angle
Size
Shape (many variations)
Colour / Brightness
Using different colors helps
Using different shapes helps
The crosses are the most different and pop out the most
Using different shapes + thickness/size/darkness is better
Combining differences in colour + shape + darkness + size is best
Open circles are the best ...
Their intersections are visually different from circles.
Overlapping squares unfortunately create new squares etc.
Don't be afraid to add annotations or guiding graphics.
Use popping out features to make them distinguishable from the actual dataViz.
Differences in thickness result in differences in apparent brightness. Grid lines and dataViz become easy to distinguish.
Colour is beautiful but easy to use badly.
Colour is three things ...
Hue
Saturation
Lightness / Brightness
Bottleneck issues.
Use < 9 colours.
Illogical &
uneven
Even & Linear
Even & Linear
Illogical &
uneven
Illogical &
uneven
Zero
Max
Min
... quantitative + Categorical
... make sure there is a reason or a zero point for diverging scales
Illogical &
uneven
Illogical &
uneven
Maximum Range
Illogical &
uneven
Maximum Range
&
Still Pretty!
Effective Application of these Guiding Principles
Position encodes values
Colour encodes categories
Curves display correlations
Angles display correlation size