Creative
Data visualizations
Shirley Wu
(an idealized lol)
dataviz workflow
get data → explore data → design → code/make
data:image/s3,"s3://crabby-images/72cef/72cef7f9988fcc83f7d576e907e0f7f623fc16c3" alt=""
(my actual lol)
dataviz workflow
↑
get data
↑
explore
data
↑
design
🔄
↑
code/make
data:image/s3,"s3://crabby-images/4af9e/4af9e2e6e59efa61b3f3322902a80ba65eae7b3f" alt=""
so over the years, I've refined a process and collected tools to reduce the time it takes to finish a project
but everyone's process is a little bit different. I offer my process as a starting point, but feel free to experiment and see what works for you.
let's get started →
get data
collect your own data
✨
find pre-existing data
Notebooks/phones
Excel
APIs
↓
more intimately understand dataset
get data
collect your own data
find pre-existing data
look for outliers, missing values, duplicates
↘ ↙
start with a curiosity!
↘ ↙
clean & validate your data
Excel
R, Python
Charts
ChatGPT
data:image/s3,"s3://crabby-images/5bb21/5bb21fa9e8269700e37da85156c628b831e71692" alt=""
explore data
- List data attributes
- Ask questions
- Explore the data
explore data:
data types
- Categorical (movie genres)
- Ordinal (t-shirt sizes)
- Quantitative (ratings/scores)
- Temporal (dates)
- Spatial (cities)
exercise
List all the attributes,
ask all the questions!
explore data:
some basic Chart types
Bar chart
For categorical comparisons
Domain: categorical
Range: quantitative
data:image/s3,"s3://crabby-images/aa1ce/aa1ce3d70fef1c88cfafc1c06635941eeede3903" alt=""
EXPLORE DATA:
some basic Chart types
Histogram
For categorical distributions
Domain: quantitative bins
Range: frequency of quantitative bin
data:image/s3,"s3://crabby-images/0413e/0413ed76b6565bccb5a3057f9d147281d6edb209" alt=""
EXPLORE DATA:
some basic Chart types
Scatterplot
For correlation
2 categories, and the relationship between their quantitative values
data:image/s3,"s3://crabby-images/5aa78/5aa78b9a391855075f3e26fa5796879bf894d8d0" alt=""
EXPLORE DATA:
some basic Chart types
Line chart
For temporal trends
Domain: temporal
Range: quantitative
data:image/s3,"s3://crabby-images/d440f/d440fca813ac204bde10b9469d04f0215e66c159" alt=""
EXPLORE DATA:
tools
how to choose charts:
charting tools:
Excel
R, Python
⚠️ ChatGPT
exercise
Brainstorm some charts
to answer the questions.
Quickly sketch them
and how you'd map the data.
explore data:
advice for bigger datasets
- Check for missing data, and the validity of the data
- Focus on one question at a time (it's very easy to get sidetracked with a tangent)
- If there IS an interesting tangent, make a note for later
- If the question leads to a dead-end, explore another question or the tangent you found earlier
- Don't be afraid to go out and look for additional data to aid your exploration
- Sometimes, no interesting pattern IS very interesting
data to design
- Concentrate on the takeaways to communicate across
- What does that mean in terms of the data? (Individual or aggregate elements? Which attributes?)
- Map the relevant data to visual elements
Design:
marks & channels
Map individual or
aggregate data
elements to marks.
Map data attributes
to channels.
Design:
marks
data:image/s3,"s3://crabby-images/5d35d/5d35d1189385e33a152748865234a35769a81b45" alt=""
Visualization Analysis and Design. Tamara Munzner, with illustrations by Eamonn Maguire. A K Peters Visualization Series, CRC Press, 2014.
Design:
channels
Visualization Analysis and Design. Tamara Munzner, with illustrations by
Eamonn Maguire. A K Peters Visualization Series, CRC Press, 2014.
data:image/s3,"s3://crabby-images/83039/830391d9ec3ce251bfae2966cd6872cec38adf7d" alt=""
Quantitative
- Position
- Size
- Color
Categorical
- Shape
- Texture
- Color
Temporal
- Animation
Design:
marks & channels
Visualization Analysis and Design. Tamara Munzner, with illustrations by
Eamonn Maguire. A K Peters Visualization Series, CRC Press, 2014.
data:image/s3,"s3://crabby-images/f9e6f/f9e6f4bdb98234fad2801041569ae9425708c797" alt=""
mark
bar
channels
x: category
y: quant
mark
point
channels
x: quant
y: quant
mark
point
channels
x: quant
y: quant
color: category
mark
point
channels
x: quant
y: quant
color: category
size: quant
Design:
channel effectiveness
Visualization Analysis and Design. Tamara Munzner, with illustrations by
Eamonn Maguire. A K Peters Visualization Series, CRC Press, 2014.
data:image/s3,"s3://crabby-images/a1be2/a1be250062ded1d4de4250bfcb82cdbed5f86e6d" alt=""
Design:
marks & channels
-
One-to-one mapping of data to channel
-
Multiple mappings of channel to mark (x, y, size, color usually)
-
Do not EVER map multiple data attributes to the same channel
readability
Titles, descriptions, and legends
to explain the visualization
Axes and annotations
to describe the data
Resource:
Visual metaphors
Simulated Dendrochronology of U.S. Immigration 1790-2016
United States gun death data visualization
(cw: gun deaths)
Poppy Field - Visualising War Fatalities
(cw: war, death)
exercise
Create a more refined sketch, keeping in mind marks, channels, and visual metaphors
Design:
Advice
- Always keep your audience in mind, and what you are trying to communicate
- Most of these are guidelines, not strict rules—but be intentional when you choose not to follow them
- Always test your intuition by asking for feedback
design to code
-
Break it down! What do you need to draw the marks? What do you need to calculate the channels?
-
To draw marks: SVG (or HTML5 Canvas)
-
To calculate channels: D3 scales, shapes, and layouts (or straight-up math!)
SVG Elements
rect
x: x-coordinate of top-left
y: y-coordinate of top-left
width
height
circle
cx: x-coordinate of center
cy: y-coordinate of center
r: radius
text
x: x-coordinate
y: y-coordinate
dx: x-coordinate offset
dy: y-coordinate offset
text-anchor: horizontal text alignment
Hi!
path
d: path to follow
Moveto, Lineto, Curveto, Arcto
code:
D3 api
data:image/s3,"s3://crabby-images/e0790/e0790f5bcb7bfb165aca1f6da3dbc57133292e6a" alt=""
data:image/s3,"s3://crabby-images/e0790/e0790f5bcb7bfb165aca1f6da3dbc57133292e6a" alt=""
For translating raw data to what SVG needs to draw
data:image/s3,"s3://crabby-images/e0790/e0790f5bcb7bfb165aca1f6da3dbc57133292e6a" alt=""
Take output of layout calculations and draw SVG elements
data:image/s3,"s3://crabby-images/e0790/e0790f5bcb7bfb165aca1f6da3dbc57133292e6a" alt=""
Sometimes all you need are scales to get from data to screen space
Often times, you may need specific layouts.
These output x/y positions
data:image/s3,"s3://crabby-images/e0790/e0790f5bcb7bfb165aca1f6da3dbc57133292e6a" alt=""
data:image/s3,"s3://crabby-images/e0790/e0790f5bcb7bfb165aca1f6da3dbc57133292e6a" alt=""
data:image/s3,"s3://crabby-images/0413e/0413ed76b6565bccb5a3057f9d147281d6edb209" alt=""
data:image/s3,"s3://crabby-images/7f665/7f66511cf8d598e886de08e33786840e750ba3d4" alt=""
data:image/s3,"s3://crabby-images/a15f7/a15f75376bad7e3318ee14d1fadd96b43121c386" alt=""
data:image/s3,"s3://crabby-images/e0790/e0790f5bcb7bfb165aca1f6da3dbc57133292e6a" alt=""
data:image/s3,"s3://crabby-images/8ff1e/8ff1ef6fd53beec0108f112c54707d124f7ddc0a" alt=""
data:image/s3,"s3://crabby-images/2e506/2e506e9e206732071ab1bc147c499c950a9b4bf4" alt=""
data:image/s3,"s3://crabby-images/b0849/b0849b0c51ea57b4e249adc65366f126fb701e94" alt=""
data:image/s3,"s3://crabby-images/0e635/0e6352d0b81f810bdd8ad6c0f3114d189d3359e2" alt=""
And these generate path commands
data:image/s3,"s3://crabby-images/e0790/e0790f5bcb7bfb165aca1f6da3dbc57133292e6a" alt=""
data:image/s3,"s3://crabby-images/31ae2/31ae28b798be7c6e9eb1572dcbabdf39f70c2077" alt=""
data:image/s3,"s3://crabby-images/087a7/087a7c775be4ace7ac0495ca4296ad2a18f0d4e3" alt=""
data:image/s3,"s3://crabby-images/e0790/e0790f5bcb7bfb165aca1f6da3dbc57133292e6a" alt=""
Great dataviz-specific interactions
data:image/s3,"s3://crabby-images/e0790/e0790f5bcb7bfb165aca1f6da3dbc57133292e6a" alt=""
more svg for
context & aesthetics
- Patterns
- Gradients
- Text on a path
- SVG filters
(blurs, drop-shadows) - Clipping & masking
data:image/s3,"s3://crabby-images/a208a/a208a54ad4473211f62eee2fefb8fc01dfcb52bd" alt=""
data:image/s3,"s3://crabby-images/eff7b/eff7ba3faf68ee6a29093deed65acf130f4520de" alt=""
data:image/s3,"s3://crabby-images/4cd90/4cd90cf9a41e6d586d8a320d1c3395c53ac74e94" alt=""
data:image/s3,"s3://crabby-images/08222/08222ff656c115aa88185847deeab9052e57f198" alt=""
code:
advice
code:
d3 alternatives
↰
← great for interactivity
← great for prototyping
resources
Books:
The Functional Art by Alberto Cairo
Online:
Data Visualization Society
Information is Beautiful Awards
Creative data visualizations (2024)
By Shirley Wu
Creative data visualizations (2024)
- 541