Thomas Wielfaert
KU Leuven - Quantitative Lexicology and Variational Linguistics
Digital Humanities Spring Event
29 April 2015
Why?
Increasingly bigger data sets: 3 V's
source: gtcorp.com
The fourth research paradigm (Jim Gray)
Rather than finding data to test a hypothesis, find a hypothesis that can be tested on the existing data.
Black box algorithms
Visual Analytics can reveal properties of algorithms that were not detected before.
source: visual-analytics.eu
What is a good visualisation?
What is not?
Caveats of visual perception
Hermann grid
Checker shadow illusion
Properties of good visualisations
Semiology of graphics (Bertin)
Reminder: types of variables
Ranking of perceptual tasks
Mackinlay (1986)
colorbrewer2.org
Never ever pick colors yourself!
Different kinds of data
source: D3.js gallery
Get inspired
Don't try to reinvent the wheel...
Good starting point: D3.js Gallery
So what is a good visualisation?
Interplay between several factors:
Evaluation:
Is this for me?
Empirical DH fields
How?
Use commercial software
i.e. Tableau (also free version available)
Use commercial software
Pro:
Cons:
Reuse what is freely available
Google's Magic Table (load unsafe scripts to see this)
Program it yourself
Programming languages designed for the job:
Bottom line: extremely flexible and versatile, but comes with a steep learning curve.
Middle ground
R data frame
D3.js
Google Charts
...
R libraries:
Some DH visualisations
DoubleTreeJS (Chris Culy)
Slash/A (Todorova and Chinkina)
Very basic introduction to D3.js
Today 14:45-15:45, MSI 02.18
Hands on: bring your own laptop or befriend someone with a laptop with Chrome or Firefox installed (pretty please).
Topics:
Comments? Questions?
thomas.wielfaert@kuleuven.be
References
Bertin, J., (1967). Sémiologie graphique, Mouton/Gauthier-Villars, Paris.
Collins, J.P., (2009). The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft Research, Redmond, Washington.
Lem, S., Onghena, P., Verschaffel, L., Van Dooren, W. (2013). On the misinterpretation of histograms and box plots. Educational Psychology: An International Journal of Experimental Educational Psychology, 33 (2), 155-174.
Mackinlay, J.D., (1986). Automating the Design of Graphical Presentations of Relational Information, ACM Transactions on Graphics, 5, 110-141.