what is
Data Journalism?
Kendall Taggart
kendall.taggart@buzzfeed.com
(h/t Jennifer LaFleur of The Center for Investigative Reporting)
#Nicar14 job board
Why data journalism?
- Because you can't tell important stories without these skills
EXAMPLE: ESPN examines stadium food
Finding: More than half of the concession stands or restaurants had been cited for at least one "critical" or "major" health violation at some venues.
WHy data journalism?
Because you can't tell important stories without these skills
Pattern recognition: it takes you beyond an anecdote
EXAMPLE: Associated press follows stimulus money
Combining datasets: Bridge inspections + stimulus funding
FINDING: Some of the nation's worst bridges did not get stimulus funding.
There is news in the outliers.
WHY Data Journalism?
Because you can't tell important stories without these skills
Pattern recognition: it takes you beyond an anecdote
Test assumptions
EXAMPLE: California Watch
California has the toughest building regulations.
Schools must be safe.
EXAMPLE: THE Center for investigative Reporting
Press releases disproportionately highlight Mexican citizens
4 of 5 Border Patrol drug busts involve US citizens
WHY Data Journalism?
Because you can't tell important stories without these skills
Pattern recognition: it takes you beyond an anecdote
Test assumptions
Enable people to make stories relevant to their own situation
eXAMPLE: California watch
Schools in our area with faulty construction
Developing A data Frame of mind
If something is:
- inspected
- licensed
- purchased
- enforced
...there is probably a database
DEVELOPING DATA FRAME OF MIND
If there is:
...there is probably a database
DEVELOPing a data frame of mind
Is there a sensor?
Where do you find data?
Online: easy to download or requires scraping
Sometimes you have to create your own data
Usually, it requires a request to a government agency
Requesting data
Every Friday! #FOIAFriday
Sample state and federal FOIAs are available at:
NFOIC also has a list of FOIA experts in each state available on its website.
requesting data
Dealing with pushback
Common excuses:
• It’ll be really expensive.
• It’s too complicated for you to understand.
• It’s too big a dataset.
• It contains private information.
• Privatization – a third party company controls the database.
Requesting data
Strategies:
- Know the law
- Speak nerd
- Request an itemized breakdown of costs
-
Write about it:
Tools for data analysis:
- Spreadsheets (Excel)
- Databases (Access, SQlite, MySQL)
- Statistics (Excel, R, SPSS, Pandas)
- Mapping (Fusion Tables, QGIS, ArcGIS)
- Programming (Python, Ruby)
Questions?
Kendall:
email: kendall.taggart@buzzfeed.com
phone: 646.589.8582