"Data literacy spans a broad set of technologies, skills and processes. It is the ability to read, work with, analyse and argue with data."
JSON
Tweet
Lists
RDF
Spreadsheets
Relational SQL Databases
HTML pages
Text documents
Folders
NoSQL Databases
UNIX 1969
Perl
Files and Folders
Apple Lisa 1983
Perl
1979 - VisiCalc, invented by Dan Bricklin
Fluid and adaptable
Grid-like, structured
Organised
Calculations
Aggregations, totals, averages, tax
Headers
Clean data!
Validation (Lookup)
Formulas
Pivot tables
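The spreadsheet ideas above (headers, totals, averages, tax, pivot tables) can be sketched in a few lines of plain Python. This is only an illustration: the rows, the column names (`region`, `product`, `net`) and the 25% tax rate are made up for the example, and `pivot_totals` is a hypothetical helper, not a spreadsheet function.

```python
from collections import defaultdict

# Hypothetical rows, as they might sit under a header row in a spreadsheet.
rows = [
    {"region": "North", "product": "Tea", "net": 100.0},
    {"region": "North", "product": "Coffee", "net": 50.0},
    {"region": "South", "product": "Tea", "net": 200.0},
    {"region": "South", "product": "Tea", "net": 40.0},
]

TAX_RATE = 0.25  # assumed rate, purely illustrative


def pivot_totals(rows, key, value):
    """Sum `value` for each distinct `key`, like a one-dimensional pivot table."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[key]] += row[value]
    return dict(totals)


# Aggregation: one total per region.
totals_by_region = pivot_totals(rows, "region", "net")

# Formula applied to every total, like a tax column.
tax_by_region = {region: total * TAX_RATE for region, total in totals_by_region.items()}

# Average across all rows.
mean_net = sum(row["net"] for row in rows) / len(rows)

print(totals_by_region)
print(tax_by_region)
print(mean_net)
```

Clean data matters here: one row with a missing or misspelt header would break the lookup by key.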
vs
Originally a card-based index system
Visualised and used
...becomes...
1960s -> 1970s -> 1980s - Relational Databases
Server
Schemas
Constraints
Queries
ACID transactions
DRY TABLES!
SQL in 1974
Indexing
Tables
Ids
Each row has a unique Id/key
Foreign keys
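A minimal sketch of schemas, constraints, unique ids and foreign keys, using Python's built-in `sqlite3` module with an in-memory database. The `owners` table and the `owner_id` column are assumptions made for the example (one possible way to keep tables DRY); they are not taken from the slides.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite only enforces foreign keys when this pragma is switched on.
conn.execute("PRAGMA foreign_keys = ON")

# Schema: every row has a unique id; pictures point at owners by id.
conn.executescript("""
CREATE TABLE owners (
    id   INTEGER PRIMARY KEY,
    name TEXT NOT NULL UNIQUE
);
CREATE TABLE pictures (
    id       INTEGER PRIMARY KEY,
    image    TEXT NOT NULL,
    owner_id INTEGER NOT NULL REFERENCES owners(id)
);
""")

conn.execute("INSERT INTO owners (id, name) VALUES (1, 'Tom')")
conn.execute("INSERT INTO pictures (image, owner_id) VALUES ('cat.png', 1)")

# The schema is deliberately restrictive: owner 99 does not exist,
# so the foreign key constraint should reject this row.
try:
    conn.execute("INSERT INTO pictures (image, owner_id) VALUES ('dog.png', 99)")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

picture_count = conn.execute("SELECT COUNT(*) FROM pictures").fetchone()[0]
print(rejected, picture_count)
```

The owner's name is stored once, in `owners`; every picture refers to it by id instead of repeating the text.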
SELECT image, owner, date
FROM pictures
WHERE owner = 'Tom'
ORDER BY date DESC
LIMIT 20;
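The query above can be tried out with Python's built-in `sqlite3` module. This is a sketch under assumptions: the column types, the sample rows and the ISO-style date strings are invented for the example; only the table name, column names and the query shape come from the slide.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Assumed column types; dates are stored as ISO text so they sort correctly.
conn.execute("CREATE TABLE pictures (image TEXT, owner TEXT, date TEXT)")
conn.executemany(
    "INSERT INTO pictures (image, owner, date) VALUES (?, ?, ?)",
    [
        ("beach.jpg", "Tom", "2020-01-05"),
        ("cat.jpg", "Ann", "2020-02-01"),
        ("hill.jpg", "Tom", "2020-03-09"),
    ],
)

# Same query as on the slide, with the owner passed as a parameter
# rather than pasted into the SQL string.
rows = conn.execute(
    """
    SELECT image, owner, date
    FROM pictures
    WHERE owner = ?
    ORDER BY date DESC
    LIMIT 20
    """,
    ("Tom",),
).fetchall()

for row in rows:
    print(row)
```

Only Tom's pictures come back, newest first, and never more than 20 of them.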
People often find them difficult?
The Schema is (deliberately) restrictive
Your data is very complex / unstructured
Your data WILL become more complex
Every single letter used mattered.
Better, faster, more reliable
Work with more data
Visualisation
Better tools
Online
Collaborative
They all "know about each other"
Application Programming Interface
Professional vs Beginner
You can work with Google Data Studio too (if your data is clean)
Even more visualisation tools here
What area? Swiss-army knife vs statistical vs fun vs web vs speed vs £££ vs reliability vs specialised?
Python, JavaScript
...then...
UML
Notation for databases, business processes, services etc
OpenRefine
Orange3