Building arguments on open data
philippe duchesne
phd@highlatitud.es
@pduchesne
OpenBelgium 2018
Louvain-la-Neuve
March 12th, 2018
how annotated open data fragments can rest your case
Open data annotations technologies
Intro

at the service of engaged citizens








engaged citizen
public report
What?
- Explore data portal/catalogs
- Chunk up data and reassemble into a meaningful argument
- Share that argument
- Have others derive work from your findings
Intro
Ultimately...
- goal is to model and share a thought process based on open data
Intro
- has to be
- shareable -> open data
- verifiable -> assessment of authenticity of sources
- reproducible -> standards-based, open description of process

Preamble : Data standards
Tech
as the 3rd star of open data, it should be obvious by now...
but considering the current state of open data, it's better to rub it in
- XLS sheets
- ZIP files
- a link to a webpage that gathers some data
- ....
are not open data standards!
Verifiability & reproduceability
--> capture data lineage
Tech
Towards a Mosaic standard : Metadata standards
- data provenance metadata
- origin
- authenticity
- based on Dublin Core vocabulary
- data processing modelling
- need for data process description vocabulary
- Considered ontologies: OntoDM, DMOP
qualifying annotations
--> per domain controlled vocabularies
Tech
Towards a Mosaic standard : Controlled Vocabularies
- largely based on Open Annotation Model
Tech
Towards a Mosaic standard : Annotation Model

- consolidates URL fragment syntax
- standard modelling language for visualization
Tech
Towards a Mosaic standard : Visualization model

- Vega from Interactive Data Lab
http://idl.cs.washington.edu
Intro
Towards a Mosaic standard
Open Annotation Model

Dublin Core Metadata

Vega viz grammar


+ Domain controlled vocabularies
Process ML
?
Parliament sessions
Augment transcripts with multimedia recordings
Scenarios
Natural disaster report
Scenarios
Annotate official report with supporting data
Political transparency
Commenting a budget declaration
Scenarios
Political accountability
Scenarios
Analyze political behaviour
But also...
Scenarios
- Open Science
- Data Journalism
- ...
ROI : Enriching back the data
What's next



Grounding Data
Annotations model is RDF-based and queriable in SPARQL
--> graph of all annotations constitutes grounding material
for data itself, and gives context and semantics to data
What's next

Grounding Data : integration
What's next
Now imagine all derived works enriching the original catalog ...
philippe duchesne
phd @ highlatitud.es
@ pduchesne
thank you
questions?
this demo material at
http://demo.highlatitud.es/api/mosaics/openbelgium18
.json
.jsonld
.ttl
this presentation at
https://slides.com/philippeduchesne/ob18-annotations
Building arguments on open data
By Philippe Duchesne
Building arguments on open data
- 211