Stian Soiland-Reyes
eScience lab, The University of Manchester
INDElab, University of Amsterdam
Dataverse community meeting 2021
Software Metadata and Containerization
2021-06-17
This work is licensed under a
Creative Commons Attribution 4.0 International License.
RO-Crate is method for describing a dataset as a digital object using a single linked-data metadata document
Credit: Peter Sefton
Adapted from https://arkisto-platform.github.io/standards/ro-crate/
The dataset may contain any kind of
data resource, about anything, in any format
as a file or URL
Credit: Peter Sefton
Adapted from https://arkisto-platform.github.io/standards/ro-crate/
Each resource can have a machine readable description in JSON-LD format
Credit: Peter Sefton
Adapted from https://arkisto-platform.github.io/standards/ro-crate/
A human-readable description/preview can be in an HTML file that lives alongside the metadata
Credit: Peter Sefton
Adapted from https://arkisto-platform.github.io/standards/ro-crate/
Provenance and workflow information can be included
– to assist in re-use of data and research processes
Credit: Peter Sefton
Adapted from https://arkisto-platform.github.io/standards/ro-crate/
RO-Crate Digital Objects may be packaged for distribution eg via Zip, Bagit and OCFL
– or simply be published on the Web
Credit: Peter Sefton
Adapted from https://arkisto-platform.github.io/standards/ro-crate/
Credit: Marco La Rosa, Peter Sefton
Credit: Marco La Rosa, Peter Sefton
https://arkisto-platform.github.io/tools/description/describo-online/
Credit: Tomasz Miksa et al
https://doi.org/10.4126/FRL01-006423291
Use case #1: From exemplar RO-Crate generate maDMP
Use case #2: From maDMP generate template RO-Crate
Metadata held alongside hetereogeneous data
Exchange mechanism (import/export)
Avoid vendor lock-in
Credit: Thanasis Vergoulis
Credit: Simone Leo
RO-Crate minimal provenance: Some software was used
Credit: José Mª Fernández, ELIXIR All Hands, 2021-06-11
Credit: Paolo Manghi
https://doi.org/10.5281/zenodo.4916734
Text
Credit: Oscar Corcho, Carole Goble
https://doi.org/10.5281/zenodo.4913285
Credit: Carole Goble
Dataverse Community Meeting 2021
Warning: JSON ahead
schema.org JSON-LD from DataVerse
95% RO-Crate
Where did that lovely metadata go..?
Where's the DOI?
How do we know it's from a Dataverse?
What is this dataset called?
Avoid re-filling metadata already captured
(e.g. from workflow system or Describo)
Move RO-Crate between repositories
Build RO-Crate early and incrementally -
not just at Dataverse deposit time
Self-publish RO-Crate (e.g. project website)
with Dataverse references & DOI
The RO-Crate Community is open for anyone to join us!