Challenges in data integration 

Jan Philipp Dietrich
dietrixyzch@pik-xyzpotsdam.de

How might GLASSNET be able to help?

Data Accessibility

unlicensed data of unknown structure and location

semantically interoperable open-access data

GLASSNET recommendations

Technical Accessibility

download via static URL

same URL = same data

new data = new URL
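
The "same URL = same data, new data = new URL" rule can be sketched as follows. This is a minimal illustration, not a GLASSNET specification: the URL layout, the function names, and the idea of publishing a SHA-256 checksum next to each file are all assumptions made for the example.

```python
import hashlib


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 checksum of the given bytes as a hex string."""
    return hashlib.sha256(data).hexdigest()


def is_unchanged(data: bytes, expected_sha256: str) -> bool:
    """Check that the content behind a static URL still matches the
    checksum that was published for it (same URL = same data)."""
    return sha256_hex(data) == expected_sha256


def versioned_url(base: str, dataset: str, version: str) -> str:
    """Build a URL that contains the version, so that a new release
    gets a new URL instead of overwriting the old one.
    The path layout is hypothetical."""
    return f"{base.rstrip('/')}/{dataset}/v{version}/{dataset}.csv"


# Hypothetical host and data set name, for illustration only.
url_v1 = versioned_url("https://example.org/data/", "landuse", "1.0")
url_v2 = versioned_url("https://example.org/data/", "landuse", "1.1")
```

A data user could store the checksum together with the URL and call `is_unchanged` after each download to detect silent changes.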

use of easy machine-readable formats
no PDF, XLSX, ...
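
As a rough illustration of why simple machine-readable formats help, the sketch below reads a small CSV table with nothing but the Python standard library. The column names and values are invented for the example and do not come from any real data set.

```python
import csv
import io

# Invented sample content; a real file would be downloaded from a static URL.
SAMPLE_CSV = "region,year,cropland_mha\nEUR,2020,120.5\nLAM,2020,180.0\n"


def read_rows(text: str) -> list:
    """Parse CSV text into a list of dictionaries keyed by column name."""
    reader = csv.DictReader(io.StringIO(text))
    return [dict(row) for row in reader]


rows = read_rows(SAMPLE_CSV)

# Values arrive as strings, so convert them before doing arithmetic.
total = sum(float(row["cropland_mha"]) for row in rows)
```

No format-specific tooling or manual extraction step is needed, which is the main argument against distributing data as PDF or spreadsheet files.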

License

use of standard licenses

apply a license and share it with the data

use open licenses

Source: Mimi and Eunice  | CC-BY-SA Nina Paley

GLASSNET could recommend licenses to avoid incompatibilities
(copyleft/non-copyleft)

Metadata

supply metadata,
e.g. citation information, version, ...

GLASSNET could make metadata recommendations

provide metadata in a human- and machine-readable form
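
One possible way to serve both audiences is to keep a single metadata record and render it twice: as JSON for machines and as plain text for people. The sketch below assumes a hypothetical set of fields (title, version, license, citation); it is not a GLASSNET recommendation, and all values are placeholders.

```python
import json

# Placeholder metadata record; field names and values are assumptions.
metadata = {
    "title": "Example data set",
    "version": "1.0.0",
    "license": "CC-BY-4.0",
    "citation": "Author, A. (YEAR): Example data set, version 1.0.0.",
}


def to_machine_readable(meta: dict) -> str:
    """Serialize the metadata as JSON, e.g. for a file shipped with the data."""
    return json.dumps(meta, indent=2, sort_keys=True)


def to_human_readable(meta: dict) -> str:
    """Render the same metadata as 'Key: value' lines, e.g. for a README."""
    return "\n".join(f"{key.capitalize()}: {value}" for key, value in meta.items())


json_text = to_machine_readable(metadata)
readme_text = to_human_readable(metadata)
```

Generating both forms from one record avoids the two versions drifting apart.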

Advanced steps

Share data processing workflows

shared ontologies (agree on vocabulary to describe data sets)

shared platform for data discovery (make it easier to discover what is available)

Further Reading