Data Management Design Patterns
Terrell Russell, Ph.D.
@terrellrussell
Chief Technologist, iRODS Consortium
Data Management Design Patterns
November 14-16, 2017
Supercomputing 2017
Denver, CO
Why Data Management Matters
As data matures and reaches a broader community, data management policy must also evolve to meet these additional requirements.
iRODS is
A flexible framework for the abstraction of infrastructure
iRODS as the Integration Layer
iRODS Build and Test - Today
Spring 2015 - onwards
Changes Since 2017
irods build logic moved out of ansible
workflow to test all plugins
run-script-on-vms
History
20+ year legacy
Our Membership
Community Driven
Input from the Open Source Community
All with the Expectation of Public Discourse and Disclosure
Discovered a common enabling practice...
(aka metadata)
Annotation with meaning
Annotation is both descriptive and prescriptive.
It is useful
Metadata Everywhere
With the appropriate abstractions, everything in a system can be described with metadata and therefore, all actions within a system can be driven by that metadata.
Metadata Driven Patterns:
Metadata Templates
iRODS Capabilities
From Prototype to Production
Provenance and Reporting
Data to Compute Pattern
Compute to Data Pattern
An open community-driven process
But, it also
Discovering Design Patterns
Thank you!
iRODS Consortium @iRODS
RENCI @RENCI
Booth #437
Terrell Russell, Ph.D.
@terrellrussell
Chief Technologist, iRODS Consortium