iRODS in the Cloud:
Organizational Data Management
Terrell Russell, Ph.D.
@terrellrussell
Executive Director, iRODS Consortium
November 12-17, 2023
Supercomputing 2023
Denver, CO
Our Membership
Consortium
Member
Consortium
Member
Consortium
Member
What is iRODS
Open Source
Distributed
Data Centric & Metadata Driven
History
iRODS as the Integration Layer
iRODS Core Competencies
The Data Management Model
Ingest to Institutional Repository
As data matures and reaches a broader community, data management policy must also evolve to meet these additional requirements.
Data Management
"The development, execution and supervision of plans, policies, programs, and practices that control, protect, deliver, and enhance the value of data and information assets."
Most organizations are still managing their assets with a collection of small scripts, tribal knowledge, vigilance, and hope.
Organizations, instead, need a future-proof solution to managing data and its surrounding infrastructure.
Hard Problems - Today
Data Management
Multiple pieces
Multiple meanings
Multiple goals
Data Management
Policy Enforcement - Through the Years
People with Keys + Notes/Reports
Passwords + Folders + Scripts (Maybe)
Credentials + Metadata + Automation
Data Management
Fraught with People
Data Management
These long-term management tasks are too much for a curator or librarian, and certainly too much for the scientists themselves, to handle by hand.
There must be organizational policy in place to handle the varied scenarios of data retention, data access, and data use.
There must be automation in place to provide consistency and confidence in the process.
Confidence in tools comes from open frameworks and common, observable patterns in behavior and interoperability.
Data Management
ONLY with the automation of policy can your system provide the types of guarantees that you are actually interested in
Leaving the humans in charge of policy enforcement is a mistake.
Data management
should be
data-centric and metadata driven.
Future-proof automated data management
requires
open formats and open source.
Questions?
Thank you.
Terrell Russell
@terrellrussell
iRODS Consortium