(C) 2015 THE IRODS CONSORTIUM
Part 1: iRODS Lets You Control Your Data
(C) 2015 THE IRODS CONSORTIUM
(C) 2015 THE IRODS CONSORTIUM
LETS YOU
CONTROL YOUR DATA
AND PROVE IT
(C) 2015 THE IRODS CONSORTIUM
LETS YOU
Control access to data based on any characteristic of the data, connection, user, or resource.
Prove integrity and custody of the data.
Retain, archive, and destroy data according to policy.
(C) 2015 THE IRODS CONSORTIUM
LETS YOU
Control and access data spread across storage in different sites, from different vendors.
Move huge data sets between multiple sites, quickly and verifiably.
Put the right data, in the right place, close to the right people (and out of reach of the wrong people).
(C) 2015 THE IRODS CONSORTIUM
LETS YOU
Avoid buying the same data set twice.
Eliminate manual processing steps.
Keep track of processing steps applied, from raw data to finished product.
Part 2: iRODS is Open Source Data Grid Middleware
(C) 2015 THE IRODS CONSORTIUM
Photo: "Jefferson Memorial Pillars Inside" by Belal Khan, licensed under CC BY 2.0
iRODS is open source data grid middleware for...
What Is iRODS?
Photo: "Jefferson Memorial Pillars Inside" by Belal Khan, licensed under CC BY 2.0
Cheat Sheet
iRODS is open source data grid middleware for...
sits between the file system and the application
↓
← all your storage in a single namespace
← metadata annotation
← über cron
← shared access without compromising policy
iRODS is Middleware
User Application
"Logical" Layer
Storage Environment
"Physical" Layer
storagecluster.example.org:/managed
s3.amazonaws.com:/example/bitbucket
iRODS is Middleware
iRODS Clients
Storage Resources
iRODS presents multiple separate file systems in a unified namespace.
iRODS composable resources permit seamless distribution of files.
Storage Virtualization
Data Discovery
iRODS provides a catalog, the iCAT, that links data and metadata.
Workflow Automation
iRODS lets you use any condition to trigger any action.
Workflow Automation
iRODS lets you use any condition to trigger any action. For example:
Secure Collaboration
Independently managed iRODS zones can be federated.
History
• 15+ Year Development History
• Refactoring
- Pluggability
- Binary packages
• iRODS 4.0 and 4.1
The Future
• Messaging Framework
• Pluggable Rule Engine
• Reference Implementations
• Improved Manageability
- Zone Reporting
- Configuration Management
Who Uses iRODS?
Life Sciences
• Wellcome Trust Sanger Institute (20+ PB of iRODS Data)
• iPlant Collaborative (15k+ Users)
• Lineberger Comprehensive Cancer Center
• International Neuroinformatics Coordinating Facility (INCF)
• BGI
• Pharma
Research Infrastructure
• CC-IN2P3 (6+ PB of iRODS Data)
• EUDAT
• DataVerse Integration
Social Sciences
• Odum Institute
• Utrecht University (Youth Cohort Study)
Federal Users
• National Institute of Environmental Health Sciences (NIEHS)
• NASA
Oil & Gas, Media & Entertainment
Use Case: Sanger Institute
(C) 2015 THE IRODS CONSORTIUM
(C) 2015 THE IRODS CONSORTIUM
(C) 2015 THE IRODS CONSORTIUM
(C) 2015 THE IRODS CONSORTIUM
(C) 2015 THE IRODS CONSORTIUM
attribute: library
attribute: total_reads
attribute: type
attribute: lane
attribute: is_paired_read
attribute: study_accession_number
attribute: library_id
attribute: sample_accession_number
attribute: sample_public_name
attribute: manual_qc
attribute: tag
attribute: sample_common_name
attribute: md5
attribute: tag_index
attribute: study_title
attribute: study_id
attribute: reference
attribute: sample
attribute: target
attribute: sample_id
attribute: id_run
attribute: study
attribute: alignment
(C) 2015 THE IRODS CONSORTIUM
Baton Client
Thin layer over parts of the iRODS C API
● JSON support
● Connection friendly
● Comprehensive logging
● autoconf build on Linux and OSX
Current state
● Metadata listing
● Metadata queries
● Metadata addition
(C) 2015 THE IRODS CONSORTIUM
Additional Use Cases
(C) 2015 THE IRODS CONSORTIUM
A Health Science Institute
NIEHS
(C) 2015 THE IRODS CONSORTIUM
(C) 2015 THE IRODS CONSORTIUM
Enterprise Readiness
• Modular, maintainable code
• Static analysis and continuous integration
• Sustainable funding and governance model
iRODS is free, open source software owned by a foundation called the iRODS Consortium.
Members pay an annual membership fee: 4 levels of membership.
Members have agreed upon iRODS as an area of cooperation, rather than competition.
Two monthly meetings: Technology Working Group (TWG), Planning Committee
Goal is to create a sustainable open source project.
Presently, funds a team of 10+ developers, application engineers, documentation, support staff
Sustainable Governance and Funding Model
+2
Contract Customers
(C) 2015 THE IRODS CONSORTIUM
(C) 2015 THE IRODS CONSORTIUM
(C) 2015 THE IRODS CONSORTIUM
Initial Trial
Proof of Concept
Pilot
Production
(C) 2015 THE IRODS CONSORTIUM
(C) 2015 THE IRODS CONSORTIUM
(C) 2015 THE IRODS CONSORTIUM
(C) 2015 THE IRODS CONSORTIUM
And Beyond...
(C) 2015 THE IRODS CONSORTIUM
(C) 2015 THE IRODS CONSORTIUM