Terrell Russell, Ph.D
Executive Director, iRODS Consortium
Director of Data Management, RENCI
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826819/hex_title_page_bg_2020_widescreen.jpg)
November 17-22, 2024
Supercomputing 2024
Atlanta, GA
Building an
Approachable Cost-Effective Data Management Platform
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/11640835/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/11715323/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/4258496/iRODS-Consortium-Logo-bigger.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/4258498/RENCI-Official-Logo-White.jpg)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/5427229/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
Our Membership
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/3887171/bayer_logo.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/3887181/renci_logo.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/3887184/ucl_logo.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/3887185/utrecht_logo.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/4979204/surf_logo.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/4979201/niehs_logo.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/4979200/groningen_logo.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/5469364/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/5469366/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/6694971/maastricht_logo.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/6745605/KULEUVEN_CMYK_LOGO.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/7433891/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/9117948/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/9389705/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/10441434/pasted-from-clipboard.png)
![](https://www.windsor-group.com.au/wp-content/uploads/2019/01/corporate-bbc-membership-icon.png)
Consortium
Member
![](https://www.windsor-group.com.au/wp-content/uploads/2019/01/corporate-bbc-membership-icon.png)
Consortium
Member
![](https://www.windsor-group.com.au/wp-content/uploads/2019/01/corporate-bbc-membership-icon.png)
Consortium
Member
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/10441438/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/9978094/ua_stack_rgb_4_0.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/11165354/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/11224874/graudata.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/11224875/globus.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/11640809/pasted-from-clipboard.png)
Mission
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
Long-term data management is best executed when policies are clear and infrastructure is abstracted and swappable.
iRODS has a desire to be normal and boring for the administrator and approachable and powerful for the user.
This talk will cover recent advances and interfaces which allow companies to sustain FAIR data practices, enforce consistency and reproducibility, and realize cost-savings through open source software.
Partners and Users: Past and Present
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
- Supercomputing Centers
- Physics
- Library / Archives
- Genomics
- Bio / Pharmaceutical
- Hydrology / Weather
- Medical
- Manufacturing
- Shipping / Logistics
- Automotive
Working with DDN on the AI Stack
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/11715323/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7451001/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/11715328/NVIDIA-logo-white-16x9.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/11715846/Screen_Shot_2024-11-04_at_7.35.17_PM.png)
What is iRODS
Open Source
- C++ client-server architecture
- iRODS Protocol and RPC API
- BSD-3 Licensed
Distributed
- Runs on a laptop, a cluster, on premises or geographically distributed
Data Centric & Metadata Driven
- Insulate both your users and your data from your infrastructure over time
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
iRODS as the Integration Layer
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/5340667/integration_layer_transparent.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
Why use iRODS?
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
People need a solution for:
- Managing large amounts of data across various storage technologies
- Controlling access to data
- Searching their data quickly and efficiently
- Automation
The larger the organization, the more they need software like iRODS.
Ingest to Institutional Repository
As data matures and reaches a broader community, data management policy must also evolve to meet these additional requirements.
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/5340707/data_lifecycle_transparent.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
Data Virtualization
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/2723376/data_virtualization.png)
Combine various distributed storage technologies into a Unified Namespace
- Existing file systems
- Cloud storage
- On premises object storage
- Archival storage systems
iRODS provides a logical view into the complex physical representation of your data, distributed geographically, and at scale.
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/5340850/stack_bottom.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
Data Discovery
Attach metadata to any first class entity within the iRODS Zone
- Data Objects
- Collections
- Users
- Storage Resources
- The Namespace
iRODS supports automated and user-provided metadata which makes your data and infrastructure more discoverable, operational, and valuable.
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/2723369/data_discovery.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/5340850/stack_bottom.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
Workflow Automation
Policy Enforcement Points (PEPs) are triggered by every operation within the framework
- Authentication
- Storage Access
- Database Interaction
- Network Activity
- Extensible RPC API
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/2723379/workflow_automation.png)
The iRODS rule engine framework provides the ability to capture real world policy as computer actionable rules which may allow, deny, or add context to operations within the system.
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/5340850/stack_bottom.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
Dynamic Policy Enforcement
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/4042214/iput_dyn_pep_diagram.png)
The iRODS rule may:
- restrict access
- log for audit and reporting
- provide additional context
- send a notification
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
Dynamic Policy Enforcement
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/4073850/expanded_dyn_pep_diagram.png)
A single API call expands to many plugin operations all of which may invoke policy enforcement.
Plugin Interfaces:
- Authentication
- Database
- Storage
- Network
- Rule Engine
- Microservice
- RPC API
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
Secure Collaboration
iRODS allows for collaboration across administrative boundaries after deployment
- No need for common infrastructure
- No need for shared funding
- Affords temporary collaborations
iRODS provides the ability to federate namespaces across organizations without pre-coordinated funding or effort.
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/2723383/secure_collaboration.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/5340850/stack_bottom.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
The Data Management Model
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/5342069/data_management_model_transparent.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/5342080/stack_full.jpg)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
Protocol Plumbing - Presenting iRODS as other Protocols
- WebDAV
- FUSE
- HTTP
- NFS
- SFTP
- K8s CSI
- S3
Over the last few years, the ecosystem around the iRODS server has continued to expand.
Integration with other types of systems is a valuable way to increase accessibility without teaching existing tools about the iRODS protocol or introducing new tools to users.
With some plumbing, existing tools get the benefit of visibility into an iRODS deployment.
Future
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
- Towards Cloud-Native processes and bookkeeping
- Vertical Integrations in various domains
- Timeseries Data / Statistics
- Dashboarding
- Visibility
- Costs
Thank you
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
Terrell Russell
RENCI Booth #3923
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/11640835/pasted-from-clipboard.png)
iRODS Clients
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/11471588/Screen_Shot_2024-07-14_at_9.24.40_PM.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/11471589/Screen_Shot_2024-07-14_at_9.24.53_PM.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/11471590/Screen_Shot_2024-07-14_at_9.25.10_PM.png)
iRODS S3 Functionality
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
The iRODS S3 storage resource plugin allows iRODS to use any S3-compatible storage device or service to hold iRODS Data Objects, on-premises or in the cloud.
This plugin can work as a standalone "cacheless" resource or as an archive resource under the iRODS compound resource. Either configuration provides a POSIX interface to data held on an object storage device or service.
The following S3 services and appliances (in no particular order) have been tested:
- Amazon (AWS) S3
- Fujifilm Object Archive
- MinIO S3
- Ceph S3
- Spectra Logic Vail
- Spectra Logic BlackPearl
- Google Cloud Storage (GCS)
- Wasabi S3
- Oracle OCI
- Quantum ActiveScale
- Garage S3
Storage Tiering
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/5340962/stack_capabilities.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/5341000/storage_tiering_transparent.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
Automated Ingest - Landing Zone
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/5340962/stack_capabilities.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/5341026/landing_zone_transparent.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
Automated Ingest - Filesystem Scanning
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/5340962/stack_capabilities.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
Core Competencies
Policy
Capabilities
Indexing
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/6254145/indexing_icon_transparent.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/6254222/indexing_diagram_transparent.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
Core Competencies
Policy
Capabilities
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/6257786/publication_diagram_transparent.png)
Publishing
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/6257762/publishing_icon_transparent.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
Deployment Patterns
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/5341425/stack_patterns.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/3210707/simple_hpc.png)
Data to Compute
Compute to Data
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/5341747/compute_to_data_transparent.png)
Data Transfer Nodes
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/6764698/data_transfer_node_transparent.png)
Filesystem Synchronization
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/5352701/filesystem_sync_transparent.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
Filesystem Synchronization
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/5341425/stack_patterns.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/5352701/filesystem_sync_transparent.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
Data to Compute
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/5341425/stack_patterns.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/3210707/simple_hpc.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
Compute to Data
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/5341425/stack_patterns.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/5341747/compute_to_data_transparent.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
Data Transfer Nodes
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/5341425/stack_patterns.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/320802/images/6764698/data_transfer_node_transparent.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/336929/images/7826818/hex_inner_page_bg_2020_widescreen.jpg)
SC24 - Building an Approachable Cost-Effective Data Management Platform
By iRODS Consortium
SC24 - Building an Approachable Cost-Effective Data Management Platform
- 76