Alan King, Senior Software Developer
Justin James, Senior Applications Engineer
iRODS Consortium
Advanced Training:
Getting Started
June 13-16, 2023
iRODS User Group Meeting 2023
Chapel Hill, NC
Little Slips of Paper & Dependencies
curl -LO https://github.com/irods/irods_training/raw/ugm2023/training.pem
chmod 600 training.pem
ssh -i training.pem ubuntu@#.#.#.#
Logging in to your VM:
Install iRODS Build Requirements:
sudo apt-get update
sudo apt-get -y install git g++ make python-dev help2man unixodbc libfuse-dev libcurl4-gnutls-dev libbz2-dev zlib1g-dev libpam0g-dev libssl-dev libxml2-dev libkrb5-dev unixodbc-dev libjson-perl python3-psutil python3-jsonschema super python3-exif odbc-postgresql unzip jq
Acquire the Prerequisites
Clone the training repository:
git clone https://github.com/irods/irods_training
Install and configure Postgres
ubuntu $ sudo apt-get -y install postgresql
ubuntu $ sudo su - postgres
ubuntu $ sudo pg_ctlcluster 12 main start
Prepare database system for iRODS use:
postgres $ psql
CREATE DATABASE "ICAT";
CREATE USER irods WITH PASSWORD 'testpassword';
GRANT ALL PRIVILEGES ON DATABASE "ICAT" to irods;
\q
postgres $ exit
Configure the Repository, Install, Run setup
wget -qO - https://packages.irods.org/irods-signing-key.asc | sudo apt-key add - echo "deb [arch=amd64] https://packages.irods.org/apt/ $(lsb_release -sc) main" | \ sudo tee /etc/apt/sources.list.d/renci-irods.list sudo apt-get update
sudo apt-get -y install irods-server irods-database-plugin-postgres
Install public key and add repository:
Install from repository:
Read more: https://packages.irods.org
sudo python3 /var/lib/irods/scripts/setup_irods.py < /var/lib/irods/packaging/localhost_setup_postgres.input
Run setup with provided input file:
Confirm iRODS setup
sudo su - irods
./irodsctl start ils
Switch to irods user:
Start server and make a request
iadmin set_delay_server $(hostname) ps aux | grep irodsDelayServer
Set delay server and confirm it is running:
sudo sed -i '11a export LD_PRELOAD=/usr/lib/python3.8/config-3.8-x86_64-linux-gnu/libpython3.8.so' /var/lib/irods/irodsctl
As ubuntu user, add a line to irodsctl:
logout
Switch back to ubuntu user:
Setup iRODS for Auditing
Before we continue with the training, we are going to set up some auditing in iRODS so that we can report on all of the activities in our iRODS instance.
At the end of today's training, we will revisit this and visualize what has happened in iRODS throughout the day.
The first action is to install the auditing plugin...
sudo apt-get -y install irods-rule-engine-plugin-audit-amqp
Setup the iRODS Audit Plugin
Edit /etc/irods/server_config.json
add a new stanza to the rule_engines array after irods_rule_engine_plugin-irods_rule_language-instance
add the audit namespace
"rule_engines": [
{
"instance_name": "irods_rule_engine_plugin-irods_rule_language-instance",
...
...
"shared_memory_instance": "irods_rule_language_rule_engine"
},
{
"instance_name": "irods_rule_engine_plugin-audit_amqp-instance",
"plugin_name": "irods_rule_engine_plugin-audit_amqp",
"plugin_specific_configuration" : {
"amqp_location" : "ANONYMOUS@localhost:5672",
"amqp_options" : "",
"amqp_topic" : "audit_messages",
"pep_regex_to_match" : "audit_.*"
}
},
{
"instance_name": "irods_rule_engine_plugin-cpp_default_policy-instance",
...
...
"rule_engine_namespaces": [
"",
"audit_"
],
Setup Monitoring
curl -fsSL https://download.docker.com/linux/ubuntu/gpg | sudo apt-key add -
sudo add-apt-repository "deb [arch=amd64] https://download.docker.com/linux/ubuntu $(lsb_release -cs) stable"
sudo apt-get update
sudo apt-get install -y docker-ce
sudo usermod -aG docker ${USER}
The iRODS plugin produces AMQP 1.0 messages for each dynamic policy enforcement point.
We will now setup a docker container which will accept these messages, store them in an Elastic database, and provide a visualization web tool.
First we need to install Docker.
Log out of your virtual machine and log back in to make sure your group list is updated.
Setup Monitoring
docker run -dit -p 8080:15672 -p 5672:5672 -p 80:5601 -p 9200:9200 irods/irods_audit_elk_stack
From within your virtual machine, run this pre-built ELK stack docker image:
You now have a docker container instance running within your virtual machine.
This instance is running the following:
RabbitMQ - Message broker that stores the AMQP messages
Elasticsearch - Database that stores the AMQP messages
Logstash - Reads message from RabbitMQ and writes them to Elasticsearch
Kibana - A data visualization dashboarding tool for ElasticSearch
docker ps
Confirm that the container is running:
Configure the visualization
Port 80 on your VM is now mapped to the Kibana web tool.
Let's configure Kibana.
In a web browser, navigate to http://#.#.#.#/ to open the Kibana dashboard.
Click on Management (left pane) -> Index Patterns.
Type "irods_audit" in the index pattern field and click next step.
Select "@timestamp" in the time filter field name and click create index pattern.
Configure the visualization
Now we want to create visualizations and a dashboard to view iRODS activity.
We have already created a dashboard for you to load.
Save the irods_training/advanced/example_kibana_dashboard.json to your local computer (not the VM).
You can copy/paste from your VM into a local file or clone the irods_training repository on your local machine.
Click on Management (left pane) -> Saved Objects and click Import.
Select the file you saved in #1. Confirm the changes.
Click on the Dashboard (left pane) -> and then iRODS Dashboard.
Go ahead and click on the Auto Refresh button at the top and change the refresh period to 30 seconds.
Click on the clock icon (Last 15 minutes) at the top of the screen and select "today".
Configure the visualization
You should see a dashboard that looks similar to the screenshot below.
If you have not executed a put or get to iRODS yet...
the Bytes Read Per Minute and Bytes Written Per Minute will report no data.
What to Consider in an iRODS Deployment
Things to consider
Number of users and expected simultaneous connections
Expected ingest rate
Sizes of files
many small files (more overhead per connection)
Partial read / write vs whole file usage
Replication for durability
Replication for locality of reference
Load balancing vs High Availability
iRODS will run on a laptop or a rack of servers
Upgrading Large Installations
Things to consider
Database Snapshots
Maintenance Window
Attempt a graceful grid-wide shutdown ahead of time
Test Zones - do not upgrade blindly
Conformance Tests - try your edge cases
Federated Zones - how mixed is your deployment
Questions?
Anatomy of an iRODS installation
/etc/irods/core.* - iRODS Rule Language
/etc/irods/server_config.json - primary server configuration
/etc/irods/service_account.config - service account information
/usr/bin/* - iCommands
/usr/sbin/irodsAgent
/usr/sbin/irodsPamAuthCheck
/usr/sbin/irodsDelayServer
/usr/sbin/irodsServer
/var/lib/irods - service account home directory
/var/log/irods - log files
/usr/lib/irods/plugins - plugins location
Introduction to iCommands
iRODS command line equivalent to standard Unix operations
ils
icd
ipwd
iput
iget
irepl
use -h to get help with any particular iCommand
ihelp will show all available iCommands
Questions?