Finding container images for data analysis

Johannes Köster

HPCW 2020

dataset

results

dataset

dataset

Define software stacks.

Build container images.

Use images for execution.

Issue:

Overhead, explosion of image variants.

Workarounds:

  • not using containers (🗲 reproducibility)
  • no fine-grained containers (🗲 transparency)

dataset

results

dataset

dataset

Define software stacks.

Find container images.

Use images for execution.

- python =3.8.3
- matplotlib =3.2.1
- scikit-learn =0.23.1

dataset

results

dataset

dataset

Define software stacks.

Find container images.

Use images for execution.

- python =3.8.3
- matplotlib =3.2.1
- scikit-learn =0.23.1