Container technologies such as Docker have become a crucial component of many
software industry practices especially those pertaining to reproducibility and
portability. The containerization philosophy has influenced the scientific
computing community, which has begun to adopt - and even develop - container
technologies (such as Singularity). Leveraging containers for scientific
software often poses challenges distinct from those encountered in industry,
and requires different methodologies. This is especially true for HPC. With an
increasing number of options for HPC in the cloud (including NSF-funded cloud
projects), there is strong motivation to seek solutions that provide
flexibility to develop and deploy scientific software on a variety of
computational infrastructures in a portable and reproducible way. The
flexibility offered by cloud services enables virtual HPC clusters that scale
on-demand, and the Cyberinfrastructure Resource Integration team in the XSEDE
project has developed a set of tools which provides scalable infrastructure in
the cloud. We now present a solution which uses the Nix package manager in an
MPI-capable Docker container that is converted to Singularity. It provides
consistent installations, dependencies, and environments in each image that are
reproducible and portable across scientific computing infrastructures. We
demonstrate the utility of these containers with cluster benchmark runs in a
self-scaling virtual cluster using the Slurm scheduler deployed in the
Jetstream and Aristotle Red Cloud OpenStack clouds. We conclude this technique
is useful as a template for scientific software application containers to be
used in the XSEDE compute environment, other Singularity HPC environments, and
cloud computing environments.
Description
[2006.14784] Self-Scaling Clusters and Reproducible Containers to Enable Scientific Computing
%0 Generic
%1 vaillancourt2020selfscaling
%A Vaillancourt, Peter Z.
%A Coulter, J. Eric
%A Knepper, Richard
%A Barker, Brandon
%D 2020
%K hpc nix
%T Self-Scaling Clusters and Reproducible Containers to Enable Scientific
Computing
%U http://arxiv.org/abs/2006.14784
%X Container technologies such as Docker have become a crucial component of many
software industry practices especially those pertaining to reproducibility and
portability. The containerization philosophy has influenced the scientific
computing community, which has begun to adopt - and even develop - container
technologies (such as Singularity). Leveraging containers for scientific
software often poses challenges distinct from those encountered in industry,
and requires different methodologies. This is especially true for HPC. With an
increasing number of options for HPC in the cloud (including NSF-funded cloud
projects), there is strong motivation to seek solutions that provide
flexibility to develop and deploy scientific software on a variety of
computational infrastructures in a portable and reproducible way. The
flexibility offered by cloud services enables virtual HPC clusters that scale
on-demand, and the Cyberinfrastructure Resource Integration team in the XSEDE
project has developed a set of tools which provides scalable infrastructure in
the cloud. We now present a solution which uses the Nix package manager in an
MPI-capable Docker container that is converted to Singularity. It provides
consistent installations, dependencies, and environments in each image that are
reproducible and portable across scientific computing infrastructures. We
demonstrate the utility of these containers with cluster benchmark runs in a
self-scaling virtual cluster using the Slurm scheduler deployed in the
Jetstream and Aristotle Red Cloud OpenStack clouds. We conclude this technique
is useful as a template for scientific software application containers to be
used in the XSEDE compute environment, other Singularity HPC environments, and
cloud computing environments.
@misc{vaillancourt2020selfscaling,
abstract = {Container technologies such as Docker have become a crucial component of many
software industry practices especially those pertaining to reproducibility and
portability. The containerization philosophy has influenced the scientific
computing community, which has begun to adopt - and even develop - container
technologies (such as Singularity). Leveraging containers for scientific
software often poses challenges distinct from those encountered in industry,
and requires different methodologies. This is especially true for HPC. With an
increasing number of options for HPC in the cloud (including NSF-funded cloud
projects), there is strong motivation to seek solutions that provide
flexibility to develop and deploy scientific software on a variety of
computational infrastructures in a portable and reproducible way. The
flexibility offered by cloud services enables virtual HPC clusters that scale
on-demand, and the Cyberinfrastructure Resource Integration team in the XSEDE
project has developed a set of tools which provides scalable infrastructure in
the cloud. We now present a solution which uses the Nix package manager in an
MPI-capable Docker container that is converted to Singularity. It provides
consistent installations, dependencies, and environments in each image that are
reproducible and portable across scientific computing infrastructures. We
demonstrate the utility of these containers with cluster benchmark runs in a
self-scaling virtual cluster using the Slurm scheduler deployed in the
Jetstream and Aristotle Red Cloud OpenStack clouds. We conclude this technique
is useful as a template for scientific software application containers to be
used in the XSEDE compute environment, other Singularity HPC environments, and
cloud computing environments.},
added-at = {2020-10-09T18:09:33.000+0200},
author = {Vaillancourt, Peter Z. and Coulter, J. Eric and Knepper, Richard and Barker, Brandon},
biburl = {https://www.bibsonomy.org/bibtex/2ddc786e619aa5da8ccbb141eddbdcde7/alexv},
description = {[2006.14784] Self-Scaling Clusters and Reproducible Containers to Enable Scientific Computing},
interhash = {f5acffe6f1c0dd39bc1b6f3ab247c403},
intrahash = {ddc786e619aa5da8ccbb141eddbdcde7},
keywords = {hpc nix},
note = {cite arxiv:2006.14784Comment: Accepted for publication in the IEEE conference proceedings for HPEC 2020},
timestamp = {2020-10-09T18:09:33.000+0200},
title = {Self-Scaling Clusters and Reproducible Containers to Enable Scientific
Computing},
url = {http://arxiv.org/abs/2006.14784},
year = 2020
}