ATHENA Research & Innovation CenterComputer EngineeringKirchhoff Institute for PhysicsSimulation & Data Lab NeuroscienceMar 12, 2026arXiv:2603.12044

HPC Containers for EBRAINS: Towards Portable Cross-Domain Software Environment

Krishna Kant Singh, K. Singh, Eric Müller, E. Muller, Eleni Mathioulaki, Wouter Klijn, W. Klijn, Lena Oden

AI Summary

This paper explores the design and performance of portable HPC container images, built using Apptainer and PMIx, for encapsulating MPI- and CUDA-enabled software stacks within the EBRAINS Software Distribution. The key finding is that these containers can effectively leverage host-level hardware (network interfaces, GPUs) on different HPC clusters (Karolina, Jureca-DC) without significant performance degradation compared to bare-metal deployments. The verification methodology includes communication microbenchmarks (OSU, NCCL) and analysis of debug logs to detect misconfigurations.

Key Contribution

Achieve bare-metal HPC performance with portable containers across diverse architectures, finally decoupling software from infrastructure headaches.

Abstract

Deploying complex, distributed scientific workflows across diverse HPC sites is often hindered by site-specific dependencies and complex build environments. This paper investigates the design and performance of portable HPC container images capable of encapsulating MPI- and CUDA-enabled software stacks without sacrificing bare-metal performance. This work is part of recent work performed within the EBRAINS Research Infrastructure, to evaluate the implementation of portable HPC (Apptainer-based) container images targeting the EBRAINS Software Distribution (ESD) -- a Spack-based software ecosystem comprising approximately 80 top-level packages (and 800 dependencies). We evaluate a hybrid, PMIx-based containerization strategy using Apptainer that seamlessly bypasses the need for site-specific builds by dynamically leveraging host-level specialized hardware, such as network interfaces and GPUs, on two production HPC clusters: Karolina and Jureca-DC. We demonstrate the feasibility of building portable, MPI- and CUDA-enabled scientific software into container images that correctly leverage site-installed drivers and hardware to reproduce bare-metal communication behavior. Using communication microbenchmarks (e.g., OSU and NCCL) alongside performance metrics of applications from neuroscience, we measure and verify their performance against bare-metal deployments. Crucially, our verification approach extends beyond top-level runtime measurements; we highlight the analysis of underlying debug logs to actively detect misbehavior and misconfigurations, such as suboptimal transport pathways. Ultimately, this investigation demonstrates the feasibility of a simple and reproducible methodology for decoupling software environments from underlying infrastructures, paving the way for automated pipelines that ensure optimized, performance-verified execution across varied HPC architectures.

Distributed Systems & Hardware Scientific Discovery & Drug Design

Citation Metrics

Citations0

Influential citations0

References32

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

HPC Containers for EBRAINS: Towards Portable Cross-Domain Software Environment

Related Papers