Spack
Getting started
The official Spack documentation can be found at https://spack.readthedocs.io/en/v0.21.1
In order to use Spack, you need to load the corresponding module first. The module name for each software stack that has a Spack module is listed below:
Software Stack | Module Name |
---|---|
SCC Modules (scc-lmod) | spack-user |
NHR Modules (nhr-lmod) | spack |
The HLRN Modules (hlrn-tmod) software stack doesn’t provide a Spack module.
Then, load it with the module name for your software stack:
module load spack-user # on SCC
module load spack # on NHR
Please note: some Spack commands require extra shell functionality, so you also need to source the environment setup script. The path is shown again after you load the Spack module:
source $SPACK_ROOT/share/spack/setup-env.sh
Now Spack is ready: you can use the spack load command to load software, or spack install SPEC to install it.
Loading software
You can find Spack packages that are already installed using spack find:
spack find py-numpy
You need to pick a specific one of those listed to load, e.g. by version or target:
spack load py-numpy target=sapphirerapids
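If several installed builds share the same name and version, you can disambiguate by hash: spack find -l prints the short hash of each build, which you can then load with a leading slash (the hash below is only a placeholder):
spack find -l py-numpy
spack load /abc1234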
Installing software
In most cases, it is enough to know the package name to install the software. For instance, if you want to install zlib, then you can simply run the following command:
spack install zlib
In general, you provide a spec to the spack install command, in which you can select versions, compilers, dependencies, and variants.
To learn more about Spack specification, please visit Spack’s documentation.
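As a sketch of the spec syntax (the package, versions, and variant here are only illustrative; use spack info PACKAGE to see what is actually available):
spack install hdf5@1.14.3 +mpi %gcc@11.4.0 target=cascadelake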
Hardware Architectures
Since we have multiple CPU architectures, connection fabrics, and GPU architectures on the clusters, it can pay off to optimize your software for the architecture of the nodes it will run on.
For example, if you plan on running your software on a Cascadelake node, you can accelerate it by compiling it to use Cascadelake's AVX-512 instructions.
A package is compiled for a desired CPU architecture by setting the target in the spec, like:
spack install gromacs target=cascadelake
The spack arch command will print the full CPU and OS architecture/target of the node you are on (e.g. linux-rocky8-sapphirerapids), and spack find will show you what you have built for each architecture (the architecture is in the headers).
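For example, the output of spack find might look like the following (illustrative):
==> 2 installed packages
-- linux-rocky8-sapphirerapids / gcc@11.4.0 ---------------------
py-numpy@1.24.3  zlib@1.2.13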
The architecture/target is composed of the operating system (linux), the Linux distribution, and the CPU architecture.
Note that the architecture/target does not capture other important hardware features like the fabric (mainly MPI libraries and their dependencies) and CUDA architecture.
For CUDA, the cuda_arch parameter should be set to the CUDA compute capability.
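For example, a CUDA-enabled build might combine both parameters like this (a sketch; the package, variant, and values are illustrative):
spack install gromacs +cuda cuda_arch=70 target=skylake_avx512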
Make sure to install the software separately for every architecture you want it to run on, from a node with that particular architecture. The easiest way to ensure you are on the right architecture is to start an interactive Slurm job on the same partition (and the same kind of node if a partition has mixed architectures) you want to use the software on. To learn more about Spack and how to install software, you can go through its tutorial at https://spack-tutorial.readthedocs.io/en/latest/
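For example, a minimal sketch of this workflow (the partition name is illustrative, and you may need further Slurm options such as time limits and resources):
srun --partition=standard96s --pty bash
module load spack
source $SPACK_ROOT/share/spack/setup-env.sh
spack install gromacs target=sapphirerapids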
Software installed as modules is already built for all targets separately.
The correct version is chosen automatically by module load for the node it is running on.
This also makes sure that the spack or spack-user module has the right compilers and default configuration selected for the node.
Cross-compiling packages for a different CPU architecture than that of the node Spack is running on is error-prone even when it is possible (some combinations are impossible) and should be avoided.
The one exception is compiling packages for a compatible CPU with fewer features than the CPU of the node Spack is running on (e.g. compiling for skylake_avx512 on a cascadelake node), but even this requires care.
Also, cross-Linux-distro builds (e.g. compiling for rocky8 on centos7) are outright impossible with Spack.
The nodes currently supported for Spack, together with their architectures, fabrics, and partitions, are given in the tables below, organized by cluster.
Note that to reduce the total number of separate architectures, some are grouped together and rounded down to the lowest common denominator for CPU architectures and the minimum for CUDA architecture.
For example, the lowest common denominator of the CPU architectures haswell and broadwell is haswell, and the minimum of the CUDA architectures 70 and 75 is 70.
NHR
Nodes | CPU and OS Architecture/Target | Fabric | cuda_arch | Partitions |
---|---|---|---|---|
Emmy Phase 1 | linux-centos7-skylake_avx512 | OPA | | medium40*, large40* |
Emmy Phase 2 | linux-centos7-cascadelake | OPA | | standard96*, large96*, huge96* |
Emmy Phase 3 | linux-rocky8-sapphirerapids | OPA | | medium96s*, standard96s*, large96s*, huge96s* |
Grete Phase 1 (ggpu0X) | linux-rocky8-skylake_avx512 | IB | 70 | grete* except grete |
Grete Phase 2 (ggpu1XX, ggpu2XX) | linux-rocky8-zen2 | IB | 80 | grete* |
SCC
Nodes | CPU and OS Architecture/Target | Fabric | cuda_arch | Partitions |
---|---|---|---|---|
ampXXX | linux-scientific7-cascadelake | OPA | | medium, int |
dfaXXX, dsuXXX, gwdeXXX | linux-scientific7-haswell | | | medium, fat, fat+ |
agqXXX, agtXXX | linux-scientific7-cascadelake | OPA | 70 | gpu |
dgeXXX | linux-scientific7-broadwell | | 52 | gpu, gpu-int, vis |
dbnXX | linux-rocky8-zen3 | IB | | cidbn |
hhXXX | linux-rocky8-zen2 | RoCE | | hh |
saXXX | linux-rocky8-haswell | RoCE | | sa |
sgizXXX | linux-scientific7-skylake_avx512 | IB | | int, sgiz |
The lowest common denominators by partition are:
Partition | CPU and OS Architecture/Target | Fabric | cuda_arch |
---|---|---|---|
medium | linux-scientific7-haswell | OPA or nothing | |
fat | linux-scientific7-haswell | | |
fat+ | linux-scientific7-haswell | | |
int | linux-scientific7-skylake_avx512 | OPA or IB | |
gpu | linux-scientific7-broadwell | OPA or nothing | 52 |
gpu-int | linux-scientific7-broadwell | | 52 |
vis | linux-scientific7-broadwell | | 52 |
hh | linux-rocky8-zen2 | RoCE | |
sa | linux-rocky8-haswell | RoCE | |
sgiz | linux-scientific7-skylake_avx512 | IB | |