Skip to content
Joe Wallwork edited this page Oct 19, 2023 · 9 revisions

Process for running Firedrake scripts on ARCHER2 using a singularity container, rather than installing Firedrake

Convert the Docker container

A two step procedure is required for performing the image conversion. This example uses the firedrake-vanilla container, but firedrake, firedrake-notebooks and firedrake-complex should also work correctly.

  1. Build the image in a sandbox from the Dockerhub singularity build --sandbox ./firedrake-vanilla docker://firedrakeproject/firedrake-vanilla

  2. Convert the sandboxed image to a Singularity container singularity build firedrake-vanilla.sif ./firedrake-vanilla

Running

Example jobscript for running on ARCHER2:

#!/bin/bash
#SBATCH -p standard
#SBATCH -A account
#SBATCH -J singularity
#SBATCH --nodes=1
#SBATCH --cpus-per-task=1
#SBATCH --qos=standard
#SBATCH -t 0:10:00

myScript="HPC_demo.py"

module purge
module load load-epcc-module

module load PrgEnv-gnu/8.3.3

module swap cray-mpich cray-mpich-abi/8.1.23
module swap cray-libsci cray-libsci/22.12.1.1
module load xpmem

cat <<EOF >.gitconfig
[safe]
	directory = *
EOF

export SINGULARITYENV_LD_LIBRARY_PATH="/opt/cray/pe/mpich/8.1.23/ofi/gnu/9.1/lib-abi-mpich:/opt/cray/pe/mpich/8.1.23/gtl/lib:/opt/cray/libfabric/1.12.1.2.2.0.0/lib64:/opt/cray/pe/gcc-libs:/opt/cray/pe/lib64:/opt/cray/xpmem/default/lib64:/usr/lib64/libibverbs:/usr/lib64"
    
export SINGULARITY_BIND="/opt/cray,/var/spool,/opt/cray/pe/mpich/8.1.23/ofi/gnu/9.1/lib-abi-mpich:/opt/cray/pe/mpich/8.1.23/gtl/lib,/etc/host.conf,/etc/libibverbs.d/mlx5.driver,/etc/libnl/classid,/etc/resolv.conf,/opt/cray/libfabric/1.12.1.2.2.0.0/lib64/libfabric.so.1,/opt/cray/pe/gcc-libs/libatomic.so.1,/opt/cray/pe/gcc-libs/libgcc_s.so.1,/opt/cray/pe/gcc-libs/libgfortran.so.5,/opt/cray/pe/gcc-libs/libquadmath.so.0,/opt/cray/pe/lib64/libpals.so.0,/opt/cray/pe/lib64/libpmi2.so.0,/opt/cray/pe/lib64/libpmi.so.0,/opt/cray/xpmem/default/lib64/libxpmem.so.0,/run/munge/munge.socket.2,/usr/lib64/libibverbs/libmlx5-rdmav34.so,/usr/lib64/libibverbs.so.1,/usr/lib64/libkeyutils.so.1,/usr/lib64/liblnetconfig.so.4,/usr/lib64/liblustreapi.so,/usr/lib64/libmunge.so.2,/usr/lib64/libnl-3.so.200,/usr/lib64/libnl-genl-3.so.200,/usr/lib64/libnl-route-3.so.200,/usr/lib64/librdmacm.so.1,/usr/lib64/libyaml-0.so.2"

export SINGULARITYENV_OMP_NUM_THREADS=1
export SINGULARITYENV_PYOP2_CACHE_DIR=/tmp/$USER/pyop2
export SINGULARITYENV_PYOP2_CC=/home/firedrake/firedrake/bin/mpicc
export SINGULARITYENV_PYOP2_CXX=/home/firedrake/firedrake/bin/mpicxx
export SINGULARITYENV_FIREDRAKE_TSFC_KERNEL_CACHE_DIR=/tmp/$USER/tsfc

srun --ntasks-per-node 128  \
    singularity run --bind $PWD:/home/firedrake/work --home $PWD firedrake-vanilla.sif \
        /home/firedrake/firedrake/bin/python \
            /home/firedrake/work/${myScript}

If you save this jobscript as firedrake_jobscript.slm it can be submitted to the queue by executing

sbatch firedrake_jobscript.slm

on the login node.

NOTE: Do not try and replace the Python interpreter /home/firedrake/firedrake/bin/python with the BASH interpreter /bin/bash. If you want to execute a sequence of bash commands, write a short shell script instead.

Key points to note:

  • We assume the script referenced bt the bash variable myScript is in the current directory and that directory is somewhere in the ARCHER2 /work filesystem not the /home filesystem.
  • We use cray-mpich-abi in place of cray-mpich.
  • A .gitconfig file is created marking all directories as safe for git. This is necessary since the Docker container runs as the firedrake user, but Singularity runs as the current user. Without it each rank spews many errors, sometimes crashing the interconnect. Since the $PWD is mounted as home the Singularity container sees this file as $HOME/.gitconfig.
  • export SINGULARITYENV_LD_LIBRARY_PATH and export SINGULARITY_BIND are ARCHER2 specific and essential.
  • A SINGULARITYENV_FOO environment variable sets the environment variable FOO instide the singularity container.
  • PYOP2_CACHE_DIR and FIREDRAKE_TSFC_KERNEL_CACHE_DIR deafult to the $HOME directory on the host and is automatically mounted inside the singularity container. This is an issue on ARCHER2, since $HOME is not mounted on compute nodes. To ensure the location is writable these are set to directories in /tmp. Any writable alternative location could be used (for instance $PWD).
  • PYOP2_CC and PYOP2_CXX are set to ensure the container compiler are used.
  • The argument --bind $PWD:/home/firedrake/work is necessary to mount the current directory somewhere sensible within the container so that the script on the host filesystem can be executed in the container.
  • The argument --home $PWD ensures that any files written to $HOME in the container are written to the current directory. USeful when output files are written to a relative path rather than absolute, since the default user directory inside the container is normally $HOME.

Home

Building locally
Tips

Install Frequently Asked Questions

Running on HPC

Users

Developers Notes

Minutes and agenda of Firedrake meetings


Policies and procedures

Gravity wave scaling

Merge Complex Sprint

Reading Group

Firedrake 2021 Planning Meetings
Clone this wiki locally