---
title: "Submit a parallel job"
teaching: 10
exercises: 2
---

:::::::::::::::::::::::::::::::::::::: questions

- How do you get a high performance computing cluster to run a program?

::::::::::::::::::::::::::::::::::::::::::::::::

::::::::::::::::::::::::::::::::::::: objectives

- Introduce a parallel R program
- Submit a parallel R program to a job scheduler on a cluster

::::::::::::::::::::::::::::::::::::::::::::::::

## Introduction

This episode introduces a parallel R program, `hello_balance.R`, and shows how
to submit it to a job scheduler on a cluster. As the comments in the script
explain, it combines two levels of parallelism: MPI runs several copies of the
script across the nodes of a cluster, and within each copy
`parallel::mclapply()` forks worker processes that share the cores of one node.

```r
## This script demonstrates two levels of parallelism:
## Top level: Distributed MPI runs several copies of this entire script.
##            Instances differ by their comm.rank() designation.
## Inner level: The Unix fork (copy-on-write) shared memory parallel execution
##            of mc.function(), managed by parallel::mclapply().
## Further levels are possible: multithreading in compiled code and communicator
## splitting at the distributed MPI level.

suppressMessages(library(pbdMPI))
comm.print(sessionInfo())

## get node name
host = system("hostname", intern = TRUE)

mc.function = function(x) {
  Sys.sleep(1)  # replace with your function for mclapply cores here
  Sys.getpid() # returns process id
}

## Compute how many cores per R session are on this node
local_ranks_query = "echo $OMPI_COMM_WORLD_LOCAL_SIZE"
ranks_on_my_node = as.numeric(system(local_ranks_query, intern = TRUE))
cores_on_my_node = parallel::detectCores()
cores_per_R = floor(cores_on_my_node / ranks_on_my_node)
cores_total = allreduce(cores_per_R) # adds up over ranks

## Run mclapply on allocated cores to demonstrate fork pids
my_pids = parallel::mclapply(1:cores_per_R, mc.function, mc.cores = cores_per_R)
my_pids = do.call(paste, my_pids) # combines results from mclapply
##
## The same cores are shared with OpenBLAS (see the flexiblas package)
## or with other OpenMP-enabled code outside mclapply.
## If BLAS functions are called inside mclapply, they compete for the
## same cores: avoid or manage appropriately!

## Now report what happened and where
msg = paste0("Hello World from rank ", comm.rank(), " on host ", host,
             " with ", cores_per_R, " cores allocated\n",
             "     (", ranks_on_my_node, " R sessions sharing ",
             cores_on_my_node, " cores on this host node).\n",
             "     pid: ", my_pids, "\n")
comm.cat(msg, quiet = TRUE, all.rank = TRUE)

comm.cat("Total R sessions:", comm.size(), "Total cores:", cores_total, "\n",
         quiet = TRUE)
comm.cat("\nNotes: cores on node obtained by: detectCores {parallel}\n",
         "      ranks (R sessions) per node: OMPI_COMM_WORLD_LOCAL_SIZE\n",
         "      pid to core map changes frequently during mclapply\n",
         quiet = TRUE)

finalize()
```
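
You can try this script on your own machine before moving to a cluster. A
minimal sketch, assuming R with the pbdMPI and parallel packages plus an MPI
implementation such as Open MPI are installed (the script reads the Open
MPI-specific variable `OMPI_COMM_WORLD_LOCAL_SIZE`, so Open MPI is assumed
here):

```bash
# launch 2 copies (ranks) of the script on the local machine
mpirun -np 2 Rscript hello_balance.R
```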

::::::::::::::::::::::::::::::::::::: challenge

## Submit a job on a cluster

Submit `hello_balance.R` through your cluster's job scheduler. Batch scripts
for two common schedulers, Slurm and PBS, are given in the solutions below.

:::::::::::::::::::::::: solution

## Slurm

```bash
#!/bin/bash
#SBATCH -J hello
#SBATCH -A CSC489
#SBATCH -p batch
#SBATCH --nodes=4
#SBATCH --mem=0
#SBATCH -t 00:00:10
#SBATCH -e ./hello.e
#SBATCH -o ./hello.o
#SBATCH --open-mode=truncate

## above we request 4 nodes and all of the memory on each node

## assumes this repository was cloned in your home area
cd ~/R4HPC/code_1
pwd

## modules are specific to andes.olcf.ornl.gov
module load openblas/0.3.17-omp
module load flexiblas
flexiblas add OpenBLAS $OLCF_OPENBLAS_ROOT/lib/libopenblas.so
export LD_PRELOAD=$OLCF_FLEXIBLAS_ROOT/lib64/libflexiblas.so
module load r
echo -e "loaded R with FlexiBLAS"
module list

## above supplies your R code with FlexiBLAS-OpenBLAS on Andes,
## but matrix computation is not used in the R illustration below

# An illustration of fine control of R scripts and cores on several nodes:
# this runs 4 R sessions on each of 4 nodes (for a total of 16).
#
# Each of the 16 hello_balance.R scripts will calculate how many cores are
# available per R session from environment variables and use that many
# in mclapply.
#
# NOTE: center policies may require different parameters
#
# run 4 R sessions per node
mpirun --map-by ppr:4:node Rscript hello_balance.R
```
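
After saving this script, you would submit and monitor it with the standard
Slurm commands. A minimal sketch, assuming you have an allocation on a Slurm
cluster and the script is saved as `hello_slurm.sh` (a hypothetical file
name):

```bash
# submit the batch script (hypothetical file name) to the scheduler
sbatch hello_slurm.sh

# check the state of your queued and running jobs
squeue -u $USER

# after the job finishes, inspect the output and error files
cat hello.o hello.e
```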

:::::::::::::::::::::::::::::::::

:::::::::::::::::::::::: solution

## PBS

```bash
#!/bin/bash
#PBS -N hello
#PBS -A DD-21-42
#PBS -l select=4:mpiprocs=16
#PBS -l walltime=00:00:10
#PBS -q qprod
#PBS -e hello.e
#PBS -o hello.o

cat $BASH_SOURCE
cd ~/R4HPC/code_1
pwd

## module names can vary on different platforms
module load R
echo "loaded R"

## prevent warning when fork is used with MPI
export OMPI_MCA_mpi_warn_on_fork=0
export RDMAV_FORK_SAFE=1

## fix for warnings from libfabric/1.12 on Karolina
module swap libfabric/1.12.1-GCCcore-10.3.0 libfabric/1.13.2-GCCcore-11.2.0

## run 4 R sessions per node
time mpirun --map-by ppr:4:node Rscript hello_balance.R
```
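
Submission is analogous under PBS. A minimal sketch, assuming the script is
saved as `hello_pbs.sh` (a hypothetical file name) on a PBS Pro cluster:

```bash
# submit the batch script (hypothetical file name) to the scheduler
qsub hello_pbs.sh

# check the state of your queued and running jobs
qstat -u $USER

# after the job finishes, inspect the output and error files
cat hello.o hello.e
```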

:::::::::::::::::::::::::::::::::

::::::::::::::::::::::::::::::::::::::::::::::::

::::::::::::::::::::::::::::::::::::: keypoints

- Parallel R code distributes work
- There is shared-memory and distributed-memory parallelism
- You can test parallel code on your own local machine
- There are several different job schedulers, but they share many similarities, so you can learn a new one when needed

::::::::::::::::::::::::::::::::::::::::::::::::