Applications¶

In this section, we'll talk a little bit about how to use different applications on our clusters.

A Simple Python Program¶

First, we have an example python script that we will be using just as a test problem for our sessions this week as well as in week 4.

Please use your favorite command line text editor (like vim or nano) to add this file to your cluster account and save it as example.py.

example.py
import numpy as np

A, B = np.random.rand(2, 5_000, 5_000)
C = np.matmul(A, B)

print(C.mean())

copy/paste in the terminal

To copy in a terminal, use ctrl+shift+c

To paste in a terminal, use ctrl+shift+v

vim commands

vim example.py to open this file in vim
a to enter edit mode
ctrl+shift+v to paste file contents
esc to exit insert mode
:wq to save and quit

This script creates two random matrices, of size five thousand by five thousand and multiplies them together. It then prints out the mean of the resultant matrix. It's not useful scientifically but it does take some time for the computer to do this problem, so it's helpful to use as a toy problem.

Next, let's try running it:

1 2	`$ python example.py -bash: python: command not found`

Wait, what? Why didn't that work? The system doesn't know about python yet, we haven't loaded it.

Module System¶

There are too many versions and conflicting software to have every version of every application pre-installed for all users all the time. To get around this problem, we use a module system called Lmod. This module system can load and unload programs and commands within your shell environment.

Image showing three module load commands making additional commands available

As an example, run the command module list to list all currently loaded modules:

$ module list

Currently Loaded Modules:
1) gmp/6.2.1 3) mpc/1.1.0 5) gcc/12.2.0 7) openmpi/4.1.4 9) rcac -> modtree/cpu
2) mpfr/4.0.2 4) zlib/1.2.13 6) numactl/2.0.14 8) xalt/3.0.2 (S)

Where:
S: Module is Sticky, requires --force to unload or purge

There are many different module commands that we can use to learn about what's available on the system and what our current environment is:

Command	Description
`module list`	List currently loaded modules
`module load`	Load a module by name (and version)
`module unload`	Unload an already loaded module
`module avail`	Search for currently available modules
`module spider`	Recursively search the entire module tree
`module purge`	Unload all currently loaded modules
`module reset`	Revert to the default module set
`module show`	Show the module definition
`module help`	Show help message for a module

If we want to run a python script, we'll need to load a module that provides the python command. On RCAC clusters, we provide python through the conda environment manager, which is available as a module.

$ module avail conda

---- Core Applications ----
conda/2024.09

Now let's load conda to get our python loaded in!

$ module load conda

$ which python
/apps/external/apps/conda/2024.09/bin/python

which command

which is a nice program that will tell us where the specified program is coming from. Remember that everything is a file! which tells you what file starts the program when you run a command.

Now that we have python ready and our script is written, let's run it:

1 2	`$ python example.py 2499.9118`

Numpy error

On some RCAC systems, the numpy library isn't available in the "base" conda environment, and you may need to make your own python environment, and you may see errors like this:

Traceback (most recent call last):
File "/home/username/example.py", line 1, in <module>
import numpy as np
ModuleNotFoundError: No module named 'numpy'

We're going to make our own conda environment to install numpy for ourselves. Run these three lines of code to create the environment, activate it, and then run our example:

$ conda create -y -n example_env numpy
$ conda activate example_env
(example_env) $ python example.py
2499.9118

The first line creates a new conda environment named example_env, and automatically installs the numpy package in it. The second line changes your shell behavior so that it is using your new conda environment. You can tell that the environment is activated as conda will alter your prompt to contain the environment name. If you check which python you are using after you run conda activate example, you'll see that now your using a python that's installed in your home directory!

$ conda activate example_env
(example_env) $ which python
~/.conda/envs/example/bin/python

conda environments like this are often an easy way to install packages and libraries without the need for sudo (admin) privileges.

Putting it into a Script¶

Notice that it took several shell commands to run this python program. If we don't want to type out all the commands every time we want to run, we can put them into a script! We'll talk about scripting more in week 3, but for now we can think of a script as a series of commands that we put into a file, that are all ran when we run the script. For example, if we put our commands in a file titled run_example.sh:

run_example.sh
#!/bin/bash
module load conda
conda activate example_env
python example.py

We can run the whole script on the command line:

$ bash run_example.sh
2499.9118

Remember that when we log onto the clusters, we are typically placed on the login nodes, which aren't meant for computationally expensive tasks. This is a computationally intensive task that we should instead run on the compute nodes, which we'll cover in the next section!

Next Section: Cluster Job Submission