Bioinformatics Training and Education Program (BTEP)
/
) at the top.pwd
= print working directoryls
= list contentscd
= change directorymkdir
, rmdir
= make directory; remove directoryrm
= remove filetouch
= create filenano
opens text editor
Content for this presentation is from hpc.nih.gov. For more information and more detailed training documentation, see hpc.nih.gov/training/.
Do not put data with PII (personally identifiable information), patient data for example, on Biowulf.
Helix is used for data transfers and file management on a large scale.
Transfering small files (< 1 GB) can be done from Helix, Biowulf login node, Globus, or hpc drive.
In general, Helix should be used when
scp
For more information on data transfers see hpc.nih.gov.
Connect remotely to Biowulf using a secure shell (SSH) protocol.
The login node will be used to submit jobs to run on the compute nodes that make up Biowulf.
ssh username@biowulf.nih.gov
“username” = NIH/Biowulf login username.
- If this is your first time logging into Biowulf, you will see a warning statement with a yes/no choice. Type “yes”.
- Type in your password at the prompt. NOTE: The cursor will not move as you type your password!
Login nodes should be used for submitting jobs.
Other additional uses of login nodes include:
Editing/compiling code
File management (small scale)
File transfer (small scale)
- You may request more space on /data, but this requires a legitimate justification.
Data storage on the HPC system should not be for archival purposes.
Use
checkquota
- this shows the directories for which you have write access
OR
Look on the *user dashboard -> disk storage.
*Only works on VPN
A snapshot is a view of the directory at a specific point in time.
Example of accessing a snapshot:
cd /home/$USER
ls -a
cd .snapshot
Create a directory in scratch:
mkdir /scratch/$USER
$USER
is an example of an environment variable.Environment variables contain user-specific or system-wide values that either reflect simple pieces of information (your username), or lists of useful locations on the file system. — Griffith Lab
We can display these variables using echo
.
echo $USER
echo $HOME
$PATH
is an important environment variable.echo $PATH
You will likely need to add to your $PATH
at some point in the future.
To do this use:
export PATH=$PATH:/path/to/folder
This change will not remain when you close the terminal. To permanently add a location to your path, add the above line to your bash shell configuration file, ~/.bashrc
.
Many Bioinformatics programs are available on Biowulf via modules.
To see a list of available software in modules use
module avail
module avail [appname|string|regex]
module –d
To load a module
module load appname
module load appname/version
To see loaded modules
module list
To unload modules
module unload appname
module purge #(unload all modules)
Note: you may also create and use your own modules
There is a module on Biowulf called biostars
.
This module was created by the BTEP team and is not accessible in the Biowulf module system.
To load this module use:
module use /data/classes/BTEP/apps/modules
module load biostars
See more information in the course docs, Biostars on Biowulf
slurm
sbatch
or swarm
CPUs and memory should be designated at the time of job submission.
MORE ON THIS LATER
Existing safeguards make it nearly impossible for individual Biowulf users to irreparably mess up the system for others.
WORST CASE SCENARIO - You are locked out of your account pending consultation with NIH HPC staff
- Can view disk usage and job info
- Request more disk space
- Evaluate job info for troubleshooting
Biowulf is the high performance computing cluster at NIH.
When you apply for a Biowulf account you will be issued two primary storage spaces:
/home/$User
(16 GB)/data/$USER
(100 GB).Hundreds of pre-installed bioinformatics programs are available through the module
system.
Computational tasks on Biowulf should be submitted as a job (sbatch
, swarm
) or through an interactive session (sinteractive
).
Do not run computational tasks on the login node.
Disk Storage, Checkquota, Snapshots