Course Overview
Biowulf is the Unix-based high-performance computing cluster at NIH and houses thousands of bioinformatics analysis programs. Most people are used to point-and-click operating systems such as Windows or macOS, so working in a command-line environment such as Biowulf can be intimidating. This course series will help participants overcome the fear of working on high-performance computing clusters so that they can start taking advantage of the resources available for their bioinformatics and data science needs.
Course Expectations / Learning Objectives
After this course, participants will be able to:
- Log in to Biowulf, the NIH high-performance computing cluster
- Navigate the folder and file (directory) structure on a Unix system
- Work with very large Next Generation Sequencing (NGS) files on a Unix system
- Find and load bioinformatics applications that are installed on Biowulf
- Run interactive, swarm and batch jobs on Biowulf
Course Schedule and Topical Outline
- Lesson 1 (May 16th, 2023):
  - Overview of Unix and Biowulf
  - Logging into Biowulf
- Lesson 2 (May 23rd, 2023):
  - Navigating around the Biowulf directory structure
- Lesson 3 (May 30th, 2023):
  - Working with files and directories
  - Interactive sessions
  - Exploring Next Generation Sequencing data
- Lesson 4 (June 6th, 2023):
  - Bioinformatics applications on Biowulf
  - Submitting batch jobs
  - Swarm
  - Shell script