Module 1

Week 1 Introductions and File systems

L1 Why bioinformatics?
L2 Intro to statistics?

Lesson 1

  • Intro to Course, learning objectives
  • Intro to Unix
  • Operating system
  • What is a shell (how to proceed Windows vs Mac)
  • What is conda?
  • Getting started with DNAnexus

  • Help Session: DNAnexus troubleshooting

Lesson 2: Understanding and navigating file systems

  • Navigating file systems
  • Best practices in file organization and naming conventions
  • Using Unix (12 commands and more)

Week 2

Lesson 3 Intro to Biowulf

  • What is Biowulf? Explaining compute nodes, cores, CPUs, and the idea of connecting to a remote system
  • Why work on Biowulf?
  • Available software, increased computation power, high memory jobs, big data
  • Connect to Biowulf
  • Explain Biowulf file systems (home, data, lscratch)
  • Biowulf module system
  • Safeguards, getting help
  • Help Session:

Lesson 4: Working on Biowulf

  • Slurm system: batch jobs, swarm jobs, interactive sessions
  • Introduce the parallel command
  • Retrieve data from NCBI
  • Troubleshooting jobs and job failures
  • Help session: submitting jobs on Biowulf

Week 3

Lesson 5 Review, and downloading and organizing RNA-Seq files

  • Review (30 min)
  • File compression and introduction to the data set (30 min)
  • Setting up project folders

  • Help session: Coding scavenger hunt?

Lesson 6 Pattern searching with grep and egrep

Week 4

Lesson 7 Sequencing Facilities and data types (Des)

Lesson 8 RNA-Seq from Experimental design to analysis (Peter)

Notes: Linux: how to get help (man, flags); file permissions and changing permissions; include grep and regular expressions

Week 5

Lesson 9 - quality check (FastQC)

Lesson 10 - adapter trimming (FastQC), MultiQC

Week 6

Lesson 11 - Strandedness