Outline
Course setup
-
Intro to DNAnexus and the GOLD learning system
-
Learning Objectives
-
Unix Bootcamp
-
Brief Review of R (for R scripts in later analyses)
Introduction to RNA Sequencing
-
Central Dogma of Molecular Biology
-
What is RNA Sequencing?
-
How to Design an RNA-Seq Experiment
-
Biological and Technical Replicates
Steps of an RNA-Seq Data Analysis
-
Getting the data
-
Unpacking the data
-
QA/QC with FASTQC and MULTIQC
-
Adapter trimming
-
Alignment
-
Getting a Reference Genome
-
Genome Indexing
-
Spliced reads
-
Non-spliced reads
-
No Reference Genome? Working with a Transcriptome (non-model organisms)
-
Pseudo-alignment (kallisto, salmon)
-
Quantification of Reads (gene, transcript)
File Formats for RNA-Seq
- FASTA
- FASTQ
- GTF
- SAM/BAM
RNA-Seq test dataset(s) (toy datasets, HBR/UHR with spike-in or other, GEO)
- toy dataset
- HBR/UHR with spike-in
- GEO
Introduction to Differential Expression Analysis
Computing differential expression (DESeq2, edger)
Viewing the results (heatmaps)
Meaningful results
Pathway Analysis
RNA Isoform Analysis
Course wrap-up
- RNA-Seq analysis on the NIH High-Performance Unix Cluster Biowulf
- NCI/CCR resources for RNA-Seq Analysis