Lesson 7 Practice
This session will allow users to practice assessing FASTQ quality using data from the HBR-UHR study (see https://rnabio.org/module-01-inputs/0001/05/01/RNAseq_Data/).
First step is to make sure that the participant is signed onto Biowulf. If not then do the following to sign on. Remember to replace user with the participant's own Biowulf sign on ID.
Then change into the data
directory.
The practice data is stored in the folder hbr_uhr_b4b
in the folder /data/classes/BTEP
. Copy it to the data directory.
Change into the hbr_uhr_b4b
folder in the participant's data
directory.
Request an interactive session with 12 gb of RAM or memory and 10 gb of local temporary storage space.
What are the contents in hbr_uhr_b4b
?
Solution
drwxr-x---. 2 wuz8 wuz8 4096 Dec 20 15:41 reads
drwxr-x---. 2 wuz8 wuz8 4096 Dec 20 14:37 references
Make a directory called hbr_uhr_b4b_raw_qc
inside /data/user/hbr_uhr_b4b
.
Load FASTQC.
Run FASTQC for raw FASTQ files and save the results in hbr_uhr_b4b_raw_qc
. Stay in /data/user/hbr_uhr_b4b
for this.
Change into hbr_uhr_b4b_raw_qc
.
Load and run MultiQC to combine all of the FASTQC reports in hbr_uhr_b4b_raw_qc
. Name the MultiQC results with prefix hbr_uhr_b4b_raw_qc
.
Copy the hbr_uhr_b4b_raw_qc.html
MultiQC report to local Downloads
to view the report.
Solution
Open a new Terminal (Mac) or Command Prompt (Windows). Then change into the local `Downloads` folder.Based on the MultiQC report for the raw FASTQ files, what is the next step.