Lesson 10 Practice
In this session, participants will practice generating a gene expression matrix for the HBR-UHR data.
Sign onto Biowulf and change into the /data/user/hbr_uhr_b4b
folder.
Create a new folder called hbr_uhr_expression
.
Request an interactive session with 12 gb of RAM and 10 gb of local temporary storage space.
Generate a gene expression table using featureCounts
for all of the samples in one go. Change into /data/user/hbr_uhr_b4b/hbr_uhr_hisat2
for this.
Solution
Change back to the hbr_uhr_expression
folder after the gene express matrix has been generated.
Convert the gene expression matrix hbr_uhr_gene_expression.txt
to a CSV file without a header line containing featureCounts
information as well as with only the following columns:
- Gene name
- Columns for the expression of each sample
Do this using |
(or pipe) to avoid writing intermediate files.