Steven tasked me with assembling our geoduck metagenomics HiSeqX data. The first part of the process is examining the quality of the sequencing reads, performing quality trimming, and then checking the quality of the trimmed reads. It’s also possible (likely) that I’ll need to run another round of trimming. The process is documented in the Jupyter Notebook linked below. After these reads are cleaned up, I’ll transfer them over to our HPC nodes (Mox) and try assembling them.
Jupyter Notebook (GitHub):
RESULTS
Samples required three rounds of trimming:
Initial quality/adapter trimming.
Remove the funky first 10bp from the 5’ end of each read.
Remove the funky first 10bp from the 5’ end of each read, again (possibly I misread the number of bases that needed trimming after the previous round?).
Now that the reads are cleaned, I’ll transfer the triply-trimmed data to Mox for assembly.
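For reference, the three rounds above boil down to commands along these lines. This is a sketch, not the exact invocations from the notebook: the sample filenames are placeholders, and the actual file names/paths are in the Jupyter Notebook linked above.

```shell
# Round 1: quality/adapter trimming with TrimGalore! (wraps Cutadapt),
# with FastQC run automatically on the trimmed output.
trim_galore --paired --fastqc \
  sample_R1.fastq.gz sample_R2.fastq.gz

# Rounds 2 and 3: hard-clip the first 10bp from the 5' end of each read
# (--clip_r1/--clip_r2), run on the output of the previous round.
trim_galore --paired --fastqc \
  --clip_r1 10 --clip_r2 10 \
  sample_R1_val_1.fq.gz sample_R2_val_2.fq.gz

# Aggregate all of the FastQC reports into a single MultiQC report.
multiqc .
```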
Output folder:
Initial FastQC folder:
MultiQC Report (HTML):
TrimGalore! folder (initial quality trim):
Post-trimming FastQC folder (first round):
MultiQC Report (HTML):
TrimGalore! folder (second round, 10bp trim):
Post-trimming FastQC folder (second round, 10bp trim):
MultiQC Report (HTML):
TrimGalore! folder (third round, 10bp trim):
Post-trimming FastQC folder (third round, 10bp trim):
MultiQC Report (HTML):