Yesterday, I ran FastQC/MultiQC on the Crassostrea virginica MBD BS-seq data from ZymoResearch. Steven wanted to trim it and see how things turned out.
I ran TrimGalore (using the built-in FastQC option) and followed up with MultiQC for a summary of the FastQC reports.
TrimGalore job script:
Standard error was redirected on the command line to this file:
MD5 checksums were generated on the resulting trimmed FASTQ files:
All data was copied to my folder on Owl.
Checksums for FASTQ files were verified post-data transfer.
Results:
Output folder:
FastQC output folder:
MultiQC output folder:
MultiQC HTML report:
Overall, the auto-trim didn’t alter things too much. Specifically, Steven is concerned about the variability in the first 15bp (seen in the Per Base Sequence Content section of the MultiQC output). It was reduced, but not greatly. Will perform an independent run of TrimGalore and employ a hard trim of the first 14bp of each read and see how that looks.