Earlier today, we received the additional G.gigas sequencing data from Genewiz. Wanted to run through FastQC again and get an updated report for each data set. Admittedly, it probably won’t look much different from the initial FastQC run on 20190415, due to the fact that the additional sequencing was simply appended to the previous data. Since FastQC examines a subset of the data in each file, I’d fully expect the FastQC report to look the same. However, we’ll have a greater number of sequences in each file. This should, in turn, increase the number of reads retained after quality trimming.

Ran FastQC, followed by MultiQC on my computer (swoose).

Used this bash script:

20190501_cgig_fastqc_wgbs_roberto.sh

RESULTS

Output folder:

20190501_cgig_fastqc_wgbs_roberto

MultiQC Report (HTML):

20190501_cgig_fastqc_wgbs_roberto/multiqc_report.html

As predicted, the “Per Tile Sequence Quality” test still fails, but the additional reads should help make up for the sequence loss we might experience during quality trimming: