Data Received - C.bairdi RNAseq Day9-12-26 Infected-Uninfected

Previously, we “received” this data, but it turns out it was incomplete (see 20191003).

Today, we finally received all the RNAseq data (>50M reads per samples) back from NWGC that we submitted on 20190521!

The second round of data is in addition to the data we received on 20191003. So, to simplify some of the data management and downstream processing of these files, I decided to concatenate the two sets of file. Concatenation is documented in this Jupyter Notebook (GitHub):

20191024_swoose_cbai_fastq_concatenation.ipynb

Here’s a table with the library names and the FastQ naming schemes.

NWGC Sample ID	Investigator Sample ID
~~329772~~	~~D9_infected~~
~~329773~~	~~D9_uninfected~~
329774	D12_infected
329775	D12_uninfected
329776	D26_infected
329777	D26_uninfected

The two samples with strikeouts above failed sequencing. See the previous post from 20191003 about data delivery for all the info on those two samples.

Here’s the list of FastQ files:

329774_S1_L001_R1_001.fastq.gz
329774_S1_L001_R2_001.fastq.gz
329774_S1_L002_R1_001.fastq.gz
329774_S1_L002_R2_001.fastq.gz
329775_S2_L001_R1_001.fastq.gz
329775_S2_L001_R2_001.fastq.gz
329775_S2_L002_R1_001.fastq.gz
329775_S2_L002_R2_001.fastq.gz
329776_S3_L001_R1_001.fastq.gz
329776_S3_L001_R2_001.fastq.gz
329776_S3_L002_R1_001.fastq.gz
329776_S3_L002_R2_001.fastq.gz
329777_S4_L001_R1_001.fastq.gz
329777_S4_L001_R2_001.fastq.gz
329777_S4_L002_R1_001.fastq.gz
329777_S4_L002_R2_001.fastq.gz

All files have been added to nightingales/C_bairdi:

nightingales/C_bairdi

Will update nightingales/C_bairdi/readme.txt and Nightingales Google sheet