Data Received - Ostrea lurida MBD-enriched BS-seq

Received the Olympia oyster, MBD-enriched BS-seq sequencing files (50bp, single read) from ZymoResearch (submitted 20151208). Here’s the sample list:

  • E1_hc1_2B

  • E1_hc1_4B

  • E1_hc2_15B

  • E1_hc2_17

  • E1_hc3_1

  • E1_hc3_5

  • E1_hc3_7

  • E1_hc3_10

  • E1_hc3_11

  • E1_ss2_9B

  • E1_ss2_14B

  • E1_ss2_18B

  • E1_ss3_3B

  • E1_ss3_14B

  • E1_ss3_15B

  • E1_ss3_16B

  • E1_ss3_20

  • E1_ss5_18

The 18 samples listed above had previously been MBD-enriched and then sent to ZymoResearch for bisulfite conversion, multiplex library construction, and subsequent sequencing. The library (multiplex of all samples) was sequenced in a single lane, three times. Thus, we would expect 54 FASTQ files. However, ZymoResearch was dissatisfied with the QC of the initial sequencing run (completed on 20160129), so they re-ran the samples (completed on 20160202). This created two sets of data, resulting in a total of 108 FASTQ files.

ZymoResearch data portal does not allow bulk download of files. However, I ended up using Chrono Download Manager extension for Google Chrome to allow for automated downloading of each file (per ZymoResearch recommendation).

After download, the files were moved to their permanent storage location on Owl:

The file was updated to include project/file information.

The file manipulations were performed in a Jupyter notebook (see below).

Total reads generated for this project: 1,481,836,875

Jupyter Notebook file: 20160203_Olurida_Zymo_Data_Handling.ipynb

Notebook Viewer: 20160203_Olurida_Zymo_Data_Handling.ipynb