INTRO
We received PacBio data back from Angie Shmoldt at the Univ. of Wisconsin-Milwaukee as part of Rick’s Salvelinus namaycush sequencing project.
Data was downloaded and MD5 checksums were verified.
Data was downloaded via SFTP using the program lftp, as it was the only way I could figure out how to recursively download the data from the command line (SSH access was not an option). Here’s an example:
lftp sftp://user@hostname
lftp> mirror /remote/directory /local/destination/
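The interactive session above could probably also be scripted. The following is a minimal sketch in Python, not what was actually run: it assumes lftp's `-c` option (run the given commands, then exit) and reuses the placeholder host and paths from the example. The function name `lftp_mirror_argv` is hypothetical, and the sketch assumes the paths contain no whitespace.

```python
import subprocess


def lftp_mirror_argv(url, remote_dir, local_dir):
    """Build an argument list that runs an lftp mirror non-interactively.

    Assumes the paths contain no whitespace; no quoting is applied.
    """
    # "open" connects to the server, "mirror" recursively copies remote_dir.
    commands = f"open {url}; mirror {remote_dir} {local_dir}"
    # -c tells lftp to execute the commands and then exit.
    return ["lftp", "-c", commands]


# Placeholder values taken from the interactive example above.
argv = lftp_mirror_argv("sftp://user@hostname", "/remote/directory", "/local/destination/")
print(argv)

# To actually start the transfer (requires lftp and a reachable server):
# subprocess.run(argv, check=True)
```

lftp may still prompt for a password when run this way, so this is only a starting point for automating the download.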
Data is available here:
https://owl.fish.washington.edu/nightingales/S_namaycush/LakeTrout/
Data
Barcoding information can be found in this Excel file:
Pool 1 | Sample | Adapter barcode |
---|---|---|
Lean | FA047 | bc2041 TATGATCACTGAGTAT |
Siscowet | FA097 | bc2071 CGAGTCTAGCGAGTAT |
Siscowet | FA099 | bc2072 TATCAGTAGTGAGTAT |
Pool 2 | Sample | Adapter barcode |
---|---|---|
Lean | FA094 | bc2069 TCTATGACATGAGTAT |
Lean | FA095 | bc2070 TACTGCTCACGAGTAT |
Siscowet | FA100 | bc2073 ATCACTAGTCGAGTAT |
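For downstream scripting, the two tables can be transcribed into a small lookup structure. This is only a sketch: the dictionary layout, the key name `group` (for the Lean/Siscowet column) and the helper `samples_in_pool` are my own choices, not anything supplied with the data.

```python
# Barcode assignments transcribed from the two tables above.
BARCODES = {
    "bc2041": {"pool": 1, "group": "Lean", "sample": "FA047", "sequence": "TATGATCACTGAGTAT"},
    "bc2071": {"pool": 1, "group": "Siscowet", "sample": "FA097", "sequence": "CGAGTCTAGCGAGTAT"},
    "bc2072": {"pool": 1, "group": "Siscowet", "sample": "FA099", "sequence": "TATCAGTAGTGAGTAT"},
    "bc2069": {"pool": 2, "group": "Lean", "sample": "FA094", "sequence": "TCTATGACATGAGTAT"},
    "bc2070": {"pool": 2, "group": "Lean", "sample": "FA095", "sequence": "TACTGCTCACGAGTAT"},
    "bc2073": {"pool": 2, "group": "Siscowet", "sample": "FA100", "sequence": "ATCACTAGTCGAGTAT"},
}


def samples_in_pool(pool):
    """Return the sorted sample IDs assigned to the given pool number."""
    return sorted(entry["sample"] for entry in BARCODES.values() if entry["pool"] == pool)


print(samples_in_pool(1))
print(samples_in_pool(2))
```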
There are two “pools” of data, which exist as BAM files:
Pool 1
BAM
MD5: 58f6d02a4a8e4a96a9c67cead15f2363
Report (PDF)
Pool 2
BAM
MD5: a406ece54be24ca78257109a57c367a8
Report (PDF)
Checksum verification
No MD5 checksums were provided; however, the files were hosted on a Synology server, so I was able to generate checksums using its web interface. I manually compared those with the checksums I generated for the files once they were on Owl.
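The manual comparison could also be done programmatically. Below is a minimal sketch using Python's standard `hashlib` module; it is not the procedure I actually used. The function names and the commented-out file name are hypothetical, and the expected value would be whichever checksum was generated on the source server.

```python
import hashlib


def md5_of_file(path, chunk_size=1024 * 1024):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read until an empty bytes object signals end of file.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def checksum_matches(path, expected_md5):
    """Compare a file's MD5 against an expected hex string, ignoring case and padding."""
    return md5_of_file(path) == expected_md5.strip().lower()


# Hypothetical usage; replace the file name with the real BAM path:
# checksum_matches("pool1.bam", "58f6d02a4a8e4a96a9c67cead15f2363")
```

Reading in chunks matters here because BAM files of this kind can be too large to load into memory at once.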