Well, I continue to struggle to make progress on assembling the geoduck Illumina NovaSeq data. Granted, there is a ton of data (374GB!!!!), but it’s still frustrating that we can’t get an assembly anywhere…
Here are some of the struggles so far:
Can’t run locally because:
- Ran out of hard drive space - due to hardware limitations of our Apple Xserve
- Fixed HDD space issue, but Roadrunner locks up and has to be restarted; no error message(s) in log files to help troubleshoot
Can’t run on Mox because:
- Can’t figure out how to install needed dependencies that don’t already exist on Mox. More specifically, friggin’ Boost libraries! Trying to install these properly has been an issue in the past for non-Mox computers, too. I remember a few times discussing the pain of installing Boost with Sean Bennett.
- Our Mox node can’t handle the memory requirements needed for assembly.
Can’t install one of the dependencies (SOAP error correction)
Actually, I need to try the binary version of this, instead of the source version (the source version fails at the `make` step)
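Since two of the struggles above came down to running out of disk space and missing build pieces, a quick pre-flight check before kicking off a multi-day assembly might save some pain. Here is a minimal sketch in Python, not something I have actually run on these machines. The function name `preflight`, the 2x headroom factor, and treating 374GB as 374e9 bytes are all my own assumptions for illustration; real assemblers may need far more scratch space than twice the input size.

```python
import shutil


def preflight(work_dir, input_bytes, tools, headroom=2.0):
    """Return a list of problems found; an empty list means nothing obvious is wrong.

    work_dir    -- directory where the assembler will write its output
    input_bytes -- total size of the input reads, in bytes
    tools       -- names of executables that must be on PATH
    headroom    -- assumed multiple of the input size wanted as free space
                   (a guess for illustration, not a measured requirement)
    """
    problems = []

    # Compare free space on the work directory's filesystem to what we want.
    free = shutil.disk_usage(work_dir).free
    needed = int(input_bytes * headroom)
    if free < needed:
        problems.append(
            f"{work_dir}: {free / 1e9:.0f} GB free, "
            f"want at least {needed / 1e9:.0f} GB"
        )

    # Check that each required executable can be found on PATH.
    for tool in tools:
        if shutil.which(tool) is None:
            problems.append(f"{tool}: not found on PATH")

    return problems


if __name__ == "__main__":
    # 374 GB of input reads (assumed here to mean 374e9 bytes).
    for problem in preflight(".", 374e9, ["make"]):
        print("PROBLEM:", problem)
```

This won’t catch a node locking up or an assembler blowing past available memory, but it would at least flag the disk space problem before hours of compute are wasted.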
So, next up will be trying the following two assemblers:
- JR-Assembler: Will see if SOAPec binary will work, and then run an assembly.
- AllPaths-LG: I was able to install this successfully on Mox.
We’ve also ordered some additional hard drives and will be converting the old head/master node on the Apple Xserve cluster to Linux. The old master node is a little better equipped than the other Apple Xserve “birds”, so I’ll try to re-run Meraculous on it once we get it converted.