I assembled the PacBio HiFi reads for the C. rogercresseyi Hembra sample using hifiasm on Hyak using our Apptainer image: srlab-R4.4-bioinformatics-container-c3d3116.sif. Assembly was insanely fast; ~1.5hrs.
Once the other sea lice assembly has completed (Macho), I’ll compare the two assemblies using QUAST.
# Make output directory, if it doesn't existmkdir--parents"${output_dir}"cd"${output_dir}"# Create an array of .fastq.gz filesfastqs=(${data_dir}/*.fastq.gz)# Print the result (optional, for verification)## newline-delimited formatecho"List of FastQs used for assembly:"printf"%s\n""${fastqs[@]}"\|tee input_fastqs.txt# Run hifiasm assembly"${hifiasm}"\-o "${hifiasm_asm}"\-t "${threads}"\"${fastqs[@]}"\>"${hifiasm_assembly_log}"2>&1
3 GFATOOLS
3.1 Primary contigs to FastA
This will extract a FastA file from the graph files output by hifiasm.