I assembled the PacBio HiFi reads for the C. rogercresseyi Macho sample using hifiasm on Hyak using our Apptainer image: srlab-R4.4-bioinformatics-container-c3d3116.sif. Assembly took ~19hrs, which was significantly longer then the Hembra assembly (1.5hrs) (Notebook entry).
Now that both assemblies are complete, I’ll compare the two assemblies using QUAST.
# Make output directory, if it doesn't existmkdir--parents"${output_dir}"cd"${output_dir}"# Create an array of .fastq.gz filesfastqs=(${data_dir}/*.fastq.gz)# Print the result (optional, for verification)## newline-delimited formatecho"List of FastQs used for assembly:"printf"%s\n""${fastqs[@]}"\|tee input_fastqs.txt# Run hifiasm assembly"${hifiasm}"\-o "${hifiasm_asm}"\-t "${threads}"\"${fastqs[@]}"\>"${hifiasm_assembly_log}"2>&1
3 GFATOOLS
3.1 Primary contigs to FastA
This will extract a FastA file from the graph files output by hifiasm.