Steven asked that I split up a Crassostrea virginica VCF file:
Overview of process:
Downloaded file.
Gunzipped file.
Sorted and bgzipped file.
Indexed sorted file with tabix.
Filled AN/AC values with the bcftools AN/AC fill plugin.
Used BCFtools to split the sorted/filled VCF into individual VCF files.
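For readers who want a feel for the steps above without opening the notebook, here is a rough sketch of how they could be strung together. It is not the notebook's actual code. The file names (cvirginica.vcf.gz, split_vcfs) are made up, the sort and bgzip step is assumed to be a single bcftools sort call, and the final split is assumed to be per sample using the bcftools +split plugin; the notebook may well have done these differently.

```python
import subprocess
from pathlib import Path


def build_commands(raw_gz: str, out_dir: str) -> list[list[str]]:
    """Return one command line per step, in order, without running anything.

    All file names are hypothetical placeholders derived from raw_gz.
    """
    raw_vcf = raw_gz.removesuffix(".gz")   # name gunzip leaves behind
    stem = raw_vcf.removesuffix(".vcf")
    sorted_gz = f"{stem}.sorted.vcf.gz"
    filled_gz = f"{stem}.sorted.filled.vcf.gz"
    return [
        # Decompress the downloaded file.
        ["gunzip", raw_gz],
        # Sort and write bgzip-compressed output (assumed to be one step).
        ["bcftools", "sort", "-O", "z", "-o", sorted_gz, raw_vcf],
        # Index the sorted file.
        ["tabix", "-p", "vcf", sorted_gz],
        # Fill the AN/AC INFO fields with the fill-AN-AC plugin.
        ["bcftools", "+fill-AN-AC", "-O", "z", "-o", filled_gz, sorted_gz],
        # Split into individual VCFs (assumption: one file per sample).
        ["bcftools", "+split", "-O", "z", "-o", out_dir, filled_gz],
    ]


def run_pipeline(raw_gz: str, out_dir: str) -> None:
    """Run each step in order, stopping at the first failure."""
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    for cmd in build_commands(raw_gz, out_dir):
        print("Running:", " ".join(cmd))
        subprocess.run(cmd, check=True)
```

Calling run_pipeline("cvirginica.vcf.gz", "split_vcfs") would execute the five steps, assuming bcftools (with its plugins) and tabix are on the PATH. Keeping the command list separate from the runner makes it easy to print and check the commands before committing to a multi-day run.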
The entire process is documented in the Jupyter Notebook linked below.
Jupyter Notebook (GitHub):
RESULTS
Output folder:
Will let Steven know this is complete and close out GitHub issue.
Wow! This took a very long time to run; slightly more than four days for the entire process!