./biology/vcf-split, Split a multi-sample VCF into single-sample VCFs

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: pkgsrc-2021Q2, Version: 0.1.2, Package name: vcf-split-0.1.2, Maintainer: bacon

Vcf-split splits a multi-sample VCF into single-sample VCFs, writing thousands
of output files simultaneously. Parsing the TOPMed human chromosome 1 BCF
with bcftools takes two days, so extracting the 137,977 samples one at a time
or using thousands of parallel readers of the same file is impractical.
Vcf-split solves this by generating thousands of single-sample outputs during
a single sweep through the multi-sample input.


Master sites:

SHA1: 6ff5bd12fcbaea6fa6a0afd78cd2978cd5c25beb
RMD160: 9cf2087943e5869ed6a293c7e420d1f0ed8a85b4
Filesize: 15.322 KB

Version history: (Expand)