Path to this page:
./
biology/vcf-split,
Split a multi-sample VCF into single-sample VCFs
Branch: pkgsrc-2021Q2,
Version: 0.1.2,
Package name: vcf-split-0.1.2,
Maintainer: baconVcf-split splits a multi-sample VCF into single-sample VCFs, writing thousands
of output files simultaneously. Parsing the TOPMed human chromosome 1 BCF
with bcftools takes two days, so extracting the 137,977 samples one at a time
or using thousands of parallel readers of the same file is impractical.
Vcf-split solves this by generating thousands of single-sample outputs during
a single sweep through the multi-sample input.
Master sites:
SHA1: 6ff5bd12fcbaea6fa6a0afd78cd2978cd5c25beb
RMD160: 9cf2087943e5869ed6a293c7e420d1f0ed8a85b4
Filesize: 15.322 KB
Version history: (Expand)
- (2021-07-01) Package added to pkgsrc.se, version vcf-split-0.1.2 (created)