./biology/vcf-split, Split a multi-sample VCF into single-sample VCFs

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: CURRENT, Version: 0.1.1, Package name: vcf-split-0.1.1, Maintainer: bacon

Vcf-split splits a multi-sample VCF into single-sample VCFs, writing thousands
of output files simultaneously. Parsing the TOPMed human chromosome 1 BCF
with bcftools takes two days, so extracting the 137,977 samples one at a time
or using thousands of parallel readers of the same file is impractical.
Vcf-split solves this by generating thousands of single-sample outputs during
a single sweep through the multi-sample input.


Master sites:

SHA1: 550fefb4c07d4632405e94127a19e98031ac0067
RMD160: 76a1b0b5a8934949d39e69ebe719c8f0ba247a13
Filesize: 13.893 KB

Version history: (Expand)


CVS history: (Expand)


   2021-03-24 16:22:29 by Jason Bacon | Files touched by this commit (4)
Log message:
biology/vcf-split: import vcf-split-0.1.1

Vcf-split splits a multi-sample VCF into single-sample VCFs, writing thousands
of output files simultaneously.  Parsing the TOPMed human chromosome 1 BCF
with bcftools takes two days, so extracting the 137,977 samples one at a time
or using thousands of parallel readers of the same file is impractical.
Vcf-split solves this by generating thousands of single-sample outputs during
a single sweep through the multi-sample input.