plink2 merge

24 views
Skip to first unread message

Michael Harris

unread,
Apr 17, 2026, 9:42:57 AM (9 days ago) Apr 17
to plink2-users
Hi Chis,

Do you have any plans to implement merge with non-overlapping samples?  Bcftools merge can be quite slow when merging large numbers of individual VCF files.  

We are sequencing bacterial genomes and I was hoping to use plink 2 to merge around 30,000 individual VCF files into one merged multi-sample VCF.   

I would appreciate any advice that you have.  Plink2 performance and functionality  has been amazing for analyzing our samples. Thanks for providing this fantastic tool to the community!

Michael 

Christopher Chang

unread,
Apr 20, 2026, 2:25:54 AM (6 days ago) Apr 20
to plink2-users
If the non-overlapping samples have exactly the same variants in the same order, and they're all biallelic, you can use the script I posted at https://groups.google.com/g/plink2-users/c/IdG7d4oSehI/m/9TQhKVDxBQAJ .

Michael

unread,
Apr 22, 2026, 9:18:42 AM (4 days ago) Apr 22
to plink2-users
Thanks this is helpful!  Do you think this strategy could be used on a compute cluster with multiple nodes doing part parts of the merge with subsets of the samples  and then having a single node do the final merge of the smaller merged files ?

Thanks!

Michael 
Reply all
Reply to author
Forward
0 new messages