The fast way to remove all the multiallelic SNPs in plink2

2,027 views
Skip to first unread message

Shicheng Guo

unread,
Mar 13, 2020, 12:01:13 PM3/13/20
to plink2-users
Hi All, 

Now, plink2 has the ability to save all the multiallelic SNPs. Please share the fast way to remove all the multiallelic SNPs in plink2. 

Thanks. 

Shicheng

Christopher Chang

unread,
Mar 13, 2020, 12:19:28 PM3/13/20
to plink2-users
plink2 --pfile ... --max-alleles 2 --make-pgen --out ...

takiy berrandou

unread,
Mar 15, 2020, 1:51:05 PM3/15/20
to Christopher Chang, plink2-users
Dear all,
I think this work only if ALT column contain 2 or more alleles. However, in most VCFs and Pfile (eg, 1000G in ressource ) the multiallelic variants are stored each one in different row with one REF and one ALT. So, in this case what would be the solution with Plink2 only? 
Previously, in case of VCF I used BCFtools with norm module to collapse multiallelic from multi rows to one row then i use plink2 --vcf ... --max-alleles 2 --make-pgen --out ... cause it is realy fast
Best regards

--
You received this message because you are subscribed to the Google Groups "plink2-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to plink2-users...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/plink2-users/b925dd4a-00b9-489d-9ef6-1a2b8698ef22%40googlegroups.com.

Christopher Chang

unread,
Mar 15, 2020, 1:57:41 PM3/15/20
to plink2-users
plink2 will have a function to join this type of multiallelic variant back together soon.  For now, you'd use "--export bcf", use bcftools norm to do the job, and then --bcf to retrieve the results.

On Sunday, March 15, 2020 at 10:51:05 AM UTC-7, takiy berrandou wrote:
Dear all,
I think this work only if ALT column contain 2 or more alleles. However, in most VCFs and Pfile (eg, 1000G in ressource ) the multiallelic variants are stored each one in different row with one REF and one ALT. So, in this case what would be the solution with Plink2 only? 
Previously, in case of VCF I used BCFtools with norm module to collapse multiallelic from multi rows to one row then i use plink2 --vcf ... --max-alleles 2 --make-pgen --out ... cause it is realy fast
Best regards

Le ven. 13 mars 2020 à 17:19, Christopher Chang a écrit :
plink2 --pfile ... --max-alleles 2 --make-pgen --out ...

On Friday, March 13, 2020 at 9:01:13 AM UTC-7, Shicheng Guo wrote:
Hi All, 

Now, plink2 has the ability to save all the multiallelic SNPs. Please share the fast way to remove all the multiallelic SNPs in plink2. 

Thanks. 

Shicheng

--
You received this message because you are subscribed to the Google Groups "plink2-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to plink2-users+unsubscribe@googlegroups.com.

takiy berrandou

unread,
Mar 15, 2020, 2:47:45 PM3/15/20
to Christopher Chang, plink2-users
Great news, thank you.
Takiy

Le dim. 15 mars 2020 à 18:57, Christopher Chang <chrch...@gmail.com> a écrit :
plink2 will have a function to join this type of multiallelic variant back together soon.  For now, you'd use "--export bcf", use bcftools norm to do the job, and then --bcf to retrieve the results.

On Sunday, March 15, 2020 at 10:51:05 AM UTC-7, takiy berrandou wrote:
Dear all,
I think this work only if ALT column contain 2 or more alleles. However, in most VCFs and Pfile (eg, 1000G in ressource ) the multiallelic variants are stored each one in different row with one REF and one ALT. So, in this case what would be the solution with Plink2 only? 
Previously, in case of VCF I used BCFtools with norm module to collapse multiallelic from multi rows to one row then i use plink2 --vcf ... --max-alleles 2 --make-pgen --out ... cause it is realy fast
Best regards

Le ven. 13 mars 2020 à 17:19, Christopher Chang a écrit :
plink2 --pfile ... --max-alleles 2 --make-pgen --out ...

On Friday, March 13, 2020 at 9:01:13 AM UTC-7, Shicheng Guo wrote:
Hi All, 

Now, plink2 has the ability to save all the multiallelic SNPs. Please share the fast way to remove all the multiallelic SNPs in plink2. 

Thanks. 

Shicheng

--
You received this message because you are subscribed to the Google Groups "plink2-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to plink2-users...@googlegroups.com.

--
You received this message because you are subscribed to the Google Groups "plink2-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to plink2-users...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/plink2-users/92939288-5c3c-49a0-9a39-105ddcd48e17%40googlegroups.com.
Reply all
Reply to author
Forward
0 new messages