Randomly mask n% of the dataset

Anna Dela Cruz

Dec 23, 2025, 8:12:24 AM
to plink2-users
Greetings!

I am very new to bioinformatics and am trying to learn how to make new datasets from existing ones. I was wondering whether PLINK 1.9 or 2.0 can randomly mask n% of my dataset. I was reading about --simulate-missing <missing geno freq> but am not sure whether I can apply it to a specific dataset.

Would really appreciate the help :)

Thank you very much!

Christopher Chang

Jan 10, 2026, 5:26:18 PM
to plink2-users
There is currently no PLINK command to directly set a given fraction of genotype calls to missing. However, --zero-cluster can be used for masking if you provide a list of (variant, sample-group) pairs.
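
One way to build such a list is sketched below in Python. This is only an illustration under stated assumptions, not an official PLINK workflow. It assumes that each sample is placed in its own cluster, so that a (variant, cluster) pair corresponds to a single genotype call. It also assumes that the --within file has three columns (FID, IID, cluster name) and that the --zero-cluster file has two (variant ID, cluster name). The function name make_mask_files and the cluster labels C0, C1, ... are hypothetical.

```python
import random


def make_mask_files(variant_ids, samples, fraction, seed=0):
    """Pick a random fraction of (variant, sample) genotype calls to mask.

    variant_ids: list of variant IDs (e.g. column 2 of a .bim file).
    samples:     list of (FID, IID) tuples (e.g. columns 1-2 of a .fam file).
    fraction:    fraction of all genotype calls to mask, between 0 and 1.

    Returns (within_lines, zero_lines): the lines of a cluster-definition
    file and of a (variant, cluster) list, under the assumed formats.
    """
    rng = random.Random(seed)  # fixed seed so the mask is reproducible
    n_samples = len(samples)
    n_total = len(variant_ids) * n_samples
    n_mask = round(fraction * n_total)

    # Draw distinct flat indices over the variant-by-sample grid.
    picked = rng.sample(range(n_total), n_mask)

    # Hypothetical labels: one single-sample cluster per sample.
    clusters = [f"C{i}" for i in range(n_samples)]
    within_lines = [
        f"{fid}\t{iid}\t{clusters[i]}" for i, (fid, iid) in enumerate(samples)
    ]

    zero_lines = []
    for k in sorted(picked):
        v, s = divmod(k, n_samples)  # flat index -> (variant row, sample column)
        zero_lines.append(f"{variant_ids[v]}\t{clusters[s]}")
    return within_lines, zero_lines
```

Writing the two lists to files (say mask.within and mask.zero, both placeholder names) and passing them to PLINK 1.9 via --within and --zero-cluster together with --make-bed should then produce a masked copy of the dataset. Please check the PLINK documentation for the exact file formats before relying on this. For a very large dataset the list of pairs can get long, so masking per variant in batches may be more practical.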