Hi, I am hoping someone can help me with this.
I am trying to take a vcf file and remove loci that fail a HW test with p <= 0.001 and remove loci missing >5% genotypes.
However, while I can do that, it appears that Plink is renaming my sample IDs. For example, if the original ID is "1234" the new id is "1234_1234".
The code I am writing is:
plink --vcf file_input --make-bed --hwe 0.001 --geno 0.05 --out out_file
Any help would be much appreciated!