So I created a frqx file from a 80 sample Joint Called WGS VCF, then reran plink --check-sex with the --freq parameter. At first it complained about duplicate IDs (.), so I removed all rows from the VCF with . as its ID, then reran.
./plink --bfile A --check-sex ycount --out A --allow-extra-chr --read-freq warehouse_1.0.frqx
(C) 2005-2015 Shaun Purcell, Christopher Chang GNU General Public License v3
Logging to A.log.
Options in effect:
--allow-extra-chr
--bfile A
--check-sex ycount
--out A
--read-freq warehouse_1.0.filtered.frqx
7450 MB RAM detected; reserving 3725 MB for main workspace.
4461279 variants loaded from .bim file.
1 person (0 males, 0 females, 1 ambiguous) loaded from .fam.
Ambiguous sex ID written to A.nosex .
Using 1 thread (no multithreaded calculations invoked.
Before main variant filters, 1 founder and 0 nonfounders present.
Calculating allele frequencies... done.
Warning: Nonmissing nonmale Y chromosome genotype(s) present; many commands
treat these as missing.
Total genotyping rate is 0.993504.
Error: Allele(s) on line 31 of --read-freq file don't match loaded
values.
Do I need to subset the VCF to only SNP IDs found in the frqx and ensure they are in the same order?
Thanks!
Aaron