Dear Chris:
I am working on the UK Biobank genotyped dataset, which is based on GRCh37 coordinates.
Please see the screenshot below. There are a total of 39,431 SNPs after I ran plink2 --pfile chrXY --filter-males --mind 0.02 --geno 0.02.
The 36,581st SNP is the break point between part 1 and part 2; it is the 3rd row of the following screenshot.
![屏幕截图 2024-07-05 064705.png](https://groups.google.com/group/plink2-users/attach/11c6494287f14/%E5%B1%8F%E5%B9%95%E6%88%AA%E5%9B%BE%202024-07-05%20064705.png?part=0.1&view=1)
Based on your explanation, it seems that these 39,431 SNPs all belong to the pseudoautosomal region, so there are no real chrY SNPs in this dataset.
If so, I cannot use this extracted dataset to run a Y-chromosome-based phylogenetic analysis, correct?
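
For reference, here is a minimal sketch of how the part 1 / part 2 split could be checked against the GRCh37 pseudoautosomal boundaries. The two boundary values are the ones plink's `--split-par b37` shorthand stands for. The function names and example positions are hypothetical, and I am assuming the chrXY positions are given in chrX coordinates.

```python
# GRCh37 pseudoautosomal boundaries on chrX, as used by plink's
# "--split-par b37" shorthand (equivalent to "--split-par 2699520 154931044").
PAR1_END = 2_699_520      # last base of PAR1
PAR2_START = 154_931_044  # first base of PAR2


def classify_par(pos):
    """Label a chrX-coordinate position as PAR1, PAR2, or non-PAR."""
    if pos <= PAR1_END:
        return "PAR1"
    if pos >= PAR2_START:
        return "PAR2"
    return "non-PAR"


def count_regions(positions):
    """Count how many positions fall in each region."""
    counts = {"PAR1": 0, "PAR2": 0, "non-PAR": 0}
    for pos in positions:
        counts[classify_par(pos)] += 1
    return counts


# Hypothetical positions for illustration only, not taken from the dataset.
example_positions = [200_000, 2_699_520, 2_699_521, 154_931_044, 155_200_000]
print(count_regions(example_positions))
```

If every SNP in the extracted file lands in PAR1 or PAR2, that would be consistent with the file containing no male-specific chrY variants.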
Thank you very much & best regards,
JIE