Hi, I am brand new to PLINK so this might be a relatively simple question, but I was confused by the output of my log file after running PCA.
1) I initially ran --freq using the following lines of code:
" plink2 --bcf {input.bcf} --freq --set-missing-var-ids @:#[b37]\$r,\$a --allow-extra-chr --not-chr NC_048593.1 --out {output.afreq} 2> {log} "
2) Then I ran --pca with this code:
" plink2 --bcf {input.bcf} --read-freq {input.afreq} --pca --allow-extra-chr --not-chr NC_048593.1 --out {output.pca} 2> {log} "
Attached are the two log files for each of those runs. I am confused in the pca.log file why it says that 0 variants were loaded from the afreq file, and why it gives a warning that all 14057294 entries were skipped (due to missing variant IDs, mismatching
allele codes, and/or zero observations - what does that mean?). I plotted the resulting first and second PC axes and my samples are clustering according to our expectations, but the 0 variants loaded and warning raised a reg flag. Any guidance would be much appreciate, thank you!