Hi,
I am trying to convert huge vcf.gz files into PLINK binary format so I can build LD matrices using --r square. Looking at the bp positions in the .bim file, I noticed that only about the first 10% of SNPs for a given chromosome are being processed. No error is given; the job just ends, and when I check the output file it contains only the first portion of the SNPs.
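A minimal sketch of that kind of .bim check, for anyone who wants to reproduce it. It assumes the standard six-column .bim layout (chromosome, variant ID, cM position, bp position, allele 1, allele 2). The function name summarize_bim and the file name in the usage comment are hypothetical, not the exact script used here.

```python
def summarize_bim(lines):
    """Return {chrom: (n_variants, min_bp, max_bp)} from .bim lines.

    Assumes whitespace-delimited .bim rows with the bp position
    in the fourth column.
    """
    summary = {}
    for line in lines:
        fields = line.split()
        if len(fields) < 6:
            continue  # skip blank or malformed lines
        chrom, bp = fields[0], int(fields[3])
        n, lo, hi = summary.get(chrom, (0, bp, bp))
        summary[chrom] = (n + 1, min(lo, bp), max(hi, bp))
    return summary


# Hypothetical usage on a real file:
# with open("chr1.bim") as fh:
#     for chrom, (n, lo, hi) in summarize_bim(fh).items():
#         print(chrom, n, lo, hi, sep="\t")
```

Comparing the largest bp per chromosome against the last position in the source VCF shows how far the conversion got before the job ended.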
The number of SNPs processed seems to go up as I allocate more memory, but I'm already giving it hundreds of GB and it doesn't seem close to reaching the end of the chromosome. Is there any advice on how to get PLINK to successfully process these giant files?
Thanks so much!
Michael