Hi Khadija,
I have a .txt file with SNPs. I intend to perform clumping. I started off using LDlinkR package in R. LDmatrix has a limit of 3000SNPs and therefore I had to split the data manually, verifying the chromosomes and positions being included. My .txt file has information on SNPID, Chromosome, Position, Pvalue, tested and other allele. I was able to form the LD matrix and use R square threshold to filter out the SNPs that are correlated. I am stuck from there, and I am unable to get a working code that effectively filters out correlated SNPs.
I want to try PLINK as I heard its faster. However, I wasnt able to load the .txt file. I went back and got the .gz file from the 1000genomes dataset. However, I am unsure how to load it. PLINK documentation only talks about .bed, .ped or .map files.
Thank you,
Jyothi.