Hi,
I'm converting a VCF file to a PLINK binary fileset using the following command:
plink --vcf test.vcf.gz --biallelic-only --make-bed --out test
However, the ID field in test.vcf.gz is missing (set to '.'):
#CHROM POS ID REF ALT QUAL FILTER
1 1 . C A 501 PASS
1 50 . C T 415 PASS
1 100 . C T 999 PASS
1 302 . T TA 999 PASS
1 308 . C T 43.80 FAIL
1 400 . T C 102 PASS
As a result, test.bim has '.' as the ID of every variant:
1 . 0 1 A C
1 . 0 50 T C
1 . 0 100 T C
1 . 0 302 TA T
1 . 0 308 T C
1 . 0 400 C T
Is there a feature in PLINK 1.90 that can detect missing IDs in the VCF file and use 'CHR-POS' as the default variant ID? My VCFs are quite big, so it would be great to keep intermediate file manipulation to a minimum. I imagine this would work much like the --id-delim flag, which splits sample IDs into FIDs and IIDs. It would be very useful when using a VCF as direct input for tasks such as pruning, so that the output isn't a list of variant IDs that are all '.'.
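For context, the workaround I'm hoping to avoid is a separate post-processing pass over the .bim file, roughly like the sketch below. The function name fill_missing_ids is just a placeholder of mine, and the sketch assumes the usual six-column .bim layout (chromosome, variant ID, genetic distance, position, allele 1, allele 2) with whitespace-separated fields.

```python
def fill_missing_ids(lines, sep="-"):
    """Replace '.' variant IDs in .bim-style lines with CHR<sep>POS.

    Hypothetical helper for illustration; not part of PLINK.
    """
    fixed = []
    for line in lines:
        fields = line.split()
        # Assumed .bim columns: chrom, variant ID, genetic distance,
        # base-pair position, allele 1, allele 2
        if len(fields) >= 4 and fields[1] == ".":
            fields[1] = f"{fields[0]}{sep}{fields[3]}"
        # Rows are re-joined with tabs whether or not the ID changed
        fixed.append("\t".join(fields))
    return fixed


if __name__ == "__main__":
    # Two rows taken from the test.bim output shown above
    example = ["1 . 0 1 A C", "1 . 0 302 TA T"]
    for row in fill_missing_ids(example):
        print(row)
```

This works, but it means an extra step after every conversion, which is why a built-in option would be preferable.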
Many thanks,
Yang