Hi,
Thanks for your reply. The
"plink.mdist.missing" file is a matrix :
".mdist.missing (identity-by-missingness matrix)
Produced by "--cluster missing".
A triangular space-delimited text file with identity-by-missingness coefficients."
But without sample IDs available as a column and row names of the matrix it is not possible to correctly interpret and visualize IBM distances between samples. How can i get the same file, but with sample IDs for further analysis of the matrix?
The log file is:
"
PLINK v1.90b6.26 64-bit (2 Apr 2022)
Options in effect:
--allow-extra-chr
--cluster missing
--keep-allele-order
--vcf Diploids_GBS.vcf
Hostname: node03
Working directory: /mnt/lustre/tkiy/Diploids_GBS/GBS_pipeline
Start time: Fri May 6 15:44:17 2022
Random number seed: 1651841057
48258 MB RAM detected; reserving 24129 MB for main workspace.
--vcf: plink-temporary.bed + plink-temporary.bim + plink-temporary.fam written.
109444 variants loaded from .bim file.
96 people (0 males, 0 females, 96 ambiguous) loaded from .fam.
Ambiguous sex IDs written to plink.nosex .
Using up to 23 threads (change this with --threads).
Before main variant filters, 96 founders and 0 nonfounders present.
Calculating allele frequencies... done.
Total genotyping rate is 0.476451.
109444 variants and 96 people pass filters and QC.
Note: No phenotypes present.
IBM matrix written to plink.mdist.missing .
Clustering... done.
Cluster solution written to plink.cluster1 , plink.cluster2 , and
plink.cluster3.missing .
"
Please let me know if my VCF file is still needed to replicate the issue. I'll try to prepare a toy example of my VCF for sending in that case.
Regards,
Denis