Hi Chris,
I have some data I was given in EIGNESOFT format. I need to merge it with my cohort data for analysis. My data is in plink format. I ran CONVERTF to convert the data I was given into PED format but it isn't exactly compatible yet. The .pedsnp (.map) file looks like this, which is fine:
1 rs3094315 0.020130 752566 G A
1 rs7419119 0.022518 842013 T G
1 rs13302957 0.024116 891021 G A
1 rs6696609 0.024457 903426 C T
1 rs8997 0.025727 949654 A G
1 rs9442372 0.026288 1018704 A G
1 rs147606383 0.026665 1045331 G A
1 rs4970405 0.026674 1048955 A G
1 rs11807848 0.026711 1061166 T C
1 rs4970421 0.028311 1108637 G A
The first line of the .ped output from CONVERTF looks like this:
1 sampleID 0 0 1 1 3 3 4 4 1 1 4 4 1 1 3 3 3 3 1 1 4 4 1 3 1 1 2 4 4 4 3 3 2 2 2 2 1 1 2 4
I see you are also an author on EIGENSOFT. I hope you can give me some advice. Is there a recode option in plink that can handle the alleles being designated 1, 2, 3, 4? My aim is to make this compatible with my data and merge.
Also, I will have to do some text editing since the first 6 columns aren't in ped format. the .pedind file looks like:
1 SampleID 0 0 1 1
From the CONVERTF documentation I believe only FID and IID are preserved in the .ped output but I wonder if you know for sure whether columns 3 and 4 in the .ped are genotypes, or if they come from the .ind file.
Thanks for all your support..