Thank you very much for your quick reply.
The example g file from Impute2 and the g file resulting from converting the 23andme file using Plink were very similar. The snpID and the homozygous calls were the only obvious differences between them. There does not appear to be any difference between the g files, though Impute2 only works with its examples.
I recoded the chromosome with Plink as you suggested and obtained bed, bim and fam file types. I am not sure about installing gtool on a Windows computer.
I downloaded gtool. When I unzipped it, the file was not executable. I
am not really clear how to work with Unix in Windows. I have Cygwin64
installed. It would be great if I could follow your suggestion and do
the bed> gen file transfer. Impute2 might then run properly.
It would be very helpful if Impute2 had a method for splitting the files (especially the haplotype files). The genetic files (for example 1000 Genomes) are starting to become unmanageable. Uncompressed haplotype files from the latest 1000 Genomes download for single chromosomes average 3 GB. The Impute2 listserv is reporting that a new genome release will soon have a dataset of 20,000 people. I am very uncertain how to split the 1000 Genomes Haplotypes files into more manageable files, as these files are composed entirely of 0s and 1s. These files should be formatted. I am not clear whether my computer will be able to run Impute2 with the 3 GB haplotype files.
I reran the chr15 data using a .txt g file (instead of an .rtf file). My computer made it to the first MCMC iteration and then crashed (see attached Impute Run 4). This was the same result as with the Impute2 Run 1 from the previous post.
(I am worried that in run 1, the m file was for chromosome 15, though the g file was for the whole genome. In the command line -int 20.4e6 20.5e6, might be ambiguous for Impute2, as this interval could apply to all of the chromosomes. Yet, the program might not be confused by this. In Impute Run 4, the g file was only for chromosome 15, thus there would not be such ambiguity.)
This time the program recognized 1 individual (Impute2 Run 4). When I ran the program before as an .rtf (see attached Impute2 run2 from previous post), Impute2 detected 0 individuals, excluded all the SNPs, and then the program reported "There are no SNPs in the imputation interval, so there is nothing for Impute2 to analyze; the program will quit now."
Thank you for your help. If I can figure out how to use gtool, Impute2 might recognize the g file and run.