Dear Michelle,
> I am very new to the phylogenetic analyses and will appreciate your help
> with the following 3 questions I have about the program. I searched the
> group postings here but was not able to find the direct answers to these
> questions.
>
> First, as an example, in the original file, I have two individuals (with
> ids 1045_3 and 1157_5) and 3 markers with each marker having two alleles:
>
> 1045_3 T T C T G A
> 1157_5 T T C C A A
>
> After converting this file to the fasta file format, I have
>
>> 1045_3
> TTCTGA
>> 1157_5
> TTCCAA
>
> *Is this the correct input file format for SNP data analyses for RAxML?*
RAxML can read FASTA and relaxed PHYLIP, the formats are specified in
the comprehensive manual which I'd please ask you to read:
http://sco.h-its.org/exelixis/resource/download/NewManual.pdf
>
> Second, since this is a diploid dataset, should I determine the *haplotype
> phase* first. In this example, will the results be different if I have
>
>> 1045_3
> *TTTCAG rather than TTCTGA*
>> 1157_5
> TTCCAA
yes, the result will be different
> Last, about the assumption that different sites (markers) evolve
> independently of each other, does this mean that the markers shouldn't be
> in *linkage disequilibrium* with each other?
phylogenetic likelihood models always assume that sites evolve
independently, but you can try to model these dependencies or
independences by having the corresponding markers evolve under the same
model, that is put them into a single or separate partitions ..
alexis
>
> These are very basic questions. Thank you for your attention and patience!
>
> Michelle
>
--
Alexandros (Alexis) Stamatakis
Research Group Leader, Heidelberg Institute for Theoretical Studies
Full Professor, Dept. of Informatics, Karlsruhe Institute of Technology
Adjunct Professor, Dept. of Ecology and Evolutionary Biology, University
of Arizona at Tucson
www.exelixis-lab.org