Hi Mariana,
- for the Felsenstein correction, you put the *total* number of *invariable* sites, i.e. ASC_FELS{4382000}
- for the Stamatakis correction, you put the number of *invariable* sites with As, Cs, Gs, and Ts, i.e.
ASC_STAM{nA/nC/nG/nT}, where nA + nC + nG + nT = 4382000
Since you have whole-genome sequences, all those numbers can be easily computed by simple character counting.
Best,
Alexey
> I work with a genome size specie of 4.4 millions bp (or positions), I made whole genome sequencing, but after the SNP
> identification pipeline, I have an alignment.fasta file with ONLY variable sites (18000 variable positions)
> If I am right, when usign ASC_FELS the number of invariant sites will be (4382000=4400000-18000), is that ok??
> If I used instead ASC_STAM, I must define the number of those 4382000 positions that are A/C/G/T?? Is that ok?? I know
> the proportion of each nucleotide in the genome, but I must define the total variable As and so on...
>
> I am asking this, because I discussed the parameters used by a collegue, and he defined the invariant sites as the total
> position of the whole genome that are A/C/G/T, not considering the variable sites.. I mean he reported all the genome as
> invariant sites plus the variable sites present in the fasta file. Because of that I need to confirm if I am rigth or
> wrong!!
>
> tranks in advance!
> sincerely
> Mariana
>
>
> El domingo, 15 de abril de 2018, 20:22:13 (UTC+2), Alexey Kozlov escribió:
>
> Hi Sean,
>
> > I am working with RAD data and have a phylip file containing only variable sites. How do I set the number of
> invariant
> > sites using the correction --asc-corr=stamataki ? The manual says to use the partition file. So would I calculate
> the
> > number of total sites (T) and just create a partition file with /DNA, p1:1-T/? (and the invariant sites will be
> inferred?)
>
> T must be the number of (variable) sites in your PHYLIP file. Please see p. 44 of RAxML manual for the explanation how
> to specify the number of invariant sites:
>
>
https://sco.h-its.org/exelixis/php/countManualNew.php <
https://sco.h-its.org/exelixis/php/countManualNew.php>
>
> > Also, what is the best way to calculate the number of invariable sites from a vcf file?
>
> VCF file usually contain variable sites only (SNPs), so I don't think there is any possibility to *calculate* the
> number
> of invariant sites. However, this information could be theoretically specified in one of the header fields (but not
> sure
> about this).
>
> Best,
> Alexey
>
> --
> You received this message because you are subscribed to the Google Groups "raxml" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to
raxml+un...@googlegroups.com
> <mailto:
raxml+un...@googlegroups.com>.
> For more options, visit
https://groups.google.com/d/optout.