Hi Joseph,
I only just realised that you've answered some of my questions - I just posted another topic about my ESS values, but would love to follow up with you about some of the stuff you replied to as well.
You said it would be okay to add blank sequences into my file rather than cutting out data from my COI gene. The reason I thought this wouldn't be great is that it is introducing a lot of 'unknowns' and I thought this might reduce the ability to determine informative relationships between my species, but this was just a guess rather than based on some sort of information. So do you believe that the blank data won't reduce the power of the analysis?
The other part was about estimating my rates. The reason I have ticked all of the estimate boxes is because of the several different rates that have been suggested for two of my genes (COI and 16S). For COI my rates are between 0.007-0.01/myr and 16S is 0.00265-0.0045/myr based on publications for similar organisms. I have set these as the upper and lower limits in my "priors" tab, while putting the 'middle' rate (i.e. a rate in the middle of my upper and lower limits) in the "clocks" tab and leaving the estimate box ticked. If I don't leave the box ticked I can no longer set upper and lower limits on the rate. I'm not sure how to work around this?
My last question for you is sort of a general question as to whether the set-up I currently have is suitable for what I'm trying to do. I've never used Beast before at all, so I've done as much reading as I can but some things still seem like a bit of guess work, so maybe you can help. Basically what I'm trying to do is build a combined-marker species tree to infer the relationships between closely related organisms. I don't know where the species boundaries lie (this is one of the questions I'm trying to answer), I don't have any calibration points for divergence times (I would like to get an estimate of when certain groups diverged), and not much is known about these organisms in general. My data format is like this: I have 5 partitions, I've linked my mtDNA trees, my "traits" are each population (which could potentially be a separate species, though not necessarily - this is what I need to find out), my "sites" are all TN93 with one HKY (some with +I +G or just +G), my "clocks" are all lognormal relaxed, my "trees" are all Yule Process with a Piecewise linear & constant root with a Random starting tree, I've set all of my ucld.mean to uniform for all genes, and my MCMC is 100 million (which takes at least 2 days to run).
I'm not sure if you can answer whether all of these parameters are appropriate for what I'm trying to do, but any insight would be great. I've attached the xml file as well.
Thanks so much for your help already, and any more would be great :)
Kat