> From: Erik
> Sent: 27 January 2015 14:23
> So the Ward & Wilson test would be more relevant for R_Combine (pre-
> calibration), right?
Yes.
> "after calibration the uncertainties are large enough that they cannot
> be distinguished."
> The high A index is confirming this, right?
> And by extension, it is reasonable to combine the dates (despite the
> low chi-score score), right?
>
> What are you getting out of the Difference command? I don't understand
> why you used that in this case.
The A-index is sort of indicating this. A low A-index indicates conflict between your data and your prior model and a high one lack of conflict. It is designed to be used with the 60% cut-off just like a p-value in the W&W test. But like the p-value from the W&W test it doesn't do anything else; it is just a Classical statistical decision criterion. The A-index is also an approximate calculation for the comparison of models with data, and is not a measure with a precise statistical meaning.
The Difference is an explicit calculation of the difference between the two dates, with a precise statistical meaning. Just as a confidence interval in a classical statistical framework is more informative than a p-value, the credible interval on the difference is more informative. So a difference of -30 to 290 years at 95% probability tells me that there is no strong evidence to reject the difference of zero (which is assumed by Combine), but the difference might be more than a couple of hundred years (in which case Combine might lead me astray if I am working with high-precision results). Difference also has the advantage that because it is a Bayesian posterior probability distribution it can be manipulated further. Usually we only want to combine dates which are contemporary, but a zero year difference is not the only possible definition of contemporaneity. For example, it might be good enough for my purposes to define contemporaneity as dates within 25 years of one another (perhaps on the grounds that this is less than a human generation or the resolution of an established periodisation I will compare the results to). From the Difference probability distribution I can calculate the probability that the difference is within 0±25 years, and base my judgement on that. This approach allows me to evaluate the relative strengths of two hypotheses: contemporary within acceptable limits versus different. In the case of your two dates there is only ~15% chance that the difference is 0±25, and ~85% chance that is it larger. So the odds are about 5.8:1 that the difference is more than 25 years. With an A-index or p-value you could only have said that the data was consistent with the null hypothesis of no difference at 95% confidence, and nothing about the alternative hypothesis. That's one of the big advantages of Bayesian statistics over Classical statistics :-)