I thought you would elicit some sort of Bayesian answer,
but that hasn't happened.
Bayesian computation uses a "prior distribution" and
comes up with a combined, Bayesian estimate -- but that
is not the same, exactly, as reporting Mean and SD.
And I'm not a bayesian advocate, nor am I up-to-speed
on what they are doing, but my impression is that the
results, in terms of narrowing or modifying the estimators
is ordinarily of the magnitude that you get by adding a
total of 1 case, or very few cases, to the observed sample
size.
If you want to make a statement based on a long time-series
of observations, there are classical techniques that *might*
be applicable -- What is appropriate would depend on
whether you are tapping some dimension that you is
constant ("is thought to be constant") or that might
have a slow change, relative to the number of census points.
For the simplest instance -- If there is no change expected
or suggested by the data, you might decide to pool all the
avialable data, and present the overall mean and SD, based
on the total N. -- If that comes to a really large N, it will
produce a SD that is too small, because it will not take into
account the standard error of the bias of the estiumations.
If there is slow change, you might argue for a time-series
projection. That would mainly use the most recent points,
but it might afford a more precise estimate of the present
mean than you get by using the latest data alone.
--
Rich Ulrich