Hi All,
I'd like to make sure that my understanding of LOOic and pareto's K is correct so that I can really use this method to compare my multiple models.
According to the manual,
looic estimate from the loo package is 2*
elpd_loo, elpd_loo is the expected log pointwise predictive density. In plain English, (elpd_loo) expected log pointwise predictive density is the probability to predict the ith data point with current data set without the ith data point. So it is expressed as eqn (4) and (5) of Vehtari A., Gelman A., and Gabry J. 2016. This is Bayesian because we are using the posterior probability of parameters and the probability of the ith data given the estimated parameters to calculate elpd_loo. In other words, looic is a measure of how good this model is to fit the data.
The loo package uses a Pareto-smoothed importance sampling (PSIS) method to estimate the expected log pointwise predictive density. In this method, there is a critical parameter K (Pareto's K) for each data point to judge whether this data point being accurately predicted. If this point is not being predicted accurately (low probability I guess??), the Pareto's K would be large. According to the manual, large Pareto's K (>0.7) means the model posterior would be too different if one data point is being removed. This suggest that the model is not capturing the data well (i.e. some data points with high K are highly influential and not being considered by the model).
Following this logic, I can not trust the looic estimate if most (say>80%) of the Pareto's K is higher than 0.7.
To deal with the issue with high Pareto's K, one can either change the model structure or transform the data. Are there any other solutions?
Am I understanding the loo and interpreting it correctly?
Thank you for your help!
OSCAR