[R] Theta in Negative binomial GAM

255 views
Skip to first unread message

Eva Maria Leunissen

unread,
Feb 26, 2017, 7:18:32 PM2/26/17
to r-h...@r-project.org
Hi, I'm fitting a negative binomial GAM (using mgcv) to my data using
family=nb() so theta is estimated during the fitting process. When I then
extract this theta from the model and refit the same model with
family=negbin(theta) it gives a much lower AIC. I know using AIC to compare
negative binomial models should be done with caution (
http://r.789695.n4.nabble.com/How-to-compare-GLM-and-GAM-models-tt827923.html#a827926)
but approximately it is ok. My questions are:

Is it better to do model selection with nb() or with negbin and a known
theta. if the latter how do you know what theta is?

can you compare models using AIC if the theta is different for each model?

Thanks in advance, any help is much appreciated

Kind regards,

Eva Leunissen

[[alternative HTML version deleted]]

______________________________________________
R-h...@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Simon Wood

unread,
Feb 27, 2017, 3:50:04 AM2/27/17
to Eva Maria Leunissen, r-h...@r-project.org
nb() will default to "REML" smoothing parameter estimation, while
negbin() will default to "UBRE" unless you use the 'method="REML"'
option to 'gam'. Using UBRE in place of REML may lead to differences in
model fit, and will also mean that the AIC is not corrected for
smoothing parameter uncertainty (this correction increases the AIC). You
also expect AIC to drop by 2 or so, because negbin is treating theta as
fixed.

So...

m <- gam(...,family=nb())

should give similar AIC values about 2 higher than

th1 <- m$family$getTheta(TRUE)
m1 <- gam(...,family=negbin(th1),method="REML")

If you don't know theta it is better to estimate it.

best,
Simon

ps. Tedious details of AIC smoothing uncertainty correction are in
sections 4 and 5 of this:
http://amstat.tandfonline.com/doi/pdf/10.1080/01621459.2016.1180986
--
Simon Wood, School of Mathematics, University of Bristol BS8 1TW UK
+44 (0)117 33 18273 http://www.maths.bris.ac.uk/~sw15190
Reply all
Reply to author
Forward
0 new messages