Hi all,
I see there are a lot of posts in the group about the mb.gof.test() but I can't find anything that seems to address my problem.
I have been using occupancy models for a decade now - a nice chunk of that has been through unmarked. I am having an issue now with single-season occupancy models that I have never before had - I cannot achieve a reasonable c-hat, no matter how I build my models.
The data:
> 1/0 data from a variety of forest structures (young, old, etc.) across an entire state
> Two visits per site/year
> Two years - year is included as a covariate - most sites visited in only one year
> Large sample size (~1,500)
> I have good detection covariates that explain p nicely (as best I can tell)
> I also have great habitat data that explain species occupancy patterns
The analyses:
> Single-season occu
> Linear variables are scaled
> Nothing else stands out as unusual to me about this analysis
> Focusing on one bird species for now, though the dataset has a bunch of birds I could model if I wanted
> detection estimates are reasonable (0.65ish)
> Naive occupancy is in the neighborhood of 0.50
The issue:
No matter how I build my models -- and almost regardless of the species I choose (when I try modeling other species) -- c-hat is through the roof. Like 10-40. I tried several other species and consistently get this issue -- with one exception -- and that species had c-hat = 0.20.
I'm at a complete loss - I've tried scaling variables, NOT scaling them, subsetting the data to focus on particular regions, including ALL detection covariates... or none.... I've tried running super simple models with few covariates... and really complex models with many covariates - it doesn't seem to matter. And species doesn't matter THAT much either. Here's the kind of output I am getting:
####################################
MacKenzie and Bailey goodness-of-fit for single-season occupancy model
Pearson chi-square table:
Cohort Observed Expected Chi-square
00 0 710 706.36 0.02
01 0 73 88.32 2.66
10 0 134 126.39 0.46
11 0 158 153.94 0.11
Chi-square statistic = 3.241
Number of bootstrap samples = 1000
P-value = 0
Quantiles of bootstrapped statistics:
0% 25% 50% 75% 100%
0.00046 0.07730 0.18452 0.36060 2.90067
Estimate of c-hat = 11.95
######################################
Any tips? Maybe I ought to cut my losses to work with glms for this project?
Thank you in advance - happy to provide more detail if it helps...