Hi Nolan,
Presumably Maxent (and any other SDM) using background sites which could land on a presence (known or unknown) would assign background sites that landed on a presence a higher prediction than if the background landed on a true absence (known or unknown).
Generally for a well-tuned model the maximum achievable AUC calculated with background sites will be 1 - a/2 where a is the prevalence (mean probability of occurrence) of the species. This means if you get an AUC of, say, 0.6, you can't say whether this is arbitrarily good or bad--if the species occupies 80% of the study region (mean Pr(occ)= 0.8), then this would be an excellent AUC. But if it occupied less 0.6 might be low. However, you should be able to compare AUCs between models for the same
species using the same data (pres + bg)... higher values should indicate
better models because they should correlate positively with AUC
calculated with presences and absences (which is what we normally would
like to have).
However, it's possible to have AUC (pres+bg) > 1 - a/2 if you overtune your model, in which case there will be a *negative* relationship between AUC calculated with presences/absences and presences/background sites. So chasing a higher AUC(pres+bg) can actually make a worse model, even if it seems better.
I have no empirical basis for saying this, but I expect that Maxent's use of LASSO regularization tends to guard against overtuning models, so the point in the paragraph above is less an issue. However, I have seen other algorithms which I really felt were overtuned, and this gave very high AUC (pres+bg), but probably would have given low AUC (pres+abs).
Adam
Assistant Scientist
Missouri Botanical Garden