Hi everyone,
I am running models to predict the distribution of various lemur
species in Madagascar (I have relatively small data-sets (mostly <100
occurrence points for each species)) and am uncertain of the
distinction between the run types crossvalidate and subsample. I have
run all my models for both types with the same parameters (random
seed, use duplicate records...) and a 25% test percentage. The
predicted areas are very similar, however, the subsample run types is
slightly more conservative (slightly smaller extent of predicted
occurrence). The description of the subsample run type says it uses a
random 75% of occurrence points to train the model, then tests it with
the remaining 25%. The crossvalidate run type says it divides the
data into replicates folds and each fold is in turn used for test
data. This is somewhat confusing, I am unsure of what exactly the
model is doing with this run type. The default settings are for
crossvalidate with the number of replicates equal to 1. I was
wondering if anyone had any ideas on which is preferred and why or had
any suggestions for me.
If anyone can help clarify this issue it would be greatly appreciated!
Thanks,
Heather
--
You received this message because you are subscribed to the Google Groups "Maxent" group.
To post to this group, send email to
max...@googlegroups.com.
To unsubscribe from this group, send email to
maxent+un...@googlegroups.com.
For more options, visit this group at
http://groups.google.com/group/maxent?hl=en.