Data point limit?

28 views

Skip to first unread message

Gericke

unread,

Jan 5, 2015, 3:37:58 PM1/5/15

to vistrai...@googlegroups.com

I have a very large input dataset that covers the contiguous U.S. After running it through SAHM and getting the merged dataset, I have over 100k positives and 550k background locations collapsed to 1km pixels (actual traps plus some simulated background for areas not reporting traps). When I inspected the model output (MARS application), I noticed in the confusion matrix a reduced number of absence (background) points being reported. However, the correct number shows in the MDS shapefiles. I thought that the model probability surface was fitting excessively high likelihoods throughout the map, and it seems that the majority of my background points were dropped (leaving the model unconstrained). Is there an option to expand the number of data points for model fitting, or do you have other recommendations (like a spatial thinning algorithm)?

FYI, when I added the number of points retained in the model it didn't reach some sensible round number: 106,496

Thanks.

Gericke

screencap_datacoverage.JPG

mars.confusion.matrix.jpg

Gericke

unread,

Jan 5, 2015, 5:08:34 PM1/5/15

to vistrai...@googlegroups.com

Resolved. I had most of my background points flagged as -9999 instead of -9998 (some of them derived from a background surface generator and were correctly flagged as -9998, which is why I had a small subset being fitted in the model). The reason I didn't catch it is because the MDS shapefiles will show both background flag values. I just had the wrong flag applied for a MARS model. A quick manipulation in R fixed it.

Reply all

Reply to author

Forward

0 new messages