I am running a gbm with tweedie marginal distribution to predict loss cost.... I suspect that the gbm does not correclty hand the offset parameter:
Using different exposures as offset does not alter the predicted outcome. In the example below I tried both log and un logaritmed exposures and i optained the same (unreliable) result...
I suggest the documentation to provide further details on how offset and link functions are handled in gbm
#first model:
gbm <- h2o.gbm(x = c(predictors.numeric,predictors.categorical),
y = "losscost",
learn_rate = 0.01,
max_depth = 4,
min_rows = 5,
distribution="tweedie",
offset_column = 'exposure',
training_frame = trainSplit,
validation_frame = validSplit,
stopping_metric="AUTO")
gbm2 <- h2o.gbm(x = c(predictors.numeric,predictors.categorical),
y = "losscost",
learn_rate = 0.01,
max_depth = 4,
min_rows = 5,
distribution="tweedie",
offset_column = 'log_exposure',
training_frame = trainSplit,
validation_frame = validSplit,
stopping_metric="AUTO")