Hi,
I'm building a density network, and I'm considering using the NormalInverseGuassian. I am seeking recommendations for how to ensure that the parameters passed to the distribution are valid, specifically, that |tailweight| >= |skewness|.
My first idea is to define the loss such that the loss is an arbitrarily large number when the assumption is violated. That doesn't feel robust though - I worry that at prediction time, there is nothing to prevent the model from passing invalid parameters to the distribution.
My model will look something like the model I described
here, though if I can figure out how to get the NormalInverseGaussian to work, I may not need a mixture distribution.
Thanks,
Dan