Dear Angel,
I have created an excel file where I manually calculated the normalized gini index for the estimated training values. Therefore, I have opened your attached solution, copied the estimated class values into excel and discarded the test values. Afterwards, I followed the instructions from [0] to calculate the normalized gini index. Basically, you sort the values, normalize the values and calculate the area under the cumulative curve. The only difference to the described procedure is that I have ordered the values in descending order. The second tab in the attached excel does the same calculations, but the values are order by the target so that the max obtainable gini index is calculated. Afterwards, I just divided both gini indexes (actual and maximum) and achieved a normalized value-0.744074858274709 that is up to the 13th decimal place the same as the value in HL.
An alternative description to calculate the gini index is given in [1] and our implementation can be found at [2].
I hope this helps,
Michael
[0] https://theblog.github.io/post/gini-coefficient-intuitive-explanation/
[1] https://www.kaggle.com/c/ClaimPredictionChallenge/discussion/703
--
You received this message because you are subscribed to the Google Groups "HeuristicLab" group.
To unsubscribe from this group and stop receiving emails from it, send an email to
heuristiclab...@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/heuristiclab/8a65e041-6fb5-4f2b-9cc3-63ab2ab5f1b9n%40googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/heuristiclab/312a4ca6-4af7-4c6b-80d7-71cd2ca10e49n%40googlegroups.com.