Hi!
It looks like there is an error in the provided 'normalize_object_phrase_list' function.
Resulting list is getting constructed not from head words, but from original phrases.
I have created a PR to fix it. After this fix baseline quality improves to 0.189.
Kind regards,
Tsimafei