Have bad news for you here.
So the problem is that the player passes the ball backwards, and that pass happens in between the time steps (we perform a step every 100ms).
So neither at the step before nor at the step after the player does not have ball ownership. That's why checkpoint reward is not granted.
There does not seem to be an easy fix that does not break backwards compatibility (we'll have to publish a new version of the env, rerun all experiments, update paper).
Given the limited scope of the problem and the fact that it happens only to checkpoint reward (a custom reward that we came up with to simplify the learning),
we are leaning towards leaving things as they are right now.