Goal output when training with reward shaping wrappers


Yutai Zhou

Sep 3, 2020, 10:42:56 AM
to Google Research Football
I am training agents using both the scoring and checkpoints reward wrappers. I would like to understand how the choice of wrappers affects the reward that gets logged.

In the multiagent setting, the default behavior is that every agent shares the same reward. That is, if the team advances 10% of the field length, every agent gets 0.1 points, and if the team scores a goal, every agent gets 1 point. Is that correct?

If the checkpoints wrapper is used, does the logged reward include the checkpoint rewards? If so, how can I log a goal-only reward metric in addition to the goal+distance based reward metric?
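
To make the second question concrete, here is a minimal sketch of one generic way to track both metrics: record the unshaped reward in `info` before any shaping wrapper touches it, then sum the two values separately per episode. Everything in it (`ToyEnv`, `RecordGoalReward`, `ToyShaping`, the `goal_reward` key, the 0.1 bonus) is a hypothetical stand-in written for illustration, not the Google Research Football API, and it assumes the recording wrapper can be placed underneath the shaping wrapper.

```python
class ToyEnv:
    """Hypothetical stand-in for an env that emits goal-only rewards."""

    def __init__(self, goal_rewards):
        self._rewards = list(goal_rewards)
        self._t = 0

    def reset(self):
        self._t = 0
        return 0

    def step(self, action):
        reward = self._rewards[self._t]
        self._t += 1
        done = self._t >= len(self._rewards)
        return self._t, reward, done, {}


class RecordGoalReward:
    """Copies the unshaped reward into info before shaping is applied."""

    def __init__(self, env):
        self.env = env

    def reset(self):
        return self.env.reset()

    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        # "goal_reward" is an assumed key name chosen for this sketch.
        info = dict(info, goal_reward=reward)
        return obs, reward, done, info


class ToyShaping:
    """Hypothetical shaping wrapper: adds a fixed bonus on every step."""

    def __init__(self, env, bonus):
        self.env = env
        self.bonus = bonus

    def reset(self):
        return self.env.reset()

    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        return obs, reward + self.bonus, done, info


def run_episode(env):
    """Returns (shaped return, goal-only return) for one episode."""
    env.reset()
    shaped_return, goal_return, done = 0.0, 0.0, False
    while not done:
        _, reward, done, info = env.step(0)
        shaped_return += reward             # goal + shaping bonus
        goal_return += info["goal_reward"]  # goal only
    return shaped_return, goal_return


# Recording wrapper sits below the shaping wrapper, so it sees raw rewards.
env = ToyShaping(RecordGoalReward(ToyEnv([0, 0, 1, 0])), bonus=0.1)
shaped_return, goal_return = run_episode(env)
```

Both returns can then be written to whatever logger the training loop already uses. Whether the real environment already exposes an unshaped score in `info` would need to be checked against its source.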

stanczyk

Sep 8, 2020, 3:49:05 AM
to Google Research Football