I just noticed that if I print 'action' from in my World.step(), the first time, it is a simple array, i.e.
[0. 0.]
But there after (i.e., once planner has actually selected an action) it is a nested array, i.e.,
[[0.]
[0.]]
Is this expected? Or indicative of a bug in my World code? Or a bug in BECCA?
All the Best,
Matt Chapman