Is it possible to use keras-rl for multi agent environments, where each one has its own model to train (each agent has a different model)?
I have been trying to do this using threads, but after trying many different ways to configure agents, environments and launching threads at different points, I always manage to get a TensorFlow exception of the kind:
ValueError: Tensor Tensor("activation_2/Softmax:0", shape=(?, 4), dtype=float32) is not an element of this graph.
Something interesting would be to start with an environment that allows different agents, such as go, and then generalize to others more complicated. |