Special considerations when using ActorCritic MDPSolver

14 views

Skip to first unread message

zju...@tenkiv.org

unread,

May 19, 2018, 1:57:54 PM5/19/18

to BURLAP Discussion

I have started experimenting with Burlap and after trying the tutorials I tried to learn more by switching out the MDPSolver for the various examples. I could get most of them to work but I am having quite a bit of difficulty with ActorCritic. Are there some special considerations that need to be taken with ActorCritic? Or, even better, is there a tutorial / example that uses ActorCritic? After looking through the source code it seems like the critic updates the actors policy automatically and I shouldn't need to do anything externally. Sorry for the basic question, I am just getting started with Burlap and reinforcement learning in general.

Reply all

Reply to author

Forward

0 new messages