Special considerations when using ActorCritic MDPSolver
14 views
Skip to first unread message
zju...@tenkiv.org
unread,
May 19, 2018, 1:57:54 PM5/19/18
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to BURLAP Discussion
I have started experimenting with Burlap and after trying the tutorials I tried to learn more by switching out the MDPSolver for the various examples. I could get most of them to work but I am having quite a bit of difficulty with ActorCritic. Are there some special considerations that need to be taken with ActorCritic? Or, even better, is there a tutorial / example that uses ActorCritic? After looking through the source code it seems like the critic updates the actors policy automatically and I shouldn't need to do anything externally. Sorry for the basic question, I am just getting started with Burlap and reinforcement learning in general.