Documentation for off-policy actor-critic algorithm

7 views
Skip to first unread message

bassel ali

unread,
Feb 4, 2020, 4:03:05 AM2/4/20
to RLPark
Dear RLPARK,

is it possible to provide a documentation for the off-policy actor-critic algorithm code?

Thomas Degris-Dard

unread,
Feb 4, 2020, 9:48:35 AM2/4/20
to github...@googlegroups.com
I suggest to look at this paper:
https://arxiv.org/pdf/1205.4839.pdf

Thomas

Le mar. 4 févr. 2020 à 09:03, bassel ali <bassel.d...@gmail.com> a écrit :
Dear RLPARK,

is it possible to provide a documentation for the off-policy actor-critic algorithm code?

--

---
You received this message because you are subscribed to the Google Groups "RLPark" group.
To unsubscribe from this group and stop receiving emails from it, send an email to githubrlpark...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/githubrlpark/8afb8488-8e1a-4aaf-a121-e8cbe3d3a5c2%40googlegroups.com.

bassel ali

unread,
Mar 24, 2020, 2:21:06 AM3/24/20
to RLPark
Thank you Thomas, I will read the paper, a question if you don't mind.

Is it possible to do batch learning using OFF-PAC?


On Tuesday, February 4, 2020 at 11:48:35 PM UTC+9, Thomas wrote:
I suggest to look at this paper:
https://arxiv.org/pdf/1205.4839.pdf

Thomas

Le mar. 4 févr. 2020 à 09:03, bassel ali <bassel....@gmail.com> a écrit :
Dear RLPARK,

is it possible to provide a documentation for the off-policy actor-critic algorithm code?

--

---
You received this message because you are subscribed to the Google Groups "RLPark" group.
To unsubscribe from this group and stop receiving emails from it, send an email to github...@googlegroups.com.

bassel ali

unread,
Mar 24, 2020, 2:23:15 AM3/24/20
to RLPark
Sorry, a follow-up question, if not possible to do batch learning with OFF-PAC, do you know
any off policy policy gradient method that can do batch learning?

Thanks in advance and sorry for the trouble
Reply all
Reply to author
Forward
0 new messages