Hello everyone!
I've been dabbling with RL for the past months and RL is a very delightful subject. The book by Sutton & Barto has especially been very helpful in grasping some obtruse concepts. However since I haven't taken a formal course on RL, I'm finding it a little difficult to implement traditional examples. Especially because unlike traditional ML where you can make powerful and plot errors out of a library like sklearn, RL doesn't seem to have a standard repository of examples in Python.
I'm working on a continuous domain problem and tabular learning seems to have failed miserably, therefore I am now using Tile Coding in Python to approximate the value function. Since I don't know Lisp too well, I'm currently going through the C implementation on Prof Sutton's page here
http://incompleteideas.net/sutton/book/code/code.html and was wondering if anyone in the academia or otherwise would pass on some advice to me regarding working Pythonic implementations.
Mainly, what would be a good investment in time (languages or related material) right now to pursue it when I move to gradschool? How is it being implemented in the industry? Say any intuitive explanation of how Nest might be using RL to control HVACs and optimize energy consumption?
Thanks!