Re: Mountain Car


Brian Tanner

Aug 7, 2015, 10:19:34 AM
to Amir Hossein Mojarrad, rl-li...@googlegroups.com
Hi Amir. I have also sent this message to the RL-Library group in case anyone else wants to chime in.

I have not looked at this code in many years, so it is possible it does not work anymore. More likely though, it’s a misunderstanding.

The second link (the tutorial) has 3 steps: getting the environment (mountain car), the agent (random), and the experiment (SampleExperiment).

Are you able to get the experiment and the environment parts working?

As for the agent - it uses some (good) code that I wrote for Tile Coding from this project: https://code.google.com/p/bt-agentlib/
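For readers new to the idea, tile coding lays several offset grids over the state space and marks one active tile per grid, so nearby states share most of their active tiles. The following is a minimal sketch of that idea for Mountain Car's two state variables, not the bt-agentlib code. The class name `TileCoderSketch`, the state bounds, and the evenly spaced offsets are all assumptions made for illustration.

```java
/** Illustrative grid tile coder for a 2-D state (position, velocity). Not the bt-agentlib code. */
class TileCoderSketch {
    // Assumed Mountain Car bounds; check the environment's task spec for the real ranges.
    static final double POS_MIN = -1.2, POS_MAX = 0.6;
    static final double VEL_MIN = -0.07, VEL_MAX = 0.07;

    final int numTilings;   // number of overlapping grids
    final int tilesPerDim;  // tiles per dimension across the state range

    TileCoderSketch(int numTilings, int tilesPerDim) {
        this.numTilings = numTilings;
        this.tilesPerDim = tilesPerDim;
    }

    /** Each tiling needs one extra row and column because it is shifted by a fraction of a tile. */
    int numFeatures() {
        int side = tilesPerDim + 1;
        return numTilings * side * side;
    }

    /** Returns one active feature index per tiling for the given state. */
    int[] activeTiles(double position, double velocity) {
        double p = scale(position, POS_MIN, POS_MAX);
        double v = scale(velocity, VEL_MIN, VEL_MAX);
        int side = tilesPerDim + 1;
        int[] active = new int[numTilings];
        for (int t = 0; t < numTilings; t++) {
            double offset = (double) t / numTilings; // shift in units of one tile width
            int row = (int) Math.floor(p + offset);
            int col = (int) Math.floor(v + offset);
            active[t] = t * side * side + row * side + col;
        }
        return active;
    }

    /** Clamps x to [min, max] and maps it to [0, tilesPerDim]. */
    private double scale(double x, double min, double max) {
        double clamped = Math.max(min, Math.min(max, x));
        return (clamped - min) / (max - min) * tilesPerDim;
    }

    /** Linear value estimate: the sum of the weights of the active tiles. */
    static double value(double[] weights, int[] active) {
        double sum = 0.0;
        for (int i : active) {
            sum += weights[i];
        }
        return sum;
    }

    public static void main(String[] args) {
        TileCoderSketch coder = new TileCoderSketch(8, 8);
        int[] active = coder.activeTiles(-0.5, 0.0);
        System.out.println(java.util.Arrays.toString(active));
    }
}
```

A learning agent such as Sarsa(lambda) would keep one weight vector per action and update only the weights of the active tiles, which is what makes the representation cheap to train.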

You should be able to download, unpack, and run as the instructions say. Do you have Ant and Java installed on your system? What system are you using? Can you provide copy+paste input/output from your console?

--
Brian Tanner
Operations Manager, Fire Inspector
br...@fireplan.ca
204-975-4901 Extension 102

Fire Plan Strategies: Fire safety training, planning, signs, and supplies across Canada!

On Aug 7, 2015, at 1:19 AM, Amir Hossein Mojarrad <ahmod...@gmail.com> wrote:



Dear Brian


I've been searching for two weeks for Mountain Car with tile coding and RL-Glue.

Is there any help or a step-by-step guide?

Let me tell you what I am doing, if you don't mind:

I downloaded and ran EpsilonGreedyTileCodingSarsaLambda-Java-R30.tar.gz.

It does not work and fails with this error:
target "run" does not exist in the project "btannerAgentLib"

When I checked the files, I found a tile coding file, but no dependency or relation between that tile coding code and Mountain Car.

When I run SampleExperimentRLGlue-Java-R1068.tar.gz,

everything is OK,

but there are no references to tile coding in its source code.

Please correct me if I'm wrong: SampleExperimentRLGlue-Java-R1068.tar.gz is NOT using tile coding?

Thank you for your time.


--
Sincerely Yours
Amir Hossein Mojarrad
Enterprise Administrator (MCITP) #SR6485499
ISO 27001- DNV : # 011-THR-IS-00115081
Mikrotik certified  (MTCNA) :# 1111NA006
--------------------------------------------------
IRIK.Co
URL: www.irik.ir
Phone : +98(0)713 6274400
Fax : +98(0)7136274400
Mobile / Viber  : +98(0)917 302 83 29
Email : ahm...@gmail.com
Email : a.moj...@irik.ir
Skype : Amir.Hussein.Modjarrad
MSN : ahmod...@gmail.com

Brian Tanner

Aug 8, 2015, 3:11:54 PM
to Amir Hossein Mojarrad, rl-li...@googlegroups.com
This sample is a random agent. The other one that you mentioned uses tile coding, but it’s my own code and it’s not cleaned up for public use.  If you are not very savvy with reinforcement learning and Java programming, I think it may not be the best place to start.

The actual code for the agent is split across many files using interfaces. The project the code comes from was one that allowed easy swapping of function approximators, action selection strategies, etc., which does not make it easy to learn from.
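To give a rough picture of what an interface-based design with swappable parts can look like, here is a small sketch. It is not the bt-agentlib API: the names `ActionValueFunction`, `ActionSelector`, `EpsilonGreedy`, and `SwappableAgentSketch` are hypothetical and exist only for this illustration.

```java
import java.util.Random;

/** Estimates the value of taking an action in a state (for example, via tile coding). */
interface ActionValueFunction {
    double value(double[] observation, int action);
}

/** Chooses an action given the current value estimates. */
interface ActionSelector {
    int select(double[] observation, ActionValueFunction q, int numActions);
}

/** Epsilon-greedy selection: explore with probability epsilon, otherwise act greedily. */
final class EpsilonGreedy implements ActionSelector {
    private final double epsilon;
    private final Random rng;

    EpsilonGreedy(double epsilon, Random rng) {
        this.epsilon = epsilon;
        this.rng = rng;
    }

    @Override
    public int select(double[] observation, ActionValueFunction q, int numActions) {
        if (rng.nextDouble() < epsilon) {
            return rng.nextInt(numActions); // exploratory action
        }
        int best = 0;
        double bestValue = q.value(observation, 0);
        for (int a = 1; a < numActions; a++) {
            double v = q.value(observation, a);
            if (v > bestValue) {
                best = a;
                bestValue = v;
            }
        }
        return best; // greedy action (first maximum on ties)
    }
}

/** Hypothetical agent wiring: either part can be replaced without touching the other. */
final class SwappableAgentSketch {
    private final ActionValueFunction q;
    private final ActionSelector selector;
    private final int numActions;

    SwappableAgentSketch(ActionValueFunction q, ActionSelector selector, int numActions) {
        this.q = q;
        this.selector = selector;
        this.numActions = numActions;
    }

    int chooseAction(double[] observation) {
        return selector.select(observation, q, numActions);
    }

    public static void main(String[] args) {
        // Stand-in value function that always prefers action 2.
        ActionValueFunction q = (obs, a) -> a == 2 ? 1.0 : 0.0;
        SwappableAgentSketch agent =
                new SwappableAgentSketch(q, new EpsilonGreedy(0.1, new Random(42)), 3);
        System.out.println(agent.chooseAction(new double[] {-0.5, 0.0}));
    }
}
```

The flexibility comes at the cost Brian describes: the learning logic is no longer readable top to bottom in one file, because each decision is delegated to whichever implementation was plugged in.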

The code is all in the bt-agentlib.googlecode.com project:


On Aug 8, 2015, at 2:49 AM, Amir Hossein Mojarrad <ahmod...@gmail.com> wrote:

Dear Brian,
Thank you so much for your reply.

The attached file shows what I'm doing, and the output is as you can see in the file.

I just want to confirm with you: is this Sample... using tile coding?
If so, where can I find the source code for the tile coding?


Thank you so much for your time.
Looking forward to hearing from you.
 
<brian.txt>
