Hi,
The objective of tile coding, as that of any function approximation
method, is to allow you to approximate closely the functions of
interest. Tile coding is a specific case of linear function
approximation, with binary, sparse features. Because of this, evaluating
the function value is really fast and is independent of the number of
features involved. This is a nice property to have.
If the tiles are initialized randomly (as they are most of the time),
you can think of tile coding as implementing a form of hashing.
Thus, probably the theory of random hash functions is where you could
find most likely an explanation of when to expect tile coding to work
well. My gut feeling is that tile coding will work fine with many
dimensions provided that there is limited interaction between the
dimensions.
Bests,
- Csaba
PS: A formal explanation of tile coding (for what it worth) can be found
in my book:
http://www.ualberta.ca/~szepesva/RLBook.html
See p. 27 of the pdf.
On 12-06-15 7:27 AM, J. Rosado wrote:
> Ok..let me try this again, since it seems my first try was misunderstood :(
>
> I'm doing work and research in the RL area, applied to biped locomotion.
> But all subjects related to AI and RL were new to me. So i pick up
> Richard Sutton book, and start studying by it. I'm now a bit stalled at
> the mountain car example and tile coding. I was able to create the
> matlab code equivalent to the tiles.c code provided by Richard, which i
> attach, so anyone can use it, if they wish and you can also provide
> feedback and possible bugs feedback. My problem is that i still can't
> understand very well what's the objective of tile coding and how does it
> work. So i'm looking for more information and explanations on that.
>
> Regards,
>
> J. Rosado