Yes, all the learning would be about the position evaluation function. If the network had enough layers to match the "ultimate" evaluation function, there would be no need to search nodes at all: everything would be settled by a very precise evaluation function with the ultimate level of intuition (à la 42, or maybe just à la the intrinsic topological complexity of the full set of chess positions, or of its embedding in the space spanned by a suitably large neural-net basis of functions). That's just a hypothesis, offered as a rephrasing of the above... no tomatoes please, I am just doing a hit and run; reading the above, the itch got too hard to repress.
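
For what it's worth, here is a toy sketch of that hypothesis. It is not chess: it uses a tiny subtraction game (take 1 to 3 stones, whoever takes the last stone wins), chosen only because its exact evaluation is known in closed form. The names `search_value`, `perfect_eval`, `best_move_by_search` and `best_move_by_eval` are made up for this illustration. The point is that a perfect evaluation picks the same moves as an exhaustive tree search, while only looking at the immediate successor positions.

```python
from functools import lru_cache

MOVES = (1, 2, 3)  # assumed toy rules: remove 1, 2 or 3 stones per turn


@lru_cache(maxsize=None)
def search_value(n):
    """Exhaustive negamax over the whole game tree.

    Returns +1 if the side to move wins with best play, -1 otherwise.
    """
    if n == 0:
        # The previous player took the last stone, so the side to move lost.
        return -1
    return max(-search_value(n - m) for m in MOVES if m <= n)


def perfect_eval(n):
    """Closed-form "ultimate" evaluation for this toy game: no search at all."""
    return -1 if n % 4 == 0 else 1


def best_move_by_search(n):
    """Pick a move by scoring each successor with the full tree search."""
    return max((m for m in MOVES if m <= n), key=lambda m: -search_value(n - m))


def best_move_by_eval(n):
    """Pick a move by scoring each successor with the perfect evaluation only."""
    return max((m for m in MOVES if m <= n), key=lambda m: -perfect_eval(n - m))


if __name__ == "__main__":
    for n in range(1, 13):
        print(n, best_move_by_search(n), best_move_by_eval(n))
```

Both move pickers agree on every pile size, which is the hypothesis in miniature. For chess nobody has such a closed form, so this only illustrates the limiting case where the evaluation is exact; with an imperfect evaluation, search is what makes up the difference.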