Currently in training games LCZ has been maintaining a draw rate of ~9%. In recent match games, it is averaging ~33%, and that is with sometime substantial Elo differences between the nets. It would not be unreasonable to estimate a draw rate of ~34% under match conditions for self-play (identical nets).
This means that currently, for our training data, 25% of the total games (1 out of every 4) which would normally have ended in a draw under “best play” according to LCZ, are instead decisive due to a constant temperature of 1.
While I was not sure if producing an incorrect/false result in (minimally) 25% of the training games would be detrimental or not, I became more concerned when considering the numbers exclusively for draws.
Based on the empirical evidence of the match play games, we know that left to its own devices LCZ will draw ~34% of the games against itself. The best case scenario is that no wins are being thrown away in training games (unlikely), which leaves the genuine draw rate at the full 9% for training games. This means that the “normal” result for drawish positions is only achieved 26.4% of the time, and that the expected result is being lost to temperature 73.6% of the time. And those are best case numbers assuming that no part of the 9% of draws is made up of otherwise decisive games decaying into draws due to temperature blunders.
I understand that the role temperature plays is a vital one. LCZ needs the variety and the ability to explore new positions it provides. However, I am a bit concerned that constant temperature of 1 is a bit too effective. Perhaps there is a way to provide the variety and exploration LCZ needs without significantly altering the normal statistical distribution of the games?