The tuning tool reports 1,1TFlops on GTX1080Ti, I don't know how this relates to the units given above.
In more practical dimensions: client produces nearly 1 game per minute, that is 76 years for 40 million games.
Actually the clients uses only 35% of GPU as reported by the taskmanager.
(Starting another client doesn't help - GPU usage goes up to 50%, but it slows down the first client.)
The game itself uses 50% of the GPU. Maybe here is some room for improvement.
And a ready-to-use program using CUDA could probably speed up things.