Rules:
100 big blinds. I don’t mind if they are 50-100 or 1-2; 50-100 would match the no-limit 2-player competition, but 1-2 might be a little easier for developing agents, and iscloser to more common human levels, so I suppose I will advocate 1-2. Then all agents would start with 200. 100 big blinds is most standard for humans. I think there is value in using the same variant as is typical for humans for several reasons. One is that, if we are successful, we can potentially receive significant media attention (and possibly funding by interested parties) for research/future competitions/workshops/etc. Second is that there would possibly be large amounts of human data available to learn from. Etc. There are already significant computational challenges requiring new techniques for going from 2-player to 6-player, and I think 100-bb variant will still be quite difficult computationally.
Interested competitors: please respond in this thread if interested. I believe at least four teams expressed interest already at the workshop.
Scientific merit:
It seems clear to me that significant algorithmic innovations would be needed to develop strong agents for 100 BB 6-player no-limit Texas Hold’em. My understanding is that the best 3-player agents either just run CFR, or fix preflop strategies and just run CFR for the sequences where two players make it to the flop. I think even to apply either of those approaches to 6-player would be highly nontrivial.
Theoretically, it would be very interesting to see how the equilibrium-based approaches can do, if CFR has interesting theoretical guarantees in the multi-player setting, if improvements can obtain theoretical guarantees, etc.
I also think there is room for significant innovations in terms of new abstraction/equilibrium algorithms, game decomposition, and real-time endgame solving beyond what is present for 2-player variants. I think Kevin’s functional regret minimization approach would be applicable, as we could potentially abstract away some information about what the specific history/players are in the hand. E.g., if utg+1 raises and cutoff calls, maybe that could get bucketed with situations where utg+1 raises and button calls. As far as I know, no teams currently use any sort of “player abstraction” for 3-player limit, which seems like an interesting new direction.
I also think that even doing the evaluation could lead to interesting research questions in tournament design/variance reduction that aren’t as relevant for 2-player. I imagine there will be 6+ entrants, but even if there were just 4 or 5, it would be interesting theoretically to determine the “optimal” way of performing the evaluation. E.g., which/how many permutations of the agents to use.
Resources:
I think a limit much lower than 7 seconds per hand (on average) would be ok. Perhaps 2 seconds per hand would be reasonable. Agents fold much more often preflop 6-handed than heads-up, and so there would be many fewer “difficult” postflop decisions to be made per agent.
Statistical significance:
I’m sure there is a lot of information available by searching forums such as 2+2 to determine variance. I am looking at a blog post now that says that “a fairly typical standard deviation” is 90 bb/100: http://www.nsdpoker.com/2011/03/nlhe_6m_pros/. Simple searches show that the SD for HU exceeds 100 bb/100 by comparison. I can ask some experts who have databases over large sample sizes for further information. But I think it’s fair to say that the variance is a bit lower for 6max than HU. Combining this with reducing the time per hand by 71% as described above could make it very realistic to obtain significance. There are questions like how often to permute the players, and whether you can integrate forms of duplicate scoring when doing so, etc. I think those are very interesting research questions.
Win rates will depend heavily on the skill disparity, though they will likely be lower in general than for HU. I certainly expect the top agents will have an average winrate exceeding 10 bb/100, and maybe even 20.
-Sam
Sounds great. I'm in.
Sounds great. I'm in.Is there a protocol / code available for this new 6 player game ?
--
You received this message because you are subscribed to the Google Groups "Annual Computer Poker Competition" group.
To unsubscribe from this group and stop receiving emails from it, send an email to computer-poke...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/computer-poker/7310116c-e297-482d-8e53-8046272542a7%40googlegroups.com.