[OT] Proposal: A skill-based Dixonary rating (and invitation to help evaluate it)

6 views
Skip to first unread message

Daniel B. Widdis

unread,
Jan 19, 2026, 9:01:40 AM (16 hours ago) Jan 19
to Dixonary
TLDR: This email proposes adding a new method of Dixonary skill rating to be published as a “leader board” alongside the scorekeeper’s statistics posts every 25 rounds, and solicits volunteers to help evaluate and tune the rating system. If you don’t care about Dixonary statistics, scores, or ratings, feel free to stop reading now.

If you’re still reading, you’re probably one of several players with at least a passing interest in different measures of Dixonary performance. Mike’s statistics posts scratch that itch for many of us, especially those like me who don’t typically score that high and instead seek validation elsewhere, such as being a tricky dealer or maintaining back-to-back streaks of wins.

One long-standing challenge is that most existing metrics reward longevity more than current performance. Total scores accumulate over thousands of rounds, which Efrem illustrated very clearly in a graph after the Round 3575 stats report. The only reason I’m currently in third place overall is that Paul took a long break from the game. I’m sure he’ll overtake me again eventually, but it will take a while!

Average scores improve on total scores, but they have their own limitations. In the early days of the game there were more active players and more points available, so averages are biased toward those early decades and don’t move much over time. Tim Lodge once wondered whether his performance was declining; Paul responded after Round 3600 with evidence to the contrary; but it’s still difficult for long-term averages to reflect recent performance.

Counting total wins (outright or tied), as Paul also did after Round 3600, addresses the “points available” issue, but it still rewards longevity. In that same post, Paul observed that most games and sports instead use *ratings* to address exactly these problems. I’ll quote him here:

The statistics that Mike has been carefully maintaining since Round 1000 are now such an expected fixture that it has only lately begun to occur to me that they are quite unlike most of the leader boards you will find for other games or sports.
 
Chess and Scrabble players are accorded an elaborately computed rating after every match or tournament. Our equivalent to that is (I suppose) the 5-round rolling scores report. But there are other, simpler league tables, such as you might find at your local squash or tennis club, that list wins, draws and losses.

That “elaborately computed rating” is what most people know as an Elo rating. Very roughly, Elo treats your rating as a measure of skill: beating a lower-rated player changes your rating only slightly, while beating a higher-rated player results in a larger gain. Over time, ratings settle into a reasonably stable reflection of current ability.

Classic Elo works only for two-player games, but there are modern extensions designed for multiplayer games. One of the best-known is Microsoft’s TrueSkill system used to match Xbox players; an open-source equivalent called OpenSkill makes similar ideas available more broadly.  

For Dixonary, this kind of rating has some attractive properties:
  • It is based on relative performance against other players, rewarding consistently ranking ahead of (or tying) strong opponents.
  • It incorporates uncertainty, rewarding consistent performance rather than a few lucky results.
  • It allows uncertainty to grow over time, so new players, or players returning after a long gap (Hi Theresa!), can become competitive in the rankings quickly.
I’ve been experimenting with this approach, and it produces some interesting results (including confirming that Tim Lodge is still doing quite well).

One particularly nice feature is that it also supports *separate* ratings for different roles. In our case, that means dealer versus guesser. While we don’t award points for a D0, it’s clearly an accomplishment; under this system, fooling strong guessers improves your dealer rating: a D2 achieved fooling less consistent guessers may not be as impressive as a D3 fooling expert guessers. Similarly, guessers are rewarded more for correctly identifying words dealt by tricky dealers.

Long story short: I’ve already built a system to calculate these ratings, but before proposing anything official I’d like to invite interested players to an off-group email thread to discuss, evaluate, and tune it, as well help decide what a 25-round report should look like if we decide to publish one.

If you’d like to take part in that discussion, let me know!

Paul Keating

unread,
Jan 19, 2026, 9:06:12 AM (16 hours ago) Jan 19
to dixo...@googlegroups.com
I am, of course, in.

P

Hugo Kornelis

unread,
Jan 19, 2026, 9:38:16 AM (16 hours ago) Jan 19
to dixo...@googlegroups.com
My scores usually suck, so there won't be much ego involved for me in this project. But I am a data geek and a statistics lover.
So count me in!

Op 19-1-2026 om 15:00 schreef Daniel B. Widdis:
--
You received this message because you are subscribed to the Google Groups "Dixonary" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dixonary+u...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/dixonary/CAAbgggrzMghE4WhUS5GzsdLnZ0PJju1mUMcNeOF-1LW6bwHD8w%40mail.gmail.com.

Tim Bourne

unread,
Jan 19, 2026, 9:42:03 AM (16 hours ago) Jan 19
to Dixonary
Yes, that sounds interesting.
Tim B

Judy Madnick

unread,
Jan 19, 2026, 10:12:17 AM (15 hours ago) Jan 19
to dixo...@googlegroups.com
I love the idea but doubt that I would have anything to contribute to a discussion. I enjoy seeing "different measures of Dixonary performance" because I will never move higher in total points and, yes, total points "rewards longevity" (except for John Barrs' total!). I was pleased to have at least surpassed Tim Bourne (sorry, Tim!), who started playing at about the same time that I did. Most likely I gained points during periods when Tim wasn't available to play!

 SHEFLER, M                7366  Started in round:    162  
 LODGE, T                  6534  Started in round:    593  
 WIDDIS, D                 5181  Started in round:    444  
 KEATING, P                4782  Started in round:    186  
 BARRS, J                  4560  Started in round:   1528  
 MADNICK, J                3519  Started in round:   1312  
 BOURNE, T                 3491  Started in round:   1335    

Thanks, Dan, for working on this project!
 
Judy

Original Message
From: "Daniel B. Widdis" <wid...@dixonary.net>
Date: 1/19/2026 9:00:00 AM
Subject: [Dixonary] [OT] Proposal: A skill-based Dixonary rating (and invitation to help evaluate it)

Efrem Mallach

unread,
Jan 19, 2026, 10:20:53 AM (15 hours ago) Jan 19
to dixo...@googlegroups.com
Dan,

Count me in,

I suspect that total scores for the past n rounds played (excluding those played as dealer; n being large enough to smooth out the bumps but small enough to exclude rounds played in a larger group; say 50 or 100) would produce a similar result while being easier to calculate. If someone computes our OpenSkill ratings, it would be interesting to see the correlation. 

Efrem

Tim Lodge

unread,
Jan 19, 2026, 10:48:57 AM (15 hours ago) Jan 19
to Dixonary
Dan

Please include me in the discussion. 

-- Tim L

Chowie

unread,
Jan 19, 2026, 12:28:28 PM (13 hours ago) Jan 19
to dixo...@googlegroups.com
I'm interested in seeing the results, but not so much the discussion. Thanks though!

--
You received this message because you are subscribed to the Google Groups "Dixonary" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dixonary+u...@googlegroups.com.


--
~Bending under the weight of His mercies~


​"For we cannot do anything against the truth, 
but only for the truth​." 
II Corinthians 13:8

Shani Naylor

unread,
Jan 19, 2026, 1:35:54 PM (12 hours ago) Jan 19
to dixo...@googlegroups.com

Glenn Thomas Davis

unread,
Jan 19, 2026, 8:15:58 PM (5 hours ago) Jan 19
to dixo...@googlegroups.com
In.

—Glenn

. . .
The mind is not a vessel to be filled, but a fire to be kindled.
—Plutarch


Reply all
Reply to author
Forward
0 new messages