[OT] Proposal: A skill-based Dixonary rating (and invitation to help evaluate it)

Daniel B. Widdis

unread,

Jan 19, 2026, 9:01:40 AM (16 hours ago) Jan 19

to Dixonary

TLDR: This email proposes adding a new method of Dixonary skill rating to be published as a “leader board” alongside the scorekeeper’s statistics posts every 25 rounds, and solicits volunteers to help evaluate and tune the rating system. If you don’t care about Dixonary statistics, scores, or ratings, feel free to stop reading now.

If you’re still reading, you’re probably one of several players with at least a passing interest in different measures of Dixonary performance. Mike’s statistics posts scratch that itch for many of us, especially those like me who don’t typically score that high and instead seek validation elsewhere, such as being a tricky dealer or maintaining back-to-back streaks of wins.

One long-standing challenge is that most existing metrics reward longevity more than current performance. Total scores accumulate over thousands of rounds, which Efrem illustrated very clearly in a graph after the Round 3575 stats report. The only reason I’m currently in third place overall is that Paul took a long break from the game. I’m sure he’ll overtake me again eventually, but it will take a while!

Average scores improve on total scores, but they have their own limitations. In the early days of the game there were more active players and more points available, so averages are biased toward those early decades and don’t move much over time. Tim Lodge once wondered whether his performance was declining; Paul responded after Round 3600 with evidence to the contrary; but it’s still difficult for long-term averages to reflect recent performance.

Counting total wins (outright or tied), as Paul also did after Round 3600, addresses the “points available” issue, but it still rewards longevity. In that same post, Paul observed that most games and sports instead use *ratings* to address exactly these problems. I’ll quote him here:

The statistics that Mike has been carefully maintaining since Round 1000 are now such an expected fixture that it has only lately begun to occur to me that they are quite unlike most of the leader boards you will find for other games or sports.

Chess and Scrabble players are accorded an elaborately computed rating after every match or tournament. Our equivalent to that is (I suppose) the 5-round rolling scores report. But there are other, simpler league tables, such as you might find at your local squash or tennis club, that list wins, draws and losses.

That “elaborately computed rating” is what most people know as an Elo rating. Very roughly, Elo treats your rating as a measure of skill: beating a lower-rated player changes your rating only slightly, while beating a higher-rated player results in a larger gain. Over time, ratings settle into a reasonably stable reflection of current ability.

Classic Elo works only for two-player games, but there are modern extensions designed for multiplayer games. One of the best-known is Microsoft’s TrueSkill system used to match Xbox players; an open-source equivalent called OpenSkill makes similar ideas available more broadly.

For Dixonary, this kind of rating has some attractive properties:

It is based on relative performance against other players, rewarding consistently ranking ahead of (or tying) strong opponents.
It incorporates uncertainty, rewarding consistent performance rather than a few lucky results.
It allows uncertainty to grow over time, so new players, or players returning after a long gap (Hi Theresa!), can become competitive in the rankings quickly.

I’ve been experimenting with this approach, and it produces some interesting results (including confirming that Tim Lodge is still doing quite well).

One particularly nice feature is that it also supports *separate* ratings for different roles. In our case, that means dealer versus guesser. While we don’t award points for a D0, it’s clearly an accomplishment; under this system, fooling strong guessers improves your dealer rating: a D2 achieved fooling less consistent guessers may not be as impressive as a D3 fooling expert guessers. Similarly, guessers are rewarded more for correctly identifying words dealt by tricky dealers.

Long story short: I’ve already built a system to calculate these ratings, but before proposing anything official I’d like to invite interested players to an off-group email thread to discuss, evaluate, and tune it, as well help decide what a 25-round report should look like if we decide to publish one.

If you’d like to take part in that discussion, let me know!

Paul Keating

unread,

Jan 19, 2026, 9:06:12 AM (16 hours ago) Jan 19

to dixo...@googlegroups.com

I am, of course, in.

P

Hugo Kornelis

unread,

Jan 19, 2026, 9:38:16 AM (16 hours ago) Jan 19

to dixo...@googlegroups.com

My scores usually suck, so there won't be much ego involved for me in this project. But I am a data geek and a statistics lover.
So count me in!

Op 19-1-2026 om 15:00 schreef Daniel B. Widdis:

--
You received this message because you are subscribed to the Google Groups "Dixonary" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dixonary+u...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/dixonary/CAAbgggrzMghE4WhUS5GzsdLnZ0PJju1mUMcNeOF-1LW6bwHD8w%40mail.gmail.com.

Tim Bourne

unread,

Jan 19, 2026, 9:42:03 AM (16 hours ago) Jan 19

to Dixonary

Yes, that sounds interesting.

Tim B

Judy Madnick

unread,

Jan 19, 2026, 10:12:17 AM (15 hours ago) Jan 19

to dixo...@googlegroups.com

I love the idea but doubt that I would have anything to contribute to a discussion. I enjoy seeing "different measures of Dixonary performance" because I will never move higher in total points and, yes, total points "rewards longevity" (except for John Barrs' total!). I was pleased to have at least surpassed Tim Bourne (sorry, Tim!), who started playing at about the same time that I did. Most likely I gained points during periods when Tim wasn't available to play!

SHEFLER, M 7366 Started in round: 162
LODGE, T 6534 Started in round: 593
WIDDIS, D 5181 Started in round: 444
KEATING, P 4782 Started in round: 186
BARRS, J 4560 Started in round: 1528
MADNICK, J 3519 Started in round: 1312
BOURNE, T 3491 Started in round: 1335

Thanks, Dan, for working on this project!

Judy

Original Message

From: "Daniel B. Widdis" <wid...@dixonary.net>

To: dixo...@googlegroups.com;

Date: 1/19/2026 9:00:00 AM

Subject: [Dixonary] [OT] Proposal: A skill-based Dixonary rating (and invitation to help evaluate it)

Efrem Mallach

unread,

Jan 19, 2026, 10:20:53 AM (15 hours ago) Jan 19

to dixo...@googlegroups.com

Dan,

Count me in,

I suspect that total scores for the past n rounds played (excluding those played as dealer; n being large enough to smooth out the bumps but small enough to exclude rounds played in a larger group; say 50 or 100) would produce a similar result while being easier to calculate. If someone computes our OpenSkill ratings, it would be interesting to see the correlation.

Efrem

Tim Lodge

unread,

Jan 19, 2026, 10:48:57 AM (15 hours ago) Jan 19

to Dixonary

Dan

Please include me in the discussion.

-- Tim L

Chowie

unread,

Jan 19, 2026, 12:28:28 PM (13 hours ago) Jan 19

to dixo...@googlegroups.com

I'm interested in seeing the results, but not so much the discussion. Thanks though!

--
You received this message because you are subscribed to the Google Groups "Dixonary" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dixonary+u...@googlegroups.com.

To view this discussion visit https://groups.google.com/d/msgid/dixonary/73db657c-f4c7-4c32-8776-3339cda914a1n%40googlegroups.com.

--

~Bending under the weight of His mercies~

"For we cannot do anything against the truth,

but only for the truth."

II Corinthians 13:8

Shani Naylor

unread,

Jan 19, 2026, 1:35:54 PM (12 hours ago) Jan 19

to dixo...@googlegroups.com

I'm the same as Debbie.

To view this discussion visit https://groups.google.com/d/msgid/dixonary/CANeWAQcbNS%3D9e4a5Kbgd%2BM_08kOnkzMQod2sM_0EX9AwR_m5FQ%40mail.gmail.com.

Glenn Thomas Davis

unread,

Jan 19, 2026, 8:15:58 PM (5 hours ago) Jan 19

to dixo...@googlegroups.com

In.

—Glenn

. . .

The mind is not a vessel to be filled, but a fire to be kindled.

—Plutarch

To view this discussion visit https://groups.google.com/d/msgid/dixonary/CAH53O2dGD1%2BGh1Yj%2BG-kikNTDCn7Jbk_%3DWYVTmTaYNubL%3DYHiA%40mail.gmail.com.

Reply all

Reply to author

Forward