Glu night,
Our meeting will be Saturday, October 19th, 2:30 PM @ 848 Divisadero Street. This week, we are looking at Emergent World Representations where researchers used mechanistic interpretability techniques to determine if Othello-GPT maintains an internal board state. The saga continued with Neel Nanda coming to an even stronger result.
Why Othello drama is cool:
Sonnet 3 + mech interp = Sonnet 3.5
Neel spent only a weekend on the project
Hot damn, check out this graph!
Best,
Cheikh and Sasha
P.S. if you are somehow reading this email but not on our listserv join it here. If you are on our listserv, send it to your friends.