Groups
Groups
Sign in
Groups
Groups
guaranteed-safe-ai
Conversations
About
Send feedback
Help
guaranteed-safe-ai
Contact owners and managers
1–30 of 219
Mark all as read
Report group
0 selected
Orpheus Lummis
Sep 17
GSAI Seminar October 2025 – Model-Based Soft Maximization of Suitable Metrics of Long-Term Human Power (Jobst Heitzig)
You are invited to the October 2025 edition of the Guaranteed Safe AI Seminars: Model-Based Soft
unread,
GSAI Seminar October 2025 – Model-Based Soft Maximization of Suitable Metrics of Long-Term Human Power (Jobst Heitzig)
You are invited to the October 2025 edition of the Guaranteed Safe AI Seminars: Model-Based Soft
Sep 17
Orpheus Lummis
3
Sep 11
GSAI Seminar August 2025 – Towards Safe and Hallucination-Free Coding AIs (GasStationManager)
The event's recording is now available: https://www.youtube.com/watch?v=AhYUoUzAqf8 On Thursday,
unread,
GSAI Seminar August 2025 – Towards Safe and Hallucination-Free Coding AIs (GasStationManager)
The event's recording is now available: https://www.youtube.com/watch?v=AhYUoUzAqf8 On Thursday,
Sep 11
Orpheus Lummis
Sep 10
Speaker recommendations for the Guaranteed Safe AI Seminars
Hi all, We're looking for speaker recommendations for the Guaranteed Safe AI Seminars – a monthly
unread,
Speaker recommendations for the Guaranteed Safe AI Seminars
Hi all, We're looking for speaker recommendations for the Guaranteed Safe AI Seminars – a monthly
Sep 10
Marta Bieńkiewicz
Sep 3
[Seminar Invite]: Do LLM Agents Cooperate? ('Updates in Cooperative AI' series)
The Cooperative AI Foundation is pleased to announce the next seminar in our 'Updates in
unread,
[Seminar Invite]: Do LLM Agents Cooperate? ('Updates in Cooperative AI' series)
The Cooperative AI Foundation is pleased to announce the next seminar in our 'Updates in
Sep 3
Andy E Williams
Aug 8
Final Notice – Free Online Workshop: Visualizing Corrigibility, Alignment, and Safety (Aug 10)
You're invited to a free online workshop: “Visualizing Corrigibility, Alignment, and Safety from
unread,
Final Notice – Free Online Workshop: Visualizing Corrigibility, Alignment, and Safety (Aug 10)
You're invited to a free online workshop: “Visualizing Corrigibility, Alignment, and Safety from
Aug 8
Marta Bieńkiewicz
Jul 31
[Seminar Invite]: Modelling humans with neural networks for behavioural mechanism-design
The Cooperative AI Foundation is pleased to announce the next seminar in our 'Updates in
unread,
[Seminar Invite]: Modelling humans with neural networks for behavioural mechanism-design
The Cooperative AI Foundation is pleased to announce the next seminar in our 'Updates in
Jul 31
Andy E Williams
Jul 14
Visualizing AI Alignment – CFP for AGI-2025 Workshop (Aug 10, Live + Virtual)
Purpose. This workshop invites submissions of 2-page briefs about any model of intelligence of your
unread,
Visualizing AI Alignment – CFP for AGI-2025 Workshop (Aug 10, Live + Virtual)
Purpose. This workshop invites submissions of 2-page briefs about any model of intelligence of your
Jul 14
Orpheus Lummis
3
Jul 11
GSAI Seminar July 2025 – Engineering Rational Cooperative AI via Inverse Planning and Probabilistic Programming (Tan Zhi Xuan)
The recording is now available: https://www.youtube.com/watch?v=uiJ1dmmNL0k On Thu, Jul 3, 2025 at 12
unread,
GSAI Seminar July 2025 – Engineering Rational Cooperative AI via Inverse Planning and Probabilistic Programming (Tan Zhi Xuan)
The recording is now available: https://www.youtube.com/watch?v=uiJ1dmmNL0k On Thu, Jul 3, 2025 at 12
Jul 11
Yudhister Joel Kumar
,
Quinn Dougherty
4
Jul 5
contributing to an open problems list
Thanks! This is all very helpful. Will follow-up privately with clarifying questions in a bit. We
unread,
contributing to an open problems list
Thanks! This is all very helpful. Will follow-up privately with clarifying questions in a bit. We
Jul 5
Jonas Kgomo
Jun 21
Verification of Frontier AI – UN Scientific Advisory Board
The UN Scientific Advisory Board has published a Science Brief on Verification of Frontier AI which
unread,
Verification of Frontier AI – UN Scientific Advisory Board
The UN Scientific Advisory Board has published a Science Brief on Verification of Frontier AI which
Jun 21
Marta Bieńkiewicz
Jun 18
[Seminar Invite] Exploring Multi-Agent Risks in Advanced AI
The Cooperative AI Foundation is delighted to invite you to our 'Exploring Multi-Agent Risks from
unread,
[Seminar Invite] Exploring Multi-Agent Risks in Advanced AI
The Cooperative AI Foundation is delighted to invite you to our 'Exploring Multi-Agent Risks from
Jun 18
Evan Miyazono
Jun 2
paper + opportunity for current grad students
Hi all, ## FM benchmarks I recently flagged CLEVER from Thakur et al (from mostly UT Austin folks)
unread,
paper + opportunity for current grad students
Hi all, ## FM benchmarks I recently flagged CLEVER from Thakur et al (from mostly UT Austin folks)
Jun 2
Evan Miyazono
May 26
FYIs
Hey GSAI community, I'd be surprised if all of this is new, but I'd also be surprised if none
unread,
FYIs
Hey GSAI community, I'd be surprised if all of this is new, but I'd also be surprised if none
May 26
Quinn Dougherty
Mar 14
forethought's new post is pretty GSAI pilled (not in those words)
https://www.forethought.org/research/ai-tools-for-existential-security Just needs a little nudging
unread,
forethought's new post is pretty GSAI pilled (not in those words)
https://www.forethought.org/research/ai-tools-for-existential-security Just needs a little nudging
Mar 14
no...@pibbss.ai
,
Allison Duettmann
2
Feb 15
Symposium on AI Verification, Zagreb on July 21–22, 2025
This is awesome, thanks for sharing! On Sat Feb 8, 2025, 07:29 AM GMT, no...@pibbss.ai wrote: https://
unread,
Symposium on AI Verification, Zagreb on July 21–22, 2025
This is awesome, thanks for sharing! On Sat Feb 8, 2025, 07:29 AM GMT, no...@pibbss.ai wrote: https://
Feb 15
Quinn Dougherty
Feb 10
Proving the Coding Interview: Formally Verified APPS
Our new general purpose programming in Lean benchmark, FVAPPS, is live on huggingface and arxiv. We
unread,
Proving the Coding Interview: Formally Verified APPS
Our new general purpose programming in Lean benchmark, FVAPPS, is live on huggingface and arxiv. We
Feb 10
Evan Miyazono
Feb 7
DARPA program on math
Possibly of interest https://sam.gov/opp/4def3c13ca3947069b1779e7ff697c6a/view > The goal of
unread,
DARPA program on math
Possibly of interest https://sam.gov/opp/4def3c13ca3947069b1779e7ff697c6a/view > The goal of
Feb 7
Quinn Dougherty
, …
Ronak Mehta
6
Feb 5
October Paper Club: models that prove their own correctness
Potentially interesting seminar next week, following up on our discussion about this paper last
unread,
October Paper Club: models that prove their own correctness
Potentially interesting seminar next week, following up on our discussion about this paper last
Feb 5
Agustín Martinez Suñé
,
Evan Miyazono
2
Jan 28
SafePlanBench (work in progress)
Just personally contributed the first $500 - good luck with the rest! On Mon, Jan 27, 2025 at 6:54 AM
unread,
SafePlanBench (work in progress)
Just personally contributed the first $500 - good luck with the rest! On Mon, Jan 27, 2025 at 6:54 AM
Jan 28
Quinn Dougherty
, …
Kris Carlson
3
Jan 24
on deepseek's r1
I'm very interested. Thanks Quinn. - Kris On Fri, Jan 24, 2025 at 5:54 PM Jacques Thibodeau <
unread,
on deepseek's r1
I'm very interested. Thanks Quinn. - Kris On Fri, Jan 24, 2025 at 5:54 PM Jacques Thibodeau <
Jan 24
Quinn Dougherty
Jan 24
Nov-Dec 2024 Progress in Guaranteed Safe AI
https://gsai.substack.com/p/november-december-2024-progress-in Sorry for the radio silence last month
unread,
Nov-Dec 2024 Progress in Guaranteed Safe AI
https://gsai.substack.com/p/november-december-2024-progress-in Sorry for the radio silence last month
Jan 24
J Heitzig
, …
Syed Jafri
3
Jan 14
Anyone going to IASEAI?
This looks interesting but it looks like applications are closed. I would be curious about any future
unread,
Anyone going to IASEAI?
This looks interesting but it looks like applications are closed. I would be curious about any future
Jan 14
Orpheus Lummis
,
Orpheus Lummis
2
Jan 9
GS AI Seminar January 2025 – Using PDDL Planning to Ensure Safety in LLM-based Agents (Agustín Martinez Suñé)
Recording available: https://youtu.be/anbsnwnMpf8?si=Kz73MzCzZAy9tJzF On Thursday, 12 December 2024
unread,
GS AI Seminar January 2025 – Using PDDL Planning to Ensure Safety in LLM-based Agents (Agustín Martinez Suñé)
Recording available: https://youtu.be/anbsnwnMpf8?si=Kz73MzCzZAy9tJzF On Thursday, 12 December 2024
Jan 9
Orpheus Lummis
Jan 5
Guaranteed Safe AI Seminars 2025 – Invitation to present or attend
Hello everyone, Happy New Year! We're excited to invite you all to our 2025 seminar series, which
unread,
Guaranteed Safe AI Seminars 2025 – Invitation to present or attend
Hello everyone, Happy New Year! We're excited to invite you all to our 2025 seminar series, which
Jan 5
Orpheus Lummis
12/15/24
Guaranteed Safe AI Seminars 2024 review
Dear Guaranteed Safe AI enjoyers, Thanks for having participated or being interested in the seminar
unread,
Guaranteed Safe AI Seminars 2024 review
Dear Guaranteed Safe AI enjoyers, Thanks for having participated or being interested in the seminar
12/15/24
Orpheus Lummis
12/8/24
GS AI Seminar December 2024 – Compact Proofs of Model Performance via Mechanistic Interpretability (Louis Jaburi)
You are invited to the December 2024 edition of the Guaranteed Safe AI Seminars: Compact Proofs of
unread,
GS AI Seminar December 2024 – Compact Proofs of Model Performance via Mechanistic Interpretability (Louis Jaburi)
You are invited to the December 2024 edition of the Guaranteed Safe AI Seminars: Compact Proofs of
12/8/24
Quinn Dougherty
11/15/24
Seats available at a small-ish meeting in Berkeley early december
Hey all, AIxFV meeting coming up December 5th and 6th in Berkeley. Room for more, just email me for
unread,
Seats available at a small-ish meeting in Berkeley early december
Hey all, AIxFV meeting coming up December 5th and 6th in Berkeley. Room for more, just email me for
11/15/24
Orpheus Lummis
,
Orpheus Lummis
2
11/14/24
GSAI Seminar November 2024 – Bayesian oracles and safety bounds (Yoshua Bengio)
Recording now available: https://www.youtube.com/watch?v=SIAZKT-VJIU On Friday, October 11th, 2024 at
unread,
GSAI Seminar November 2024 – Bayesian oracles and safety bounds (Yoshua Bengio)
Recording now available: https://www.youtube.com/watch?v=SIAZKT-VJIU On Friday, October 11th, 2024 at
11/14/24
Quinn Dougherty
10/28/24
Next month's paper club: expectational relational hoare logic (blurb about it in this month's newsletter)
https://gsai.substack.com/i/150866523/a-quantitative-probabilistic-relational-hoare-logic: ... If you
unread,
Next month's paper club: expectational relational hoare logic (blurb about it in this month's newsletter)
https://gsai.substack.com/i/150866523/a-quantitative-probabilistic-relational-hoare-logic: ... If you
10/28/24
Quinn Dougherty
10/25/24
anyone have takes on zero knowledge machine learning?
Hey all, if you have takes on ZKML please book me https://calendly.com/quinn-dougherty - i'm
unread,
anyone have takes on zero knowledge machine learning?
Hey all, if you have takes on ZKML please book me https://calendly.com/quinn-dougherty - i'm
10/25/24