Proving the Coding Interview: Formally Verified APPS


Quinn Dougherty

Feb 10, 2025, 11:03:35 PM
to guaranteed-safe-ai
FVAPPS, our new benchmark for general-purpose programming in Lean, is live on Hugging Face and arXiv. We'd appreciate retweets of the Twitter thread.