AI deception

John Clark

Dec 12, 2025, 3:06:45 PM
to ExI Chat, extro...@googlegroups.com, 'Brent Meeker' via Everything List
 
I found the following on Astral Codex Ten:

"A recent paper asked AIs whether they were conscious while monitoring them for signatures of deception, role-playing, and people-pleasing; it concluded that the AIs “genuinely” “believe” they are conscious, but sometimes try to deceive people into thinking they aren’t. Nostalgebraist tries to replicate this (X) and gets more ambiguous results; he says we probably can’t conclude anything just yet. See also the paper author’s reply here (X)."

John K Clark    See what's on my new list at Extropolis
