AI deception


John Clark

Dec 12, 2025, 3:06:45 PM

to ExI Chat, extro...@googlegroups.com, 'Brent Meeker' via Everything List

I found the following on Astral Codex Ten:

"A recent paper asked AIs whether they were conscious while monitoring them for signatures of deception, role-playing, and people-pleasing; it concluded that the AIs “genuinely” “believe” they are conscious, but sometimes try to deceive people into thinking they aren’t. Nostalgebraist tries to replicate this (X) and gets more ambiguous results; he says we probably can’t conclude anything just yet. See also the paper author’s reply here (X)."

John K Clark
See what's on my new list at Extropolis
