Relevant to Jas's comments on my idea for AI agent safety

1 view
Skip to first unread message

Alan Karp

unread,
Feb 27, 2026, 10:23:27 PM (2 days ago) Feb 27
to <friam@googlegroups.com>

from the link he sent out by mistake, https://github.com/nearai/ironclaw.
  • Semantic interposition. Instead of giving the agent raw system access, all interactions go through MCP servers (filesystem, git, etc.). Every tool call passes through a policy engine that can allowdeny, or escalate to the user for approval

--------------
Alan Karp

Mark S. Miller

unread,
Feb 27, 2026, 10:58:30 PM (2 days ago) Feb 27
to fr...@googlegroups.com
Relevant to Jas's comments on my idea for AI agent safety

Sorry I missed it. Where do I find your idea and Jasvir's comments?


--
You received this message because you are subscribed to the Google Groups "friam" group.
To unsubscribe from this group and stop receiving emails from it, send an email to friam+un...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/friam/CANpA1Z3mEBKJa%2B%2ByB38fVL7Ex2dn%2BXjcNMzbLcVW_9UW-CSZzw%40mail.gmail.com.

Alan Karp

unread,
Feb 28, 2026, 1:11:17 AM (yesterday) Feb 28
to fr...@googlegroups.com
Well, then, you'll just have to get your priorities straight and show up, won't you?

My basic idea that I've talked about in earlier friams is pretty much what's in that bullet point. 

--------------
Alan Karp


Ben Laurie

unread,
Feb 28, 2026, 3:51:20 PM (12 hours ago) Feb 28
to fr...@googlegroups.com
OK, and how does that policy engine work?

Alan Karp

unread,
Feb 28, 2026, 7:11:22 PM (9 hours ago) Feb 28
to fr...@googlegroups.com
In my proposal, it does 2 things.  The proxy mediates all communication to and from the LLM agent, and it holds any secrets that the agent needs to authenticate, delegate, invoke, or revoke.  The proxy will only sign requests that satisfy your policy, sort of like a request side PDP.  It knows what data you want kept private and won't distribute it if the agent asks it to.  In a more advanced version, it manages the agent's persistent memory.

--------------
Alan Karp


๏̯͡๏ Jasvir Nagra

unread,
1:36 AM (2 hours ago) 1:36 AM
to fr...@googlegroups.com
The challenge I was having iirc during the call was - "if I am deducing if a message is acceptable or not" just by looking at the message, we are already lost.  I felt that a separate policy that steps in to decide after the fact violated the grant vs use principle on when you do the check that always felt fundamental to me on what made ocap systems ocap systems.

-- 
Jasvir Nagra


Reply all
Reply to author
Forward
0 new messages