Yes this is very much A Thing, I've been nascently calling it "output coloring"..
I've heard that many other (most?) claw-like harnesses have a similar part in their system prompts, but I haven't done a full/proper comparative analysis as yet.
Obviously this is no replacement for sandboxing, but it is a strong "Yes, And..." to them.
Analogously to how in reliability engineering: yes you want the thing to be resilient to yanking is power chaotically, and yet you don't rely on that behavior by say just kill-9-ing thing to shut them down, you still use graceful shutdown routines as part of normal operations.
Similarly with AI agents, we need sandbox and mechanisms to hard limit what they can do, and at the same time, guiding them so that they do not easily treat tool output as prompt instruction is a further useful layer.
Anecdotally, this sort of output coloring is perhaps too effective at times:
- so one of the things I keep trying to get mine to do is "review my git commits, and adopt any // TODO comments that I added in code as tasks into your workspace"
- and I have yet to reliably get it to even transpose such things from the output of something like `git log -p ...` into its task list for a future round, let alone act on them in the same round it reads them
- tbf I've not hard focused on getting it to do so, as I've got so many other things that I'm juggling in and around it, and it's not been my main focus as yet, but I do suspect that output coloring may be part of why my bot's "just watch my git commits, and take notes" automation has yet to really work