AI agents

108 views
Skip to first unread message

Tim Macpherson

unread,
Dec 30, 2025, 2:25:14 PM (9 days ago) 12/30/25
to GWT Users


 Following on from a recent post about AI, I  use basic chatgpt or Gemini. Should I be using AI agents such as Claude Code, OpenAI’s Codex, etc ?  My current AI usage is ok for specifics but awful at remembering things.

Jens

unread,
Dec 31, 2025, 10:53:50 AM (8 days ago) 12/31/25
to GWT Users
What is your goal? AI agents running locally are usually a security nightmare (data deletion, secret exfiltration via prompt injection) so you have two choices:

1.) babysit them all the time and review, understand and confirm any command the agent wants to execute. Otherwise it might delete your drive accidentally and feels really sorry about it or it reads an untrusted website with some prompt injection sections which causes the model to do something totally different. As a middleground you can allow read/search commands by default but still review everything else.

2.) Run the agent autonomously in a secure environment to protect your local data. This can be a local VM, a local docker container (e.g. use devcontainers for development in general or just run the agent in a container) or an OS provided sandbox mechanism. Then you make only the folders accessible to the environment (and thus the AI agent) which are required. If your agent should use the internet for research then you must control network requests (e.g. allowed domains).

Depending on the code size and the task complexity an AI agent is relatively slow as it does a lot of research in the code base first before implementing stuff. So if you work on a laptop it might be annoying to wait for the agent to complete. So the last option is using agents in the cloud which also solves the security issue for your local data as the cloud solutions usually must clone your repository from somewhere.

-- J.

Tim Macpherson

unread,
Dec 31, 2025, 11:49:29 AM (8 days ago) 12/31/25
to google-we...@googlegroups.com
Thanks, I heard the agents in cloud can be a security risk for proprietary code, but there are options to keep it private.  Thinking about it an agent would be overkill for my purposes, I just want something that doesn't have to be constantly corrected because it's forgotten my current stack. It can just about remember that I'm using Java GWT and D3 js, memory of a 4 year old
--
You received this message because you are subscribed to the Google Groups "GWT Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to google-web-tool...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/google-web-toolkit/9585ff0f-015e-4344-931f-6610ae632644n%40googlegroups.com.

Tim Macpherson

unread,
Jan 2, 2026, 1:30:31 PM (6 days ago) Jan 2
to google-we...@googlegroups.com
Gemini just output that for a complex refactor where I'm stripping out pure java logic from a GWT client module incrementally in a correct order so the app isn't broken at any stage: AI  can't do that, agents or whatever. Maybe in 5 years, I have no idea

Jens

unread,
Jan 3, 2026, 11:13:17 AM (5 days ago) Jan 3
to GWT Users
How do you use Gemini / ChatGPT? Via some IDE plugin or just their website? 

For example if you use Github Copilot and use the official IDE plugin then you can store markdown files in your project which describe your stack, coding patterns, etc. The plugin will then read these files and add the content to the context of each chat session you start. That way the model always knows these information. These files are different for models / plugins, e.g. Gemini CLI searches for GEMINI.md files if I remember correctly. You have to read the docs of the tools you are using.

These language models are good at pattern recognition and pattern completion. They cannot read your mind nor have a mind. What matters is that the context passed to the model is good and that it can see clear instructions so it must not guess. Only then the output will be good. 

-- J.

Tim Macpherson

unread,
Jan 3, 2026, 12:20:53 PM (5 days ago) Jan 3
to google-we...@googlegroups.com
I asked chatgpt about GitHub copilot and Gemini CLI,  Re  complex refactor where I'm stripping out pure java logic from a GWT client module incrementally in a correct order so the app isn't broken at any stage:   Chatgpt also says: your instinct — “maybe AI can’t do that” — is correct. It uses an analogy:
AI is good at: rebinding books, rewriting titles, rearranging whole shelves
It is bad at: Knowing which single book is safe to slide now.
That’s human judgement grounded in system knowledge.

Craig Mitchell

unread,
Jan 3, 2026, 8:20:25 PM (5 days ago) Jan 3
to GWT Users
If you use IntelliJ, I highly recommend the Windsurf plugin.  It allows you to use any model you like.  ChatGPT, Gemini, Claude Sonnet, ...

And as Jens says, it reads your project, so it knows what you're doing, and it's scarily good at helping you out (refactoring, or doing anything).

Privacy and keeping intellectual property private probably goes out the window though.  If you need that, you'd need a sandboxed Copilot subscription, or something similar.

Vegegoku

unread,
Jan 4, 2026, 3:01:35 AM (5 days ago) Jan 4
to GWT Users
You can also use codex in the terminal with a persistence context, you start your first session and make it try to learn and understand the project context, draft an md file from what it learns, review and refine, repeat and update the file - for example AI_CONTEXT.md- until you have a solid starting context, add an instruction in the file to update it with new learnings an findings from every subsequencce sessions, add a shell script what ever you like in your project to load codex with that file as context, from that point on you will have an incrementally built persisted context and you dont worry about the agent forgetting all the context you had before., this might also work with any agent as long as you can invoke it with a pre-defined context.

I found this useful for my case. 

David Nouls

unread,
Jan 4, 2026, 3:51:56 AM (5 days ago) Jan 4
to GWT Users
Windsurf can guarantee the privacy. The company where I work is very strict on what we are allowed, but they allow us to use Windsurf. 
 
It is quite impressive at times, but it is not very consistent. One day it blows my mind, the next day (with exact same prompt) it delivers something stupid.
 
But it does help me a lot in trying to out new ideas as long as you are willing to put in extra work to polish it into something usable and maintainable.

Tim Macpherson

unread,
Jan 4, 2026, 12:13:34 PM (4 days ago) Jan 4
to google-we...@googlegroups.com
Thanks all for advice, I'm looking at Gemini cli today with non-agentic commands - don't do anything without asking. It has a generous free tier unlike Codex. It should solve the memory issues with a single Gemini.md context, but already it's done  a 4 minute analysis ignoring the stated dependencies. Generally an improvement though, wondering if $200 pm Codex is worth it.

Vegegoku

unread,
Jan 5, 2026, 9:51:19 AM (3 days ago) Jan 5
to GWT Users
When I mentioned codex I was talking about this https://developers.openai.com/codex/cli and from the pricing page it is available for 20$/month subscriptions.

Tim Macpherson

unread,
Jan 5, 2026, 12:44:03 PM (3 days ago) Jan 5
to google-we...@googlegroups.com
The $20 pm codex is not for full-time usage, but not sure if I need full-time, so far Gemini cli is better than eclipse at tracing references, haven't done much else with it. The UI is awkward compared to webpage AI,  I'd guess most Devs use both, webpage  chat with .md context file would be useful but I haven't heard of that
Reply all
Reply to author
Forward
0 new messages