--
You received this message because you are subscribed to the Google Groups "Ipse Dixit" group.
To unsubscribe from this group and stop receiving emails from it, send an email to Ipse-dixit+...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/Ipse-dixit/CAAq%2BNfeU7R3uB8HNmRRfhjG5TCWqVh8ymCJj6qit3dmHOLL%3D4Q%40mail.gmail.com.
On Apr 8, 2026, at 3:41 PM, Asif Ahsan <aah...@gmail.com> wrote:I find the framing of this product release interesting. By labeling a model as "too dangerous for the public," a company can generate significant brand prestige and "capability hype" to inflate its valuation. Some experts argue this approach is as much a marketing masterclass as it is a security precaution, especially given Anthropic's need for funding.The fact that the new model hallucinates in nearly one in five complex scenarios suggests that, internally, they may be more concerned with the system's limitations than its power. It seems more like a marketing stunt or propaganda to me.Broadly speaking, I think that the use of apocalyptic language like "dangerous" or "the end of humanity" regarding AI is largely performative in the media. These outcomes aren't realistic for a large language model or any model of software in existence.Asif
> "Honey, what did you do after school today?”
I'm no-one's thought leader, but two things I know for sure:
1. Much as little Tommy Friedman may have enjoyed War Games and Hackers, the big risks from this development are NOT posed by rogue teenagers with too much time on their hands.
Alright, I'm being snide. Friedman was 30 when War Games came out and none of us have got any younger. But the two types of actors posing the biggest threats in contemporary cybersecurity are criminals and state actors (or combinations of those–thanks, Vlad!)
Friedman's clichéed invocation of the unsupervised teen in his bedroom doesn't really help any NYT reader understand the real issues at play.
2. A model that can efficiently generate zero-day code exploits is NOT evidence of "Superintelligent A.I."
In fact, it's something that should have been expected. Code is 100% deterministic and automated analysis doesn't require any "world model" or ability to infer the author's intent. Also, the zero-day examples Anthropic has have given (e.g. OpenBSD and FFmpeg) are notably in open-source software.
I'm not saying that to diss open-source—huge portions of our current Internet runs on open-source code and I firmly believe the "many eyes" approach results in better, more-secure systems.
But I'm assuming these examples indicate how Claude Mythos achieved its impressive feat. It analyzed the source code to identify attack vectors, sometimes chaining together as many as four different vulnerabilities to generate an exploit. To me, this is A GOOD THING.
Current systems likely are vulnerable to all the exploits that Anthropic found and many more. The "threat landscape" has certainly been changed by Claude Mythos, and we're somewhat fortunate that Anthropic is behind this because they do have what passes for high moral standards in 2026.
They started out by disclosing some/all of the 1000+ vulnerabilities they found to the relevant developers. This follows the industry-standard Coordinated Vulnerability Disclosure process: https://www.cisa.gov/resources-tools/programs/coordinated-vulnerability-disclosure-program (Fun fact: CISA recommends making these disclosures via Carnegie-Mellon's Vulnerability Information and Coordination Environment, aka VINCE.)
Anthropic also realized that given the huge variety of existing deployed code they couldn't just send out a few notifications and then ship the Claude Mythos model. Instead, they were going to have to give access to major software vendors, so that they could scan their own code. After hand-picking a few close partners to be named in Tuesday's announcement (Marketing needs to have something to keep them busy), they also signed up 40 unnamed vendors to form the second wave of vendor patching. Presumably, there will be an ongoing program of giving additiobal vendors access to Mythos to scan (and patch) ever more niche software.
It's a big effort, but once it's done it's not likely we'll see zero-days appear, well, every day. There is a finite quantity of exploitable vulnerabilities in existing software. And it's reasonable to think that Claude Mythos can find a significant portion of the currently unknown ones. Future versions of this model or another AI vendor's model may discover additional vulnerabilities, but not indefinitely. So, in time, existing software should be better patched. Which means fewer vulnerabilities for a ransomware gang to exploit.
It does seem, however that Anthropic has uncovered a way to get every software vendor to treat them nicely, at least for a while. Because as well as scanning the existing code base, vendors will want to scan their upcoming code releases for anything Claude Mythos can uncover.
To that end, I hope they expand their Project Glasswing initiative to encompass much of the industry, with possible exception of OpenAI and X.ai. ;-)
Going back to Friedman's write-up, he says that "representatives of leading tech companies have been in private conversation with the Trump administration about the implications for the security of the United States and all the other countries that use these now vulnerable software systems".
This is where I get chills. Formerly, CISA could be mostly relied upon to prioritize public good over partisan interest. Like most of the rest of the government cybersecurity world, it has been an essentially non-partisan agency despite the fact that many of its employees have likely never voted for a Democrat.
But I'm seriously concerned that Federal cybersecurity defense is now hamstrung by the same sort of budget cuts, corrupt appointments and general fear that the regime has brought to the rest of government. Here's one example from as long ago as Monday:
> Cuts hit CISA, NIST and IRS in Trump’s FY27 budget. https://www.govexec.com/technology/2026/04/cuts-hit-cisa-nist-and-irs-trumps-fy27-budget/412636/
I think Anthropic and other responsible vendors can probably address the risks of exploit-optimized coding models without Federal coordination, but I'd much prefer this to happen in an environment where CISA is a trustworthy partner and not yet another risk factor.
===
Unexpectedly, I just found myself sitting in presentations yesterday and today from AI researchers at IBM Research and MIT on topics such as giving AI agents memory so they can learn from experience, and inferring causality rather than just correlation. So now I feel thoroughly primed for a whole summer of Ipse Dixit conversations!
Cheers,
Rupert
To view this discussion visit https://groups.google.com/d/msgid/Ipse-dixit/CAGc2QJqz3Ji40C7__gXEWaE6KSmazY8dvmDvUFVXSL9PAU7LSg%40mail.gmail.com.
To view this discussion visit https://groups.google.com/d/msgid/Ipse-dixit/CAEfHkaqvZJK5SvfwsX9kyOvOy3V%3DPNP1PVRthNgjjtfnEetN3g%40mail.gmail.com.
--
You received this message because you are subscribed to the Google Groups "Ipse Dixit" group.
To unsubscribe from this group and stop receiving emails from it, send an email to Ipse-dixit+...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/Ipse-dixit/CAGc2QJoFz1uC3ce8Ts%3Dr54HhVBHvyT0ir4yMEssXsrE7i88F8g%40mail.gmail.com.