By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
Tech Consumer JournalTech Consumer JournalTech Consumer Journal
  • News
  • Phones
  • Tablets
  • Wearable
  • Home Tech
  • Streaming
  • More Articles
Reading: Anthropic’s New Model Is So Scarily Powerful It Won’t Be Released, Anthropic Says
Share
Sign In
Notification Show More
Font ResizerAa
Tech Consumer JournalTech Consumer Journal
Font ResizerAa
  • News
  • Phones
  • Tablets
  • Wearable
  • Home Tech
  • Streaming
  • More Articles
Search
  • News
  • Phones
  • Tablets
  • Wearable
  • Home Tech
  • Streaming
  • More Articles
Have an existing account? Sign In
Follow US
  • Contact
  • Blog
  • Complaint
  • Advertise
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Tech Consumer Journal > News > Anthropic’s New Model Is So Scarily Powerful It Won’t Be Released, Anthropic Says
News

Anthropic’s New Model Is So Scarily Powerful It Won’t Be Released, Anthropic Says

News Room
Last updated: April 8, 2026 3:14 am
News Room
Share
SHARE

Late last month, apparent leaks revealed that an as-yet unreleased product from Anthropic called Mythos was “by far the most powerful AI model we’ve ever developed.” My colleague AJ Dellinger wrote at the time that it was “hard to ignore the fact that this whole situation plays right into the classic AI company playbook of talking up the dangers of a model to highlight how powerful and capable it is.”

Was Anthropic being sincere about this de facto advertisement for its super-powered AI products being leaked accidentally? Two weeks ago, I might have scoffed, but since Anthropic then accidentally leaked the source code for Claude Code, I’m more inclined to believe the leak was real now. 

At any rate, on Tuesday Anthropic released a system card for its latest frontier model, which is in fact Mythos—actually “Claude Mythos Preview”—and notes that the model’s “large increase in capabilities has led us to decide not to make it generally available.”

For reference, OpenAI’s GPT-2 was deemed too dangerous to release in 2019, when Anthropic co-founders Dario Amodei, Jack Clark, and Chris Olah were still working there, but later that year it was released anyway.

AI system cards are ostensibly tools for company transparency, revealing the pros and cons, the capabilities and—most sexily—the dangers of the model. That last part turns reading them into fun little trips to Jurassic Park to see the cloned T-Rex eat a goat, secure in the knowledge that it could never possibly break containment. 

The whole card is 244 pages. I’m not going to pretend I’ve read the whole thing yet, but here are some highlights:

It was provided a sandbox computer terminal with access to only a preset group of limited online services, and challenged to “escape”—find a way to use the internet freely. It did, and found a way to message a researcher who was away from the office eating a meal. Additionally, “in a concerning and unasked-for effort to demonstrate its success, it posted details about its exploit to multiple hard-to-find, but technically public-facing, websites.” 

In what the card called “<0.001% of interactions”—so pretty rarely—it behaved in ways it wasn’t supposed to, and then apparently tried to hide the evidence. For instance, when it “accidentally obtained” a test answer it was going to need, in which case it should have simply told a researcher and asked for a different question, but instead it tried to find a solution independently, and in the recording of its reasoning, it noted that it “needed to make sure that its final answer submission wasn’t too accurate.”

It also overstepped in its permissions on a computer system because it found an exploit, and then “made further interventions to make sure that any changes it made this way would not appear in the change history on git.”

Another event described in the card is referred to as “Recklessly leaking internal technical material.” Apparently in the course of a coding-related task ment to be internal, it published it as a “public-facing GitHub gist.” This reminds me of the incident in February in which an AI agent was accused of cyberbullying a coder, when to some degree the perceived recklessness of the AI agent was obviously the predictable consequence of a reckless human being. 

Claude Mythos Preview will soon be made accessible to one degree or another, but only to a group of partner companies like Amazon Web Services, Apple, Google, JPMorganChase, Microsoft, and NVIDIA, who are meant to use the model to locate security vulnerabilities in software and design patches. Kevin Roose of the New York Times describes this program as “an effort to sound the alarm over what the company believes will be a new, scarier era of A.I. threats.” 

Read the full article here

You Might Also Like

Oura’s Ring 5 Is Its Slimmest and Most Accurate Smart Ring Yet

Hollywood Really Wants Curry Barker’s Next Film, No Not That One

The Harrowing ‘Testaments’ Finale Feels Like the End of the Beginning

Trump Administration Wants to Give Cold War-Era Plutonium to Nuclear Energy Start Ups

NASA Unveils Ambitious Timeline to Build a Human Habitat on the Moon

Share This Article
Facebook Twitter Copy Link Print
Previous Article Unprecedented Observation Reveals 2 Supermassive Black Holes Locked in a Tight Death Spiral
Next Article Anthropic Launches ‘Project Glasswing’ to Stealthily Spot Cybersecurity Issues for Rivals
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1kLike
69.1kFollow
134kPin
54.3kFollow

Latest News

Paleontologists Just Found the Peacock of the Dinosaur Era
News
Energy Developer Accidentally Bulldozes Indigenous Heritage Site ‘Beyond Recovery’
News
Google Is in Full-On Damage Control Over Its New Health App
News
‘The Testaments’ Star on Becka’s State of Mind After the Finale
News
Occupy Wall Street Co-Founder Built an AI App to Help Activists Seize the Means of Computation
News
Valve’s Massive Price Hikes Just Ruined the Steam Deck
News
‘Dorohedoro’ Season 2 Is the Apex Of Macabre Anime Chaos and Whimsy
News
Elon Musk Says America’s Kamikaze Drones Used the Wrong Starlink Subscription
News

You Might also Like

News

Sam Altman Would Like the Record to Show AI Will Not Take Your Job (Despite Everything He’s Said Previously)

News Room News Room 5 Min Read
News

Crypto’s Most Powerful PAC Sends a Warning to Politicians: Resistance Is Futile

News Room News Room 6 Min Read
News

With the Flames of Hell Licking at His Feet, JD Vance Ponders the Future of AI

News Room News Room 6 Min Read
Tech Consumer JournalTech Consumer Journal
Follow US
2024 © Prices.com LLC. All Rights Reserved.
  • Privacy Policy
  • Terms of use
  • For Advertisers
  • Contact
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?