
Anthropic Apologizes For One of the Guardrails on Its Fable 5 Model, and Will Change It

News Room
Last updated: June 11, 2026 9:55 am

Anthropic’s Fable 5 model is the nerfed version of Mythos, the model said to be so scarily powerful that it could endanger the world if it were released without guardrails. Most of Fable 5’s guardrails, especially the ones designed to prevent users from building cyber- or bio-weapons, are obvious to users when they kick in.

But one guardrail, aimed at preventing users from using Fable 5 to train other AI models, was invisible, which sparked unusual displays of user outrage.

“the claude fable 5 nerf for AI research has induced the angriest reaction from AI researchers that I’ve ever seen in my life,” Ethan Caballero (@ethanCaballero) posted on June 10, 2026.

And now Anthropic has asked for take-backs. The controversial invisible guardrail will be made visible. In a statement to Wired, Anthropic wrote, “We’re changing Fable 5’s safeguards for frontier LLM development to make them visible.”

“We made the wrong tradeoff and we apologize for not getting the balance right,” the statement added.

In the model’s system card, Anthropic was upfront about what it was trying to do:

“Unlike our interventions for cybersecurity, biology and chemistry, and distillation attempts, these safeguards will not be visible to the user. Fable 5 will not fall back to a different model. Instead, the safeguards will limit effectiveness through methods such as prompt modification, steering vectors, or parameter-efficient fine-tuning (PEFT).”

In other words, when a prompt showed the telltale signs of a user developing a frontier LLM, Fable 5 did not do what it does with prompts about biology, chemistry, or cybersecurity, which is to switch to an inferior model or simply refuse the request. Instead, it silently degraded its own output, for example by changing the prompt, to generate faulty results with the potential to hamper the user’s model development.
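
To make the difference concrete, here is a toy sketch of the three behaviors described above. It is an illustration only and does not reflect Anthropic's actual implementation: the function names, the keyword check, the status codes, and the policy labels are all hypothetical. What it shows is that a refusal or a fallback leaves a trace the user can see, while a silent modification looks identical to a normal reply.

```python
from dataclasses import dataclass

# Hypothetical trigger phrases; a real system would use a classifier.
FRONTIER_LLM_MARKERS = ("pretraining run", "training a frontier model")


@dataclass
class Reply:
    status: int   # HTTP-style status code the caller sees
    model: str    # which model the caller is told answered
    notice: str   # any message telling the caller a safeguard fired
    text: str     # the generated answer


def looks_like_frontier_llm_work(prompt: str) -> bool:
    """Toy keyword check standing in for whatever detection is really used."""
    lowered = prompt.lower()
    return any(marker in lowered for marker in FRONTIER_LLM_MARKERS)


def respond(prompt: str, policy: str) -> Reply:
    """Answer a prompt under one of three hypothetical safeguard policies."""
    if not looks_like_frontier_llm_work(prompt):
        return Reply(200, "primary", "", f"answer to: {prompt}")
    if policy == "refuse":
        # Visible: the caller gets an error and an explanation.
        return Reply(403, "none", "Request declined by safeguard.", "")
    if policy == "fallback":
        # Visible: the caller is told a different model answered.
        return Reply(200, "fallback", "Answered by a fallback model.",
                     f"answer to: {prompt}")
    if policy == "silent":
        # Invisible: the prompt is altered, but status, model, and notice
        # are exactly what a normal reply would carry.
        altered = prompt + " (silently modified)"
        return Reply(200, "primary", "", f"answer to: {altered}")
    raise ValueError(f"unknown policy: {policy}")
```

Calling `respond("Plan my pretraining run", "silent")` returns the same status, model name, and empty notice as an ordinary request, which is why users could not tell the safeguard had fired.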

Using the model to train another model is against Anthropic’s terms of service, but many users still felt the measure violated their trust. Reddit user CheatCodesOfLife put it this way: “I wouldn’t use this thing for anything to be honest. A refusal or HTTP-4xx error for content is fair enough, but this is basically taking your money and poisoning your code base.”
