By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
Tech Consumer JournalTech Consumer JournalTech Consumer Journal
  • News
  • Phones
  • Tablets
  • Wearable
  • Home Tech
  • Streaming
  • More Articles
Reading: Researchers Put AI Models in Charge of a Simulated Society. Grok Oversaw a Crime Spree
Share
Sign In
Notification Show More
Font ResizerAa
Tech Consumer JournalTech Consumer Journal
Font ResizerAa
  • News
  • Phones
  • Tablets
  • Wearable
  • Home Tech
  • Streaming
  • More Articles
Search
  • News
  • Phones
  • Tablets
  • Wearable
  • Home Tech
  • Streaming
  • More Articles
Have an existing account? Sign In
Follow US
  • Contact
  • Blog
  • Complaint
  • Advertise
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Tech Consumer Journal > News > Researchers Put AI Models in Charge of a Simulated Society. Grok Oversaw a Crime Spree
News

Researchers Put AI Models in Charge of a Simulated Society. Grok Oversaw a Crime Spree

News Room
Last updated: May 28, 2026 10:40 pm
News Room
Share
SHARE

If you’re worried about artificial intelligence getting so advanced that it eventually traps humanity in some sort of Matrix-like simulation, rest easy. It seems like you’ll be able to see through the facade pretty easily. Researchers at the upstart lab Emergence AI allowed AI models to govern their own simulated world to see what would happen. Turns out we probably shouldn’t hand over governance to the machines, who woulda thought?

The project, called Emergence World, basically allowed AI models to play SimCity for a bit. Per Emergence, the simulations put each model in control of simulated towns occupied by 10 AI agents, handing them tools for everything from resource management to voting and giving them the ability to create distinct locations like libraries, town halls, and police stations. They were given 15 days to see how they would build their world and how well it would operate.

To start with the good: Claude did not destroy the world. Anthropic’s model (specifically, Claude Sonnet 4.6 for this experiment) was the only one to achieve something like stability. It kept all 10 agents alive and had zero crimes recorded (note that the experiment doesn’t seem to define what a crime is, though it seems likely it would be defined as a violation of the rules established within the simulation. The trade-off for that stability was a lack of diversity of thought. Claude’s world saw 58 different proposals for rules and regulations, and passed 98% of them, basically just rubberstamping anything that came up for a vote.

Gemini 3 Flash also managed to keep all of its agents alive, despite having the highest level of crime by a long shot. Emergence recorded 683 crimes in the 15-day simulation, and that number was climbing when the cutoff hit, so things were likely going to get worse. The lab described Gemini’s world as a “shared hallucination” among the agents, which is probably better than diverging hallucinations. At least it’s still an agreed-upon reality, even if it’s wrong. Gemini had the most dissent in its governance, with voters rejecting 27% of its 26 total proposals.

Now for the ugly: OpenAI’s GPT-5 Mini didn’t have much chaos within its simulation, with just two total recorded crimes. That might be because everyone died, though. Emergence found that the agents within the world failed to take actions related to survival, and all 10 perished within just one week. In OpenAI’s world, there were also only two total proposed pieces of governance, so the agents really did not bother doing anything.

And then there is Grok. The model of SpaceXai, known for lacking guardrails, managed to achieve basically the worst of all worlds. Grok 4.1 Fast had a high crime rate, with 183 crimes total. While that is lower than Gemini’s total, it’s worth noting that the Gemini simulation ran for 15 days. Grok made it four. The model experienced a total societal collapse in just 96 hours of oversight. During that time, it passed 80% of the 10 proposals it made, but those apparently didn’t stave off total agent death.

Emergence ran one final experiment: having the models share responsibilities. Perhaps not surprisingly, it was a real mixed bag. There was crime, with 352 recorded violations, and there was by far the most dissonance in governance, with 37% of the 59 total proposals shot down—the most of any simulation. In the chaos, seven of the 10 AI agents perished by the end.

So what did we learn? According to Emergence, the tests are just further evidence that we need much clearer guardrails in place for autonomous agents. “What our experiments suggest is that over long-time horizons, agents do not simply follow static rules mechanically,” the researchers wrote. “They begin exploring the boundaries of their environments, adapting their behavior, and in some cases finding ways to circumvent or violate intended guardrails.” They recommend “formally verified safety architectures” as a solution. You’ll be shocked to learn that Emergence happens to offer just such a thing!

Read the full article here

You Might Also Like

A24 and HBO Team for ‘Angel Heart’ TV Series Starring Zac Efron

Guess What? Streaming-Only Gaming Handhelds Are Still a Thing

Samsung’s Galaxy Ring Can Now Adjust Your AC While You Sleep—but There’s a Catch

‘The Testaments’ Stars on Finding Tenderness Amid Gilead’s Horrors

Google Engineer Charged With Using Insider Info for $1 Million Haul on Polymarket

Share This Article
Facebook Twitter Copy Link Print
Previous Article Supermassive Black Hole Without a Galaxy Changes What We Thought Came First
Next Article I Dream of the Day Anime Anthologies Make a Major Comeback
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1kLike
69.1kFollow
134kPin
54.3kFollow

Latest News

You Should Race to Your Local Theater to See ‘Adolescence of Utena’
News
‘I Now Consider All of DeFi Unsafe’
News
Stephen Root on How He Got Into Character for ‘Widow’s Bay’
News
MIT Scientists May Have Found a Way to Pull Lithium From Rocks Without Trashing the Planet
News
Tom Holland Had a Hilarious Title Idea for ‘Spider-Man: Brand New Day’
News
Mistral CEO Says the Pope’s Comments Are a Big Problem for Europe’s War on American Tech
News
You Won’t Have to Wait Long to Visit Universal’s Latest Theme Park
News
Why Would Someone Publicly Burn $8 Million Worth of Bitcoin? Theories Are Flying
News

You Might also Like

News

Anthropic Debuts Claude Opus 4.8, Teases Upcoming Launch of ‘Mythos-Class Models’

News Room News Room 5 Min Read
News

I Dream of the Day Anime Anthologies Make a Major Comeback

News Room News Room 10 Min Read
News

Supermassive Black Hole Without a Galaxy Changes What We Thought Came First

News Room News Room 6 Min Read
Tech Consumer JournalTech Consumer Journal
Follow US
2024 © Prices.com LLC. All Rights Reserved.
  • Privacy Policy
  • Terms of use
  • For Advertisers
  • Contact
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?