By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
Tech Consumer JournalTech Consumer JournalTech Consumer Journal
  • News
  • Phones
  • Tablets
  • Wearable
  • Home Tech
  • Streaming
  • More Articles
Reading: Researchers Put AI Models in Charge of a Simulated Society. Grok Oversaw a Crime Spree
Share
Sign In
Notification Show More
Font ResizerAa
Tech Consumer JournalTech Consumer Journal
Font ResizerAa
  • News
  • Phones
  • Tablets
  • Wearable
  • Home Tech
  • Streaming
  • More Articles
Search
  • News
  • Phones
  • Tablets
  • Wearable
  • Home Tech
  • Streaming
  • More Articles
Have an existing account? Sign In
Follow US
  • Contact
  • Blog
  • Complaint
  • Advertise
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Tech Consumer Journal > News > Researchers Put AI Models in Charge of a Simulated Society. Grok Oversaw a Crime Spree
News

Researchers Put AI Models in Charge of a Simulated Society. Grok Oversaw a Crime Spree

News Room
Last updated: May 28, 2026 10:40 pm
News Room
Share
SHARE

If you’re worried about artificial intelligence getting so advanced that it eventually traps humanity in some sort of Matrix-like simulation, rest easy. It seems like you’ll be able to see through the facade pretty easily. Researchers at the upstart lab Emergence AI allowed AI models to govern their own simulated world to see what would happen. Turns out we probably shouldn’t hand over governance to the machines, who woulda thought?

The project, called Emergence World, basically allowed AI models to play SimCity for a bit. Per Emergence, the simulations put each model in control of simulated towns occupied by 10 AI agents, handing them tools for everything from resource management to voting and giving them the ability to create distinct locations like libraries, town halls, and police stations. They were given 15 days to see how they would build their world and how well it would operate.

To start with the good: Claude did not destroy the world. Anthropic’s model (specifically, Claude Sonnet 4.6 for this experiment) was the only one to achieve something like stability. It kept all 10 agents alive and had zero crimes recorded (note that the experiment doesn’t seem to define what a crime is, though it seems likely it would be defined as a violation of the rules established within the simulation. The trade-off for that stability was a lack of diversity of thought. Claude’s world saw 58 different proposals for rules and regulations, and passed 98% of them, basically just rubberstamping anything that came up for a vote.

Gemini 3 Flash also managed to keep all of its agents alive, despite having the highest level of crime by a long shot. Emergence recorded 683 crimes in the 15-day simulation, and that number was climbing when the cutoff hit, so things were likely going to get worse. The lab described Gemini’s world as a “shared hallucination” among the agents, which is probably better than diverging hallucinations. At least it’s still an agreed-upon reality, even if it’s wrong. Gemini had the most dissent in its governance, with voters rejecting 27% of its 26 total proposals.

Now for the ugly: OpenAI’s GPT-5 Mini didn’t have much chaos within its simulation, with just two total recorded crimes. That might be because everyone died, though. Emergence found that the agents within the world failed to take actions related to survival, and all 10 perished within just one week. In OpenAI’s world, there were also only two total proposed pieces of governance, so the agents really did not bother doing anything.

And then there is Grok. The model of SpaceXai, known for lacking guardrails, managed to achieve basically the worst of all worlds. Grok 4.1 Fast had a high crime rate, with 183 crimes total. While that is lower than Gemini’s total, it’s worth noting that the Gemini simulation ran for 15 days. Grok made it four. The model experienced a total societal collapse in just 96 hours of oversight. During that time, it passed 80% of the 10 proposals it made, but those apparently didn’t stave off total agent death.

Emergence ran one final experiment: having the models share responsibilities. Perhaps not surprisingly, it was a real mixed bag. There was crime, with 352 recorded violations, and there was by far the most dissonance in governance, with 37% of the 59 total proposals shot down—the most of any simulation. In the chaos, seven of the 10 AI agents perished by the end.

So what did we learn? According to Emergence, the tests are just further evidence that we need much clearer guardrails in place for autonomous agents. “What our experiments suggest is that over long-time horizons, agents do not simply follow static rules mechanically,” the researchers wrote. “They begin exploring the boundaries of their environments, adapting their behavior, and in some cases finding ways to circumvent or violate intended guardrails.” They recommend “formally verified safety architectures” as a solution. You’ll be shocked to learn that Emergence happens to offer just such a thing!

Read the full article here

You Might Also Like

I Dream of the Day Anime Anthologies Make a Major Comeback

Supermassive Black Hole Without a Galaxy Changes What We Thought Came First

There’s a Hidden Signal in the Sun, Study Suggests

Airstrikes on Iran’s Oil Facilities Spewed as Much Toxic Sulfur as an Erupting Volcano

The First Successful AI Wearable Won’t Be Your Friend

Share This Article
Facebook Twitter Copy Link Print
Previous Article Supermassive Black Hole Without a Galaxy Changes What We Thought Came First
Next Article I Dream of the Day Anime Anthologies Make a Major Comeback
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1kLike
69.1kFollow
134kPin
54.3kFollow

Latest News

Elon Musk Is Already Preparing to Evict Anthropic from SpaceX’s Data Center
News
Jared Leto and Sam Altman Say They Can Thwart Ticket Scalper Bots by Scanning Your Eyeballs
News
Scientists Identify Potential New Source of Antibiotic-Resistant Superbugs—and It’s Not What You Think
News
‘Sugar’ Season 2 Teases a New Case and More Neo-Noir Ennui
News
Qualcomm’s New ‘Compute’ Chip Wants to Knock the MacBook Neo off Its Pedestal
News
Oura’s Ring 5 Is Its Slimmest and Most Accurate Smart Ring Yet
News
Hollywood Really Wants Curry Barker’s Next Film, No Not That One
News
The Harrowing ‘Testaments’ Finale Feels Like the End of the Beginning
News

You Might also Like

News

Trump Administration Wants to Give Cold War-Era Plutonium to Nuclear Energy Start Ups

News Room News Room 4 Min Read
News

NASA Unveils Ambitious Timeline to Build a Human Habitat on the Moon

News Room News Room 5 Min Read
News

Paleontologists Just Found the Peacock of the Dinosaur Era

News Room News Room 5 Min Read
Tech Consumer JournalTech Consumer Journal
Follow US
2024 © Prices.com LLC. All Rights Reserved.
  • Privacy Policy
  • Terms of use
  • For Advertisers
  • Contact
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?