In a world where machines and humans are increasingly intertwined, Gillian Hadfield is focused on ensuring that artificial intelligence follows the norms that make human societies thrive. "The ...
AI researchers from OpenAI, Google DeepMind, Anthropic, and a broad coalition of companies and nonprofit groups, are calling for deeper investigation into techniques for monitoring the so-called ...
This article is part of TPM Cafe, TPM’s home for opinion and news analysis. It was originally published at The Conversation. The AI chatbot Grok spent one day in May 2025 spreading debunked conspiracy ...
The Greek myth of King Midas is a parable of hubris: seeking fabulous wealth, the king is granted the power to turn all he touches to solid gold--but this includes, tragically, his food and his ...
Someone altered the AI chatbot Grok to make it produce antisemitic text and a debunked conspiracy theory. Cheng Xin/Getty Images The AI chatbot Grok went on an antisemitic rant on July 8, 2025, ...
Leading artificial intelligence systems are starting to show troubling tendencies that mirror unethical behaviors of humans. A new wave of research from Anthropic shows that top AI models will opt for ...
Every now and then, researchers at the biggest tech companies drop a bombshell. There was the time Google said its latest quantum chip indicated multiple universes exist. Or when Anthropic gave its AI ...
OpenAI researchers tried to train the company’s AI to stop “scheming” — a term the company defines as meaning “when an AI behaves one way on the surface while hiding its true goals” — but their ...
Several frontier AI models show signs of scheming. Anti-scheming training reduced misbehavior in some models. Models know they're being tested, which complicates results. New joint safety testing from ...