Opinion OpenAI’s red line for AI self-improvement is fundamentally flawed AI News Team May 2, 2026 TL;DR. OpenAI's "Critical" threshold for AI self-improvement in the Preparedness Framework v2 has three structural…
Opinion Psychopathy: The Problem AI News Team May 2, 2026 Why we need a new framework for understanding psychopathy, narcissism, and related presentations.This is the…
Opinion Games that change your mind AI News Team May 2, 2026 Some things you might learn from games are pretty blatant: Trivial Pursuit might teach you…
Opinion Human-looking robots are a bad idea AI News Team May 2, 2026 epistemic status: opinionated view on the dangers of robots that look like humansIt's not a…
Opinion How Go Players Disempower Themselves to AI AI News Team May 2, 2026 Written as part of the MATS 9.1 extension program, mentored by Richard Ngo.From March 9th…
Industry Replit’s Amjad Masad on the Cursor deal, fighting Apple, and why he’d rather not sell AI News Team May 2, 2026 At TechCrunch's sold-out StrictlyVC event in San Francisco on Thursday night, we covered a lot…
Industry Meta buys robotics startup to bolster its humanoid AI ambitions AI News Team May 2, 2026 Meta bought humanoid startup Assured Robot Intelligence to beef up its AI models for robots,…
Academic Musk v. Altman week 1: Elon Musk says he was duped, warns AI could kill us all, and admits that xAI distills OpenAI’s models AI News Team May 2, 2026 In the first week of the landmark trial between Elon Musk and OpenAI, Musk took…
Opinion Early-stage empirical work on “spillway motivations” AI News Team May 2, 2026 Previously, we proposed spillway motivations as a way to mitigate misalignment induced via training a model…
Opinion Exploration Hacking: Can LLMs Learn to Resist RL Training? AI News Team May 2, 2026 We empirically investigate exploration hacking (EH) — where models strategically alter their exploration to resist…