Opinion Human-looking robots are a bad idea AI News Team May 2, 2026 epistemic status: opinionated view on the dangers of robots that look like humansIt's not a…
Opinion How Go Players Disempower Themselves to AI AI News Team May 2, 2026 Written as part of the MATS 9.1 extension program, mentored by Richard Ngo.From March 9th…
Opinion Early-stage empirical work on “spillway motivations” AI News Team May 2, 2026 Previously, we proposed spillway motivations as a way to mitigate misalignment induced via training a model…
Opinion Exploration Hacking: Can LLMs Learn to Resist RL Training? AI News Team May 2, 2026 We empirically investigate exploration hacking (EH) — where models strategically alter their exploration to resist…
Opinion Conditional misalignment: Mitigations can hide EM behind contextual cues AI News Team May 2, 2026 This is the abstract, introduction, and discussion of our new paper. We study three popular mitigations…
Opinion Ambitious Mech Interp w/ Tensor-transformers on toy languages [Project Proposal] AI News Team May 2, 2026 This is my project proposal for Pivotal. Apply as a mentee by May 3rdThe field…
Opinion Your four-dimensional body AI News Team May 1, 2026 IntroductionYou are a four-dimensional being.And it’s fitting, because you live in a four-dimensional world. Three…
Opinion Housing Roundup #14: You Can’t Build That AI News Team May 1, 2026 Why can’t you build it? Because you aren’t allowed to build it. Not in the…
Opinion What do Russian olympiad winners think of HPMOR? Our data AI News Team May 1, 2026 I gave Claude access to our stats and asked it to generate a page presenting…
Opinion Reflections on InkHaven AI News Team May 1, 2026 If you’ve been wondering why I’m suddenly blogging every day… well, it’s about to stop!…