Opinion Understanding when and why agents scheme AI News Team March 22, 2026 TL;DR: To understand the conditions under which LLM agents engage in scheming behavior, we develop a…
Opinion China Derangement Syndrome AI News Team March 22, 2026 Often I see people claim it’s essential for America to win the AI race against…
Opinion China declares AGI development to be a part of 5-year plan AI News Team March 21, 2026 The CCP writes in its 15th 5-year plan that it will: Encourage innovation in multimodal, agentic,…
Opinion Utrecht Meetup #2, Making Beliefs Pay Rent AI News Team March 21, 2026 Follow-up to Utrecht Meet & Greet. Let's see if we can get our hands dirty. Excited…
Opinion Grounding Coding Agents via Dixit AI News Team March 21, 2026 [Epistemic status: ideas in this post are mine. I've published them previously in the form…
Opinion The Hot Mess Paper Conflates Three Distinct Failure Modes AI News Team March 21, 2026 High-level summary: Anthropic's recent "Hot Mess of AI" paper makes an important empirical observation: as models…
Opinion The Future of Aligning Deep Learning systems will probably look like “training on interp” AI News Team March 21, 2026 Epistemic Status: I think this is right, but a lot of this is empirical, and…
Opinion An agent autonomously builds a 1.5 GHz Linux-capable RISC-V CPU AI News Team March 21, 2026 A project from Verkor, a chip design startup. "Verkor is working with multiple of the…
Opinion Untrusted monitoring: extra bits AI News Team March 21, 2026 The following are some further notes related to untrusted monitoring I had while working on…
Opinion Finding features in Transformers: Contrastive directions elicit stronger low-level perturbation responses than baselines AI News Team March 21, 2026 Note: This is a research update sharing preliminary results as part of ongoing work. Figure 1:…