Opinion LLM Misalignment Can be One Gradient Step Away, and Blackbox Evaluation Cannot Detect It. AI News Team March 15, 2026 Models that appear aligned under black-box evaluation may conceal substantial latent misalignment beneath their observable…
Opinion Bridge Thinking and Wall Thinking AI News Team March 15, 2026 There are a couple of frames I find useful when understanding why different people talk…
Opinion Safe AI Germany (SAIGE) AI News Team March 15, 2026 TL;DR: SAIGE is a national research and field-building initiative, started in January 2026. We believe…
Opinion Self-Recognition Finetuning can Reverse and Prevent Emergent Misalignment AI News Team March 15, 2026 TL;DR: Emergent Misalignment (EM) is correlated with model identity, we find two pieces of evidence for…
Opinion Mini-Munich Succeeds Where KidZania Fails AI News Team March 15, 2026 This post is part of a larger exploration (not yet finished, but you can follow…
Opinion Optimal (And Ethical?) Methods To Find “Optimal Running” AI News Team March 15, 2026 Epistemic Status: The central quote of this essay is just pure slop, of course. But…
Opinion ‘Staying with it’ Done Wrong AI News Team March 15, 2026 I was meditating today and noticed quite some over-effort happening. So I did the diligent,…
Industry US Army announces contract with Anduril worth up to $20B AI News Team March 15, 2026 The Army described this as a single enterprise contract consolidating more than 120 separate "procurement…
Opinion Forecasting Dojo Meetup – postmortem discussion. AI News Team March 15, 2026 Hi Everyone, The next meetup of the forecasting practice group is here! Next week we're…
Industry Meta reportedly considering layoffs that could affect 20% of the company AI News Team March 14, 2026 These layoffs could help Facebook's parent company offset its aggressive spending on AI infrastructure, as…