Opinion ARENA 7.0 Impact Report AI News Team March 20, 2026 The impact report from ARENA’s previous iteration, ARENA 6.0, is available here.Summary:ARENA 7.0 took place…
Opinion The Federal AI Policy Framework: An Improvement, But My Offer Is (Still Almost) Nothing AI News Team March 20, 2026 The Federal AI Policy Framework has been released. Well, it is a four page outline.…
Opinion Confusion around the term reward hacking AI News Team March 20, 2026 Summary: "Reward hacking" commonly refers to two different phenomena: misspecified-reward exploitation, where RL reinforces undesired…
Opinion The Distaff Texts AI News Team March 20, 2026 Though I spend most of my time studying what is labelled “history” in some manuscripts…
Opinion Untrusted Monitoring is Default; Trusted Monitoring is not AI News Team March 20, 2026 These views are my own and not necessarily representative of those of any colleagues with…
Opinion Why I am not buying IPv4 addresses as an investment AI News Team March 20, 2026 2026-01-17 Disclaimer Quick Note I did the actual research back in 2024-10. I polished the…
Opinion Positive-sum interactions between players with linear utility in resources AI News Team March 20, 2026 Sometimes people say things like "If the humans and AIs have linear utility in resources,…
Opinion No, we haven’t uploaded a fly yet AI News Team March 20, 2026 In the last two weeks, social media was set abuzz by claims that scientists had…
Opinion The Case for Low-Competence ASI Failure Scenarios AI News Team March 20, 2026 I think the community underinvests in the exploration of extremely-low-competence AGI/ASI failure modes and explain…
Opinion A List of Research Directions in Character Training AI News Team March 20, 2026 Thanks to Rohan Subramani, Ariana Azarbal, and Shubhorup Biswas for proposing some of the ideas…