Opinion LLMs and Literature: Where Value Actually Comes From AI News Team February 21, 2026 Published on February 21, 2026 1:16 PM GMT. Cross-posted from my Substack. I’m interested in pushback…
Opinion The Spectre haunting the “AI Safety” Community AI News Team February 21, 2026 Published on February 21, 2026 11:14 AM GMT. I’m the originator behind ControlAI’s Direct Institutional Plan…
Opinion LessWrong’s goals overlap HowTruthful’s AI News Team February 21, 2026 Published on February 21, 2026 4:19 AM GMT. On my personal website I have a link…
Opinion Alignment to Evil AI News Team February 21, 2026 Published on February 21, 2026 3:29 AM GMT. One seemingly-necessary condition for a research organization that…
Opinion Reporting Tasks as Reward-Hackable: Better Than Inoculation Prompting? AI News Team February 21, 2026 Published on February 21, 2026 1:59 AM GMT. Epistemic status: untested but seems plausible. TL;DR: making honesty…
Opinion Robert Sapolsky Is Simply Not Talking About Compatibilism AI News Team February 21, 2026 Published on February 21, 2026 1:27 AM GMT. Imagine someone wrote a 500-page book called Taking…
Opinion TT Self Study Journal # 7 AI News Team February 21, 2026 Published on February 21, 2026 1:22 AM GMT. [Epistemic Status: This is an artifact of my…
Opinion How will we do SFT on models with opaque reasoning? AI News Team February 21, 2026 Published on February 21, 2026 12:00 AM GMT. Current LLMs externalize lots of their reasoning in…
Opinion Hodoscope: Visualization for Efficient Human Supervision AI News Team February 21, 2026 Published on February 20, 2026 11:41 PM GMT. This is a link post for our recent…
Opinion Carrot-Parsnip: A Social Deduction Game for LLM Evals AI News Team February 21, 2026 Published on February 20, 2026 11:06 PM GMT. Social Deduction games (SD games) are a class…