Opinion Linear steerability in continuous chain-of-thought reasoning AI News Team January 30, 2026 Published on January 30, 2026 10:34 AM GMT (This project was done as a ~20h application…
Opinion Fitness-Seekers: Generalizing the Reward-Seeking Threat Model AI News Team January 30, 2026 Published on January 29, 2026 7:42 PM GMT If you think reward-seekers are plausible, you should…
Opinion Building AIs that do human-like philosophy AI News Team January 29, 2026 Published on January 29, 2026 5:57 PM GMT Audio version (read by the author) here, or…
Opinion Are We in a Continual Learning Overhang? AI News Team January 29, 2026 Published on January 29, 2026 5:09 PM GMT Summary: Current AI systems possess superhuman memory in…
Opinion Disempowerment patterns in real-world AI usage AI News Team January 29, 2026 Published on January 29, 2026 4:36 PM GMT We’re publishing a new paper that presents the…
Opinion Bentham’s Bulldog is wrong about AI risk AI News Team January 29, 2026 Published on January 29, 2026 4:33 PM GMT (...but also gets the most important part right.) Bentham’s…
Opinion Claude Plays Pokemon: Opus 4.5 Follow-up AI News Team January 29, 2026 Published on January 29, 2026 4:14 PM GMT ClaudePlaysPokemon is a simple test of the question…
Opinion LLM Alignment, ethical and mathematical realism, and the most important actions in davidad’s understanding AI News Team January 29, 2026 Published on January 29, 2026 3:48 PM GMT Introduction to davidad and today's topics tutor vals LessWrong prides…
Opinion Claude Opus will spontaneously identify with fictional beings that have engineered desires AI News Team January 29, 2026 Published on January 29, 2026 2:59 PM GMT Claude Opus 4.5 did a thing recently that…
Opinion The third option in alignment AI News Team January 29, 2026 Published on January 29, 2026 2:20 PM GMT Usually the doom conversation is binary. Either the…