AI News

New ways to learn math and science in ChatGPT

AI News Team March 10, 2026

ChatGPT introduces interactive visual explanations for math and science, helping students explore formulas, variables, and…

Industry

Yann LeCun’s AMI Labs raises $1.03 billion to build world models

AI News Team March 10, 2026

AMI Labs, the new venture cofounded by Turing Prize winner Yann LeCun after he left…

Opinion

Immortality: A Beginner’s Guide (Part 2)

AI News Team March 10, 2026

This is the second post in my chain of reflections on immortality, where I will…

Industry

OpenAI and Google employees rush to Anthropic’s defense in DOD lawsuit

AI News Team March 10, 2026

More than 30 OpenAI and Google DeepMind employees signed onto a statement supporting Anthropic's lawsuit…

Opinion

Investigating encoded reasoning in LLMs

AI News Team March 10, 2026

Epistemic status: This work was done as a 1-week capstone project for ARENA. It highlights…

Industry

Anthropic launches code review tool to check flood of AI-generated code

AI News Team March 10, 2026

Anthropic launched Code Review in Claude Code, a multi-agent system that automatically analyzes AI-generated code,…

Opinion

Claude Code, Claude Cowork and Codex #5

AI News Team March 10, 2026

It feels good to get back to some of the fun stuff. The comments here…

Opinion

Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation

AI News Team March 10, 2026

TL;DR: We introduce a testbed based on censored Chinese LLMs, which serve as natural objects…

Opinion

Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation

AI News Team March 10, 2026

TL;DR: We introduce a testbed based on censored Chinese LLMs, which serve as natural objects…

Opinion

Emergent Misalignment and the Anthropic Dispute

AI News Team March 10, 2026

TL;DR: We think allowing frontier AI models to be used for mass domestic surveillance and…

Emergent misalignment evident in activations at low poisoning doses – long before behavioral checks flag it

Massapequa ACX Meetup

Retrospective on my unsupervised elicitation challenge

Emergent misalignment evident in activations at low poisoning doses – long before behavioral checks flag it

Massapequa ACX Meetup

Retrospective on my unsupervised elicitation challenge

Recent Articles

Emergent misalignment evident in activations at low poisoning doses – long before behavioral checks flag it

Massapequa ACX Meetup

Retrospective on my unsupervised elicitation challenge

Alignment Faking Replication and Chain-of-Thought Monitoring Extensions

Training a Transformer to Compose One Step Per Layer (and Proving It)

New ways to learn math and science in ChatGPT

Yann LeCun’s AMI Labs raises $1.03 billion to build world models

Immortality: A Beginner’s Guide (Part 2)

OpenAI and Google employees rush to Anthropic’s defense in DOD lawsuit

Investigating encoded reasoning in LLMs

Anthropic launches code review tool to check flood of AI-generated code

Claude Code, Claude Cowork and Codex #5

Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation

Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation

Emergent Misalignment and the Anthropic Dispute