Opinion Making LLM Graders Consistent AI News Team January 13, 2026 Published on January 13, 2026 3:32 AM GMTGetting LLMs to be deterministic when scoring the…
Opinion Attempting to influence transformer representations via initialization AI News Team January 13, 2026 Published on January 13, 2026 12:49 AM GMTTL;DROne major obstacle to interpretability is that complicated…
Opinion Brief Explorations in LLM Value Rankings AI News Team January 12, 2026 Published on January 12, 2026 6:16 PM GMTCode and data can be found hereExecutive SummaryWe use…
Opinion Brief Explorations in LLM Value Rankings AI News Team January 12, 2026 Published on January 12, 2026 6:16 PM GMTCode and data can be found hereExecutive SummaryWe use…
Opinion Practical challenges of control monitoring in frontier AI deployments AI News Team January 12, 2026 Published on January 12, 2026 4:45 PM GMTTL;DR: We wrote a safety case sketch for…
Opinion Practical challenges of control monitoring in frontier AI deployments AI News Team January 12, 2026 Published on January 12, 2026 4:45 PM GMTTL;DR: We wrote a safety case sketch for…
Opinion Thinking vs Unfolding AI News Team January 12, 2026 Published on January 12, 2026 3:26 PM GMTJake vs BossMy friend Jake has a difficult…
Opinion Split Personality Training: Revealing Latent Knowledge Through Alternate Personalities (Research Report) AI News Team January 12, 2026 Published on January 12, 2026 12:29 PM GMTSplit Personality Training: Revealing Latent Knowledge Through Alternate…
Opinion Inter-branch communication in the multiverse via trapped ions AI News Team January 12, 2026 Published on January 12, 2026 12:16 PM GMTIn the article "“Proposal for an experimental test…
Opinion –dangerously-skip-permissions AI News Team January 12, 2026 Published on January 12, 2026 7:37 AM GMTI noticed that some AI-safety-focused people are very…