ch-ai-tanya model-psychology LLM wiki

Claude Can Identify Its Intrusive Thoughts

Transformer News ·Transformer News (Substack) ·Jan 31, 2025

News coverage of Lindsey et al.'s introspection study. Includes Jack Lindsey's quote that the key finding was the model noticing "there is an injected concept in the first place," not merely concept identification. Provides accessible framing of the technical results.

cited in