1.5M ratings
277k ratings
JΛMΞS / Blog
anthropic.com

Tracing the thoughts of a large language model

Anthropic's latest interpretability research: a new microscope to understand Claude's internal mechanisms
Apr 25th, 2025