Interpretability Information Center
Get comprehensive updates, key reports, and detailed insights compiled from verified editorial sources.
Get comprehensive updates, key reports, and detailed insights compiled from verified editorial sources.

A surprising fact about modern large language models is that nobody really knows how they work internally. At Anthropic, the ... What's happening inside an AI model as it thinks? Why are AI models sycophantic, and why do they hallucinate? Are AI models ... Take your personal data back with Incogni! Use code WELCHLABS at the link below and get 60% off an annual plan: ... How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to Mechanistic ... Science and engineering are inseparable. Our researchers reflect on the close relationship between scientific and engineering ... Lex Fridman Podcast full episode: Thank you for listening ❤ our ...
MIT 6.S897 Machine Learning for Healthcare, Spring 2019 Instructor: Peter Szolovits View the complete course: ... AI models are trained and not directly programmed, so we don't understand how they do most of the things they do. Our new ... Lex Fridman Podcast full episode: Please support this podcast by checking out ... Atticus Geiger from Pr(Ai)²R Group explores “State of This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed? Art by Clipped from episode 19 of AXRP: Transcript of that episode: ...
Data is compiled from public records and verified media reports.
Last Updated: June 7, 2026

Explore the primary sources for Interpretability.
How can we use the language of causality to understand and edit the internal mechanisms of AI models? Atticus Geiger ... EuroPython 2025 — South Hall 2B on 2025-07-17] *Hacking LLMs: An Introduction to Mechanistic A talk I gave to my MATS 9.0 training program about reasoning model
Below is a handpicked selection of video coverage regarding Interpretability.

For 2026, Interpretability remains one of the most searched-for profiles.
Stay updated on Interpretability's newest achievements.

Disclaimer: