2026/3/1

Paper Walkthroughs

A Unified Theory of Sparse Dictionary Learning in Mechanistic Interpretability: Piecewise Biconvexity and Spurious Minima

The first unified theoretical framework for Sparse Dictionary Learning in mechanistic interpretability, explaining why SDL methods suffer from dead neurons, polysemanticity, and feature absorption—and providing a principled fix grounded in piecewise biconvexity and non-identifiability.

Read More
2026/3/1

Paper Walkthroughs

Modality Gap–Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

A walkthrough of a paper that goes beyond the isotropic assumption to formally decompose the modality gap in CLIP-style models into a Stable Bias and Anisotropic Residuals. The paper introduces ReAlign, a closed-form alignment strategy, and ReVision, a training pipeline enabling text-only pretraining for MLLMs.

Read More
2026/1/19

Paper Walkthroughs

BrainExplore: Large-Scale Discovery of Interpretable Visual Representations in the Human Brain

An automated framework that discovers thousands of interpretable visual concepts encoded across the human visual cortex—from "hands holding objects" to "forest scenes"—revealing fine-grained brain representations previously unreported.

Read More
2025/12/16

Paper Walkthroughs

The Evolution of Multimodal Model Architectures: A Taxonomy

A comprehensive walkthrough of Wadekar et al.'s taxonomy of multimodal architectures. Discover the four distinct architectural patterns—Type-A through Type-D—that define how models like GPT-4V, LLaVA, and Gemini combine vision, language, and other modalities.

Read More
2025/12/15

Paper Walkthroughs

The Linear Representation Hypothesis and the Geometry of Large Language Models

A walkthrough of Park et al.'s ICML 2024 paper on the linear representation hypothesis. We explore how high-level concepts are represented as linear directions in language model representation spaces, with formal definitions and theoretical foundations.

Read More
2026/3/22

云烟遥寄

烟云生细雨,雨落尘土里。偶尔作人形,不时看风景。风景光怪陆离,恍恍好似梦境。梦里若有所思,只当云烟遥寄。

Read More