Explorer
Interpret Features
Local LLM Required
Ensure Ollama is running:
ollama serve
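Before starting an interpretation run, it can help to confirm that the server is reachable. The following is a minimal sketch, not part of this tool: the helper name `ollama_is_running` is hypothetical, and it assumes Ollama is listening on its default address `http://localhost:11434` and exposes the `/api/tags` endpoint that lists local models.

```python
import http.client
import urllib.request


def ollama_is_running(base_url: str = "http://localhost:11434", timeout: float = 2.0) -> bool:
    """Return True if an Ollama server answers at base_url (assumed default address)."""
    try:
        # /api/tags lists locally available models; a 200 response means the server is up.
        with urllib.request.urlopen(f"{base_url}/api/tags", timeout=timeout) as resp:
            return resp.status == 200
    except (OSError, http.client.HTTPException):
        # Connection refused, timeout, DNS failure, or a malformed reply:
        # treat all of these as "not running".
        return False
```

If `ollama_is_running()` returns `False`, start the server with `ollama serve` and try again.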
Methodology
This process uses the Auto-Interpretability method described by O'Neill et al. (2024).
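For each latent feature, the LLM is given a contrastive set of documents: the Top-K documents that activate the feature most strongly, alongside randomly sampled documents on which it does not activate. The LLM is then asked to distill the semantic meaning that separates the two groups.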
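The contrastive selection can be sketched as follows. This is an illustration under stated assumptions, not the exact procedure of O'Neill et al. (2024) or of this tool: the function names, the default values of `k` and `n_random`, the rule that "non-activating" means an activation of zero or less, and the prompt wording are all hypothetical.

```python
import random
from typing import List, Sequence, Tuple


def select_contrastive_sets(
    docs: Sequence[str],
    activations: Sequence[float],
    k: int = 3,
    n_random: int = 3,
    seed: int = 0,
) -> Tuple[List[str], List[str]]:
    """Return (top-k activating docs, random non-activating docs) for one feature."""
    if len(docs) != len(activations):
        raise ValueError("docs and activations must have the same length")
    # Rank document indices by activation, strongest first.
    order = sorted(range(len(docs)), key=lambda i: activations[i], reverse=True)
    top_idx = [i for i in order if activations[i] > 0][:k]
    # Assumption: a document is "non-activating" if the feature's activation is <= 0.
    pool = [i for i in range(len(docs)) if activations[i] <= 0]
    rng = random.Random(seed)
    neg_idx = rng.sample(pool, min(n_random, len(pool)))
    return [docs[i] for i in top_idx], [docs[i] for i in neg_idx]


def build_prompt(positives: Sequence[str], negatives: Sequence[str]) -> str:
    """Format both groups into a single prompt (wording is illustrative only)."""
    lines = ["Documents that strongly activate the feature:"]
    lines += [f"- {d}" for d in positives]
    lines.append("Documents that do not activate the feature:")
    lines += [f"- {d}" for d in negatives]
    lines.append("In one sentence, what concept do the activating documents share "
                 "that the others lack?")
    return "\n".join(lines)
```

The resulting string would then be sent to the local LLM; the fixed `seed` keeps the random negatives reproducible between runs.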