Steering from Scratch
A minimal, well-documented reference implementation of activation steering for LLMs (GPT-2 and Qwen2.5), including PCA-based layer visualization to help identify the most suitable layers to steer. Designed as a companion to the DISCO paper, it demonstrates the core mechanics cleanly enough to serve as a teaching resource.
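
To make the core idea concrete, here is a minimal sketch of activation steering in pure Python. It is not taken from this repository: it assumes a difference-of-means steering vector (one common way to build one) and uses a toy residual network in place of GPT-2 or Qwen2.5. Every name in it (`make_layer`, `forward`, `steering_vector`, `LAYER`, and so on) is hypothetical and chosen only for illustration.

```python
import math
import random

DIM = 4
rng = random.Random(0)  # fixed seed so the toy weights are reproducible

def make_layer():
    """Toy residual layer: h + tanh(W h), with fixed random weights W."""
    w = [[rng.uniform(-0.5, 0.5) for _ in range(DIM)] for _ in range(DIM)]
    def layer(h):
        update = [math.tanh(sum(wij * x for wij, x in zip(row, h))) for row in w]
        return [x + u for x, u in zip(h, update)]
    return layer

def forward(layers, h, steer_at=None, vec=None, alpha=0.0):
    """Run h through all layers, adding alpha * vec to the output of layer steer_at."""
    for i, layer in enumerate(layers):
        h = layer(h)
        if i == steer_at and vec is not None:
            h = [x + alpha * v for x, v in zip(h, vec)]
    return h

def activation_at(layers, h, idx):
    """Unsteered activation after layer idx."""
    return forward(layers[: idx + 1], h)

def mean(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]

def steering_vector(layers, positive, negative, idx):
    """Difference of mean activations at layer idx (assumed construction)."""
    pos = mean([activation_at(layers, h, idx) for h in positive])
    neg = mean([activation_at(layers, h, idx) for h in negative])
    return [p - n for p, n in zip(pos, neg)]

# Toy stand-ins for contrastive prompt sets, already embedded as vectors.
layers = [make_layer() for _ in range(3)]
positive = [[1.0, 0.0, 0.0, 0.0], [0.9, 0.1, 0.0, 0.0]]
negative = [[-1.0, 0.0, 0.0, 0.0], [-0.9, 0.1, 0.0, 0.0]]

LAYER = 1  # hypothetical choice of steering layer
vec = steering_vector(layers, positive, negative, LAYER)

prompt = [0.0, 0.5, -0.5, 0.2]
baseline = forward(layers, prompt)
steered = forward(layers, prompt, steer_at=LAYER, vec=vec, alpha=2.0)
print("baseline:", baseline)
print("steered: ", steered)
```

In a real transformer the same addition would typically be applied to the hidden state at a chosen layer during the forward pass. The scale `alpha` and the layer index are the two knobs, and choosing the layer is the question the PCA visualization is meant to inform.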