Ask a frozen LM what it is thinking, in plain English.
Estimate VRAM needs and speed for LLMs on any GPU
Translate a passage across 961 discourse communities
Steer the model outputs with an activation delta.
Detect LLM generated text.
Per-token reflexivity heatmap from a frozen Qwen2.5-7B