Trained checkpoints for the paper "From Interpretability to Performance: Optimizing Retrieval Heads for Long-Context Language Models"
Youmi Ma
maym15
AI & ML interests
None yet
Recent Activity
published a model 2 days ago
maym15/Olmo-3-7B-Think-RetMask published a model 2 days ago
maym15/Olmo-3-7B-Instruct-RetMask published a model 2 days ago
maym15/Qwen3-8B-RetMask