A Geometric Account of Activation Steering through Angle-Norm Decomposition Paper • 2606.06735 • Published 26 days ago • 26
MCompassRAG: Topic Metadata as a Semantic Compass for Paragraph-Level Retrieval Paper • 2606.18508 • Published 14 days ago • 21
DICE: Diffusion Large Language Models Excel at Generating CUDA Kernels Paper • 2602.11715 • Published Feb 12 • 8
DICE Collection A series of diffusion language models tailored for CUDA kernel generation. • 4 items • Updated Feb 13 • 3
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 107 items • Updated about 13 hours ago • 733
lmstudio-community/Qwen3-4B-Thinking-2507-MLX-8bit Text Generation • 1B • Updated Aug 6, 2025 • 50.3k • 8
meta-llama/Llama-4-Scout-17B-16E-Instruct Image-Text-to-Text • 109B • Updated May 22, 2025 • 736k • 1.32k
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Paper • 2503.03601 • Published Mar 5, 2025 • 234
Vikhrmodels/Vikhr-Qwen-2.5-1.5B-Instruct-MLX_8bit Text Generation • 0.4B • Updated Nov 26, 2024 • 66 • 8