inference-optimization/Laguna-XS.2-speculator.dflash-Qwen235B-500k-ckpt1 0.6B • Updated about 9 hours ago
inference-optimization/Laguna-XS.2-speculator.dflash-Qwen235B-500k-ckpt0 0.6B • Updated 2 days ago • 106
inference-optimization/Llama-4-Scout-1.7B-0.4B-Instruct Image-Text-to-Text • 2B • Updated 4 days ago • 22
inference-optimization/ctest-Qwen3.5-9B-sliding-window-all-speculator.dflash 2B • Updated 4 days ago • 39
inference-optimization/ctest-Qwen3.5-9B-sliding-window-speculator.dflash 2B • Updated 5 days ago • 56
inference-optimization/Qwen3-235B-A22B-Thinking-2507-quantized.w4a16 Text Generation • 32B • Updated 18 days ago • 203
inference-optimization/Qwen3-235B-A22B-Thinking-2507-quantized.w8a8 Text Generation • 235B • Updated 18 days ago • 182
inference-optimization/Qwen3-235B-A22B-Instruct-2507-quantized.w4a16 Text Generation • 32B • Updated 19 days ago • 225
inference-optimization/Qwen3.6-35B-A3B-7.0-bits-mode-noise Image-Text-to-Text • 32B • Updated 19 days ago • 134
inference-optimization/Qwen3.6-35B-A3B-7.0-bits-mode-hybrid Image-Text-to-Text • 32B • Updated 19 days ago • 130
inference-optimization/Qwen3.6-35B-A3B-7.0-bits-mode-heuristic Image-Text-to-Text • 32B • Updated 19 days ago • 160
inference-optimization/Qwen3.6-35B-A3B-6.5-bits-mode-noise Image-Text-to-Text • 30B • Updated 19 days ago • 133
inference-optimization/Qwen3.6-35B-A3B-6.5-bits-mode-hybrid Image-Text-to-Text • 30B • Updated 19 days ago • 118
inference-optimization/Qwen3.6-35B-A3B-6.5-bits-mode-heuristic Image-Text-to-Text • 30B • Updated 19 days ago • 112