Quantization of models designed to fit within the memory constraints of 2x Strix Halo machines. Can also be ran on any generic hardware using vLLM.
🏗️ Building on HF
Sasha
ayysasha
AI & ML interests
None yet
Recent Activity
new activity 2 days ago
z-lab/MiniMax-M2.7-DFlash:Please accept our requests new activity about 1 month ago
z-lab/MiniMax-M2.7-DFlash:Please kindly approve my request to access this model updated a collection about 1 month ago
Dual Strix Halo QuantsOrganizations
None yet