John's picture

John

koifish12

·

AI & ML interests

None yet

Recent Activity

new activity 6 days ago

unsloth/Qwen3.6-35B-A3B-MTP-GGUF:Fast!

new activity 8 days ago

unsloth/Qwen3.6-35B-A3B-NVFP4:Vllm - Out of Memory Error

new activity 9 days ago

z-lab/Qwen3.5-35B-A3B-PARO:can you guys do the 3.6 version?

View all activity

Organizations

None yet

New activity in unsloth/Qwen3.6-35B-A3B-MTP-GGUF 6 days ago

Fast!

#10 opened 8 days ago by

New activity in unsloth/Qwen3.6-35B-A3B-NVFP4 8 days ago

Vllm - Out of Memory Error

#1 opened 16 days ago by

New activity in z-lab/Qwen3.5-35B-A3B-PARO 9 days ago

can you guys do the 3.6 version?

#1 opened 9 days ago by

New activity in trevon/Qwen3.5-27B-MLX-MTP 11 days ago

could you do a 4bit version?

#1 opened 16 days ago by

New activity in z-lab/Qwen3.6-27B-PARO 15 days ago

would this work with mtp?

#1 opened 15 days ago by

New activity in z-lab/Qwen3.6-35B-A3B-DFlash about 1 month ago

thanks for the great work

#1 opened about 1 month ago by

New activity in mlx-community/gemma-4-31b-it-nvfp4 about 1 month ago

model looping during coding

#2 opened about 1 month ago by

New activity in AdrienBrault/Nemotron-Cascade-2-30B-A3B-5bit-MLX 2 months ago

could you create a 8bit mlx please?

#1 opened 2 months ago by

New activity in amd/Qwen3.5-397B-A17B-MXFP4 2 months ago

can you guys do qwen3.5 35b a3b and also the 27b variant?

#5 opened 2 months ago by

New activity in ubergarm/GLM-4.7-Flash-GGUF 4 months ago

question about mxfp4

#3 opened 4 months ago by

New activity in unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF 5 months ago

Please update llama.cpp to see improved performance!

#7 opened 5 months ago by

New activity in nvidia/gpt-oss-120b-Eagle3-throughput 5 months ago

how would you run this with llamacpp?

#1 opened 5 months ago by

New activity in unsloth/Qwen3-Coder-REAP-363B-A35B-GGUF 7 months ago

can we also get the quants for the smaller qwen3 coder and glm 4.5 air reaps?

#2 opened 7 months ago by

New activity in cerebras/Kimi-Linear-REAP-35B-A3B-Instruct 7 months ago

Why not experiment?

#1 opened 7 months ago by

New activity in unsloth/Qwen3-30B-A3B-Instruct-2507-GGUF 7 months ago

Qwen3-30B-A3B-Instruct-2507-UD-Q2_K_XL.gguf output garbled

#8 opened 8 months ago by

New activity in nvidia/parakeet-tdt-0.6b-v3 7 months ago

Streaming question

#18 opened 7 months ago by