shubhamrgandhi/qwen3-8b-full-sft-prm-r2egym-swebench-k5-opus-distill-32k-lr5e6-multiturn Text Generation • 1B • Updated 17 days ago • 311
shubhamrgandhi/qwen3-8b-full-sft-prm-r2egym-k5-opus-distill-32k-lr5e6-multiturn Text Generation • 1B • Updated 27 days ago • 20
shubhamrgandhi/qwen3-8b-full-sft-prm-r2egym-instructions-k10-opus-distill-32k-lr5e6-multiturn Updated about 1 month ago
shubhamrgandhi/qwen3-8b-full-sft-prm-r2egym-instructions-k10-opus-distill-32k-lr5e6-flattened Text Generation • 1B • Updated May 1 • 21
shubhamrgandhi/qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6-multiturn Text Generation • 1B • Updated Apr 29 • 218
shubhamrgandhi/qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6-flattened Text Generation • 1B • Updated Apr 28 • 181
shubhamrgandhi/qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6_clean_think Text Generation • 1B • Updated Mar 28 • 2
shubhamrgandhi/qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6_clean Text Generation • 1B • Updated Mar 27 • 1
shubhamrgandhi/qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6_rejection-sample_think Text Generation • 1B • Updated Mar 27 • 1