Shekswess
/

tiny-think-dpo-math-stem-apo_zero-beta0_5-lr3e-6-e1-bs8

Text Generation

Generated from Trainer

Model card Files Files and versions

tiny-think-dpo-math-stem-apo_zero-beta0_5-lr3e-6-e1-bs8

Commit History

Update README.md

9226524
verified

Shekswess commited on Jan 28

Training in progress, step 358

d1bb1db
verified

Shekswess commited on Jan 18

initial commit

83b4ba7
verified

Shekswess commited on Jan 18