- Open-Reasoner-Zero/Open-Reasoner-Zero-32B
  Reinforcement Learning • 33B • Updated • 181 • 33
- Open-Reasoner-Zero/Open-Reasoner-Zero-7B
  Reinforcement Learning • 8B • Updated • 687 • 34
- Open-Reasoner-Zero/Open-Reasoner-Zero-1.5B
  Reinforcement Learning • 2B • Updated • 631
- Open-Reasoner-Zero/Open-Reasoner-Zero-0.5B
  Reinforcement Learning • 0.5B • Updated • 301
AI & ML interests
Scale up the Reasoner-Zero Training