·
AI & ML interests
LLM post-training
Organizations
ydeng9/OpenVLThinker-grpo-hard
Viewer
• Updated • 6.25k • 27
• 1
ydeng9/OpenVLThinker-grpo-medium
Viewer
• Updated • 3.3k • 14
Viewer
• Updated • 960 • 4
Viewer
• Updated • 2.3k • 9
Viewer
• Updated • 82.8k • 12
Viewer
• Updated • 1.76k • 8
Viewer
• Updated • 1.32k • 9
Viewer
• Updated • 789 • 8
Viewer
• Updated • 6 • 12
ydeng9/swe-smith-rl-distill
Viewer
• Updated • 7.81k • 13
ydeng9/OpenVLThinker-sft-iter3
Viewer
• Updated • 3.28k • 22
ydeng9/OpenVLThinker_sft_iter2
Viewer
• Updated • 5.54k • 6
ydeng9/captioned-data-subsetv1
Viewer
• Updated • 59.3k • 19
Viewer
• Updated • 3.11k • 96
• 1
Viewer
• Updated • 5.87k • 285
• 1