AI & ML interests
Data Science, ML
Organizations
upvoted an article about 1 year ago
Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies
prithivMLmods • 29
Open-R1: a fully open reproduction of DeepSeek-R1
eliebak, lvwerra, lewtun, +1 • 889
upvoted a paper over 1 year ago