sma1-rmarud/olmo3-7b-DPO-original-e2-rlvr-e-attack-stepfinal 7B • Updated about 21 hours ago • 60 • 1
Beyond Text-Dominance: Understanding Modality Preference of Omni-modal Large Language Models Paper • 2604.16902 • Published Apr 18 • 6
madhusudhan001/qwen2.5-0.5b-materials-science Text Generation • 0.5B • Updated 18 days ago • 485 • 1
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published Apr 8 • 324
SkillClaw: Let Skills Evolve Collectively with Agentic Evolver Paper • 2604.08377 • Published Apr 9 • 290
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published Apr 3 • 628
Eimhin03/MCV_Fleurs_Combined_Irish_ASR_No_Aug2_No_lrscheduler Automatic Speech Recognition • 72.6M • Updated Apr 8 • 16 • 1
sentence-transformers/all-MiniLM-L6-v2 Sentence Similarity • 22.7M • Updated Mar 6, 2025 • 260M • 4.8k
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published Mar 20 • 351