The ToolRL model trained for tool use through GRPO
Cheng Qian
chengq9
AI & ML interests
Agent, Tool Learning
Recent Activity
upvoted a paper 5 days ago
Trimming the Long-Tail of Visual World Modeling Evaluation
upvoted a paper 29 days ago
Brick-Composer: Using MLLMs for Assembly with Diverse Bricks
upvoted a paper about 1 month ago
AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints