3DCodeBench: Benchmarking Agentic Procedural 3D Modeling Via Code Paper • 2606.01057 • Published 18 days ago • 8
SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning Paper • 2606.13673 • Published 7 days ago • 97
EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments Paper • 2606.13681 • Published 7 days ago • 135
Skip a Layer or Loop It? Learning Program-of-Layers in LLMs Paper • 2606.06574 • Published 14 days ago • 18
Agent Explorative Policy Optimization for Multimodal Agentic Reasoning Paper • 2605.28774 • Published 22 days ago • 91
Sandboxed Coding Agents are Competitive Omni-modal Task Solvers Paper • 2606.00579 • Published 18 days ago • 2
AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints Paper • 2606.05622 • Published 14 days ago • 40
You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories Paper • 2605.21468 • Published 29 days ago • 50