Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published 6 days ago • 410
Representation over Routing: Overcoming Surrogate Hacking in Multi-Timescale PPO Paper • 2604.13517 • Published 12 days ago • 5
One Sentence, One Drama: Personalized Short-Form Drama Generation via Multi-Agent Systems Paper • 2605.22144 • Published 12 days ago • 10
Multi-Objective and Mixed-Reward Reinforcement Learning via Reward-Decorrelated Policy Optimization Paper • 2605.13641 • Published 20 days ago • 50
GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents Paper • 2604.07429 • Published Apr 8 • 121
RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time Paper • 2604.11626 • Published Apr 13 • 102
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models Paper • 2603.16859 • Published Mar 17 • 248