EnterpriseClawBench: Benchmarking Agents from Real Workplace Sessions Paper • 2606.23654 • Published 6 days ago • 80
PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems Paper • 2606.22388 • Published 7 days ago • 95
ShutterMuse: Capture-Time Photography Guidance with MLLMs Paper • 2606.25763 • Published 4 days ago • 44
Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generation Paper • 2606.26907 • Published 3 days ago • 41
KaLM-Reranker-V1: Fast but Not Late Interaction for Compressed Document Reranking Paper • 2606.22807 • Published 6 days ago • 47
Wan-Streamer v0.1: End-to-end Real-time Interactive Foundation Models Paper • 2606.25041 • Published 5 days ago • 91
MemSlides: A Hierarchical Memory Driven Agent Framework for Personalized Slide Generation with Multi-turn Local Revision Paper • 2606.17162 • Published 13 days ago • 159
Qwen-AgentWorld: Language World Models for General Agents Paper • 2606.24597 • Published 5 days ago • 136
MinerU-Popo: Universal Post-Processing Model for Structured Document Parsing Paper • 2605.24973 • Published May 24 • 1
view article Article Introducing the FFASR Leaderboard: Benchmarking ASR in the Real World +3 daniel-treble, whojavumusic, alessia-treble, georg-goetz, bezzam • 4 days ago • 5
view article Article PP-OCRv6 on Hugging Face: 50-Language OCR from 1.5M to 34.5M Parameters PaddlePaddle • 6 days ago • 26
LoopCoder-v2: Only Loop Once for Efficient Test-Time Computation Scaling Paper • 2606.18023 • Published 12 days ago • 207
JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence Paper • 2606.14777 • Published 18 days ago • 204
view article Article MosaicLeaks: Can your research agent keep a secret? ServiceNow • 10 days ago • 12