Exploration and Exploitation Errors Are Measurable for Language Model Agents Paper • 2604.13151 • Published 27 days ago • 24
changdae/vittle-llavabench-coco-textual-perturbed Viewer • Updated about 1 month ago • 30 • 100
changdae/vittle-llavabench-coco-textual-perturbed Viewer • Updated about 1 month ago • 30 • 100
changdae/vittle-llavabench-coco-visual-perturbed Viewer • Updated about 1 month ago • 270 • 292
changdae/vittle-llavabench-coco-visual-perturbed Viewer • Updated about 1 month ago • 270 • 292
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 Text Generation • 32B • Updated Mar 15 • 1.12M • 730
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16 Text Generation • 32B • Updated Mar 15 • 49.5k • 125