1155

Avi

avahal

AI & ML interests

LLMs

Recent Activity

commentedon a paper about 3 hours ago

AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints

commentedon a paper about 3 hours ago

Code2LoRA: Hypernetwork-Generated Adapters for Code Language Models under Software Evolution

commentedon a paper about 3 hours ago

TIDE: Proactive Multi-Problem Discovery via Template-Guided Iteration

View all activity

Organizations

None yet

commented 4 papers about 3 hours ago

AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints

Paper • 2606.05622 • Published 2 days ago • 32 •

Code2LoRA: Hypernetwork-Generated Adapters for Code Language Models under Software Evolution

Paper • 2606.06492 • Published 2 days ago • 44 •

TIDE: Proactive Multi-Problem Discovery via Template-Guided Iteration

Paper • 2606.04743 • Published 3 days ago • 36 •

ArcANE: Do Role-Playing Language Agents Stay in Character at the Right Time?

Paper • 2606.05553 • Published 2 days ago • 40 •

commented 4 papers 1 day ago

Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories

Paper • 2606.02060 • Published 5 days ago • 49 •

Trust Region On-Policy Distillation

Paper • 2606.01249 • Published 6 days ago • 41 •

From Activation to Causality: Discovery of Causal Visual Representations in the Human Brain

Paper • 2605.23895 • Published 15 days ago • 51 •

OCC-RAG: Optimal Cognitive Core for Faithful Question Answering

Paper • 2606.00683 • Published 7 days ago • 83 •

commented 8 papers 2 days ago

A Matter of TASTE: Improving Coverage and Difficulty of Agent Benchmarks

Paper • 2605.28556 • Published 10 days ago • 63 •

Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding

Paper • 2605.29707 • Published 9 days ago • 139 •

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters

Paper • 2606.02437 • Published 5 days ago • 174 •

Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs

Paper • 2605.30611 • Published 9 days ago • 189 •

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

Paper • 2605.30993 • Published 8 days ago • 56 •

Trust-Region Behavior Blending for On-Policy Distillation

Paper • 2605.31159 • Published 8 days ago • 64 •

GrepSeek: Training Search Agents for Direct Corpus Interaction

Paper • 2605.29307 • Published 9 days ago • 102 •

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

Paper • 2605.31264 • Published 8 days ago • 105 •

commented 4 papers 9 days ago

MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research

Paper • 2605.26114 • Published 12 days ago • 64 •

SpatialBench: Is Your Spatial Foundation Model an All-Round Player?

Paper • 2605.27367 • Published 11 days ago • 71 •

EvalVerse: Pipeline-Aware and Expert-Calibrated Benchmarking for Professional Cinematic Video Generation

Paper • 2605.23271 • Published 15 days ago • 79 •

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published 11 days ago • 138 •

Avi

AI & ML interests

Recent Activity

Organizations

avahal's activity