
Avi

avahal

AI & ML interests

LLMs

Recent Activity

commented on a paper about 3 hours ago

Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories

commented on a paper about 5 hours ago

Trust Region On-Policy Distillation

commented on a paper about 5 hours ago

From Activation to Causality: Discovery of Causal Visual Representations in the Human Brain


Organizations

None yet

commented on a paper about 3 hours ago

Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories

Paper • 2606.02060 • Published 4 days ago • 40

commented on 3 papers about 5 hours ago

Trust Region On-Policy Distillation

Paper • 2606.01249 • Published 5 days ago • 37

From Activation to Causality: Discovery of Causal Visual Representations in the Human Brain

Paper • 2605.23895 • Published 14 days ago • 50

OCC-RAG: Optimal Cognitive Core for Faithful Question Answering

Paper • 2606.00683 • Published 6 days ago • 81

commented on 8 papers about 15 hours ago

A Matter of TASTE: Improving Coverage and Difficulty of Agent Benchmarks

Paper • 2605.28556 • Published 9 days ago • 61

Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding

Paper • 2605.29707 • Published 8 days ago • 135

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters

Paper • 2606.02437 • Published 4 days ago • 168

Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs

Paper • 2605.30611 • Published 8 days ago • 184

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

Paper • 2605.30993 • Published 7 days ago • 56

Trust-Region Behavior Blending for On-Policy Distillation

Paper • 2605.31159 • Published 7 days ago • 64

GrepSeek: Training Search Agents for Direct Corpus Interaction

Paper • 2605.29307 • Published 8 days ago • 98

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

Paper • 2605.31264 • Published 7 days ago • 104

commented on 4 papers 8 days ago

MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research

Paper • 2605.26114 • Published 11 days ago • 64

SpatialBench: Is Your Spatial Foundation Model an All-Round Player?

Paper • 2605.27367 • Published 10 days ago • 71

EvalVerse: Pipeline-Aware and Expert-Calibrated Benchmarking for Professional Cinematic Video Generation

Paper • 2605.23271 • Published 14 days ago • 79

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published 10 days ago • 137

commented on 4 papers 9 days ago

Seeing the Needle in the Haystack: Towards Weakly-Supervised Log Instance Anomaly Localization via Counterfactual Perturbation

Paper • 2605.10988 • Published 27 days ago • 3

Decoding the Critique Mechanism in Large Reasoning Models

Paper • 2603.16331 • Published 14 days ago

ClaimDiff-RL: Fine-Grained Caption Reinforcement Learning through Visual Claim Comparison

Paper • 2605.20278 • Published 12 days ago • 1

Pixel-Level Pavement Distress Assessment Using Instance Segmentation

Paper • 2605.26095 • Published 11 days ago
