Basit mustafa

BasitMustafa

AI & ML interests

None yet

Recent Activity

upvoted a paper about 15 hours ago

Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses

Paper • 2606.02373 • Published 7 days ago • 46

upvoted a paper about 19 hours ago

X-Stream: Exploring MLLMs as Multiplexers for Multi-Stream Understanding

Paper • 2606.02482 • Published 7 days ago • 34

upvoted a paper about 20 hours ago

Streaming Communication in Multi-Agent Reasoning

Paper • 2606.05158 • Published 5 days ago • 29

upvoted 2 papers 1 day ago

Rethinking Continual Experience Internalization for Self-Evolving LLM Agents

Paper • 2606.04703 • Published 5 days ago • 18

Benchmarks are Not Enough: RAMP for Runtime Assessing of Agentic Models in Production Systems

Paper • 2605.27492 • Published 13 days ago • 24

upvoted a paper 2 days ago

Code2LoRA: Hypernetwork-Generated Adapters for Code Language Models under Software Evolution

Paper • 2606.06492 • Published 4 days ago • 66

liked a model 2 days ago

google/magenta-realtime-2

Text-to-Audio • Updated 3 days ago • 13.3k • 123

upvoted a paper 2 days ago

Eliciting Complex Spatial Reasoning in MLLMs through Wide-Baseline Matching

Paper • 2606.03577 • Published 6 days ago • 15

liked a model 3 days ago

nvidia/nemotron-3.5-asr-streaming-0.6b

Automatic Speech Recognition • Updated 1 day ago • 3.44k • 234

liked a dataset 3 days ago

gydou/DeonticBench

Viewer • Updated 3 days ago • 6.48k • 323 • 6

upvoted 3 papers 3 days ago

liked a model 3 days ago

General-Instinct/InstinctRazor-Qwen3.5-122B-A10B-GGUF

Text Generation • 122B • Updated about 16 hours ago • 2.2k • 17

upvoted 2 papers 3 days ago

Self-Distilled Policy Gradient

Paper • 2606.04036 • Published 6 days ago • 22

Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories

Paper • 2606.02060 • Published 7 days ago • 50

upvoted a paper 4 days ago

LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

Paper • 2605.31584 • Published 10 days ago • 41

upvoted 2 papers 5 days ago

Representation Forcing for Bottleneck-Free Unified Multimodal Models

Paper • 2605.31604 • Published 10 days ago • 58

Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction

Paper • 2605.05242 • Published May 3 • 122

liked a Space 5 days ago

The ultimate guide to RL environments: building and scaling them in the LLM era

📝 Building and scaling RL environments for LLM training • 181
