Zhenghai Xue

ZhenghaiXue

AI_Defender

AI & ML interests

Reinforcement Learning

Recent Activity

upvoted a paper about 1 month ago

Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation

upvoted a paper 6 months ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

upvoted a paper 6 months ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Organizations

Collections 1

Papers 3

arxiv:2509.02479

arxiv:2505.10978

arxiv:2403.17918

Models 4

ZhenghaiXue/gigpo_qwen2.5_3b_sim0.3_step150

3B • Updated Jul 30, 2025 • 1

ZhenghaiXue/gigpo_qwen2.5_3b_sim0.5_step150

3B • Updated Jul 30, 2025 • 2

ZhenghaiXue/Qwen2.5-7B-SimpleTIR

Reinforcement Learning • 8B • Updated Jul 8, 2025 • 6 • 1

ZhenghaiXue/Qwen2.5-32B-SimpleTIR

33B • Updated Jul 8, 2025 • 4

Datasets 0

None public yet