Sipeng Zhang's picture

Sipeng Zhang

SipengZ

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

commentedon a paper 7 months ago

A Tale of LLMs and Induced Small Proxies: Scalable Agents for Knowledge Mining

upvoted a paper 7 months ago

A Tale of LLMs and Induced Small Proxies: Scalable Agents for Knowledge Mining

View all activity

Organizations

None yet

SipengZ 's models 5

SipengZ/DPO_maxmin_Qwen7B

Updated Jun 1, 2025

SipengZ/Qwen2.5-7B-Instruct_v10

Updated May 17, 2025

SipengZ/Qwen2.5-3B-DPO

3B • Updated May 16, 2025 • 4

SipengZ/SFT

8B • Updated May 16, 2025 • 1

SipengZ/test

Updated May 9, 2025