Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
1
4
2
Sipeng Zhang
SipengZ
Follow
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
2 days ago
Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key
commented
on
a paper
7 months ago
A Tale of LLMs and Induced Small Proxies: Scalable Agents for Knowledge Mining
upvoted
a
paper
7 months ago
A Tale of LLMs and Induced Small Proxies: Scalable Agents for Knowledge Mining
View all activity
Organizations
None yet
SipengZ
's models
5
Sort: Recently updated
SipengZ/DPO_maxmin_Qwen7B
Updated
Jun 1, 2025
SipengZ/Qwen2.5-7B-Instruct_v10
Updated
May 17, 2025
SipengZ/Qwen2.5-3B-DPO
3B
•
Updated
May 16, 2025
•
4
SipengZ/SFT
8B
•
Updated
May 16, 2025
•
1
SipengZ/test
Updated
May 9, 2025