Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
2
3
Yurun Yuan
PRO
RyanYr
Follow
xuanfeiren's profile picture
21world's profile picture
ziadrone's profile picture
6 followers
·
2 following
yurun-yuan
AI & ML interests
None yet
Recent Activity
updated
a dataset
20 days ago
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0_matheval
updated
a model
20 days ago
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0
published
a model
20 days ago
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0
View all activity
Organizations
None yet
RyanYr
's models
30
Sort: Recently updated
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0
Updated
20 days ago
•
54
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0_200
Updated
20 days ago
•
7
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_piref_kl_behavior
Updated
20 days ago
•
28
RyanYr/pg_sais-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_kl
Updated
20 days ago
•
53
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_piref_nokl
Updated
20 days ago
•
55
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_piref_nokl
Updated
20 days ago
•
55
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_piref_kl
Updated
20 days ago
•
55
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_piref_kl_behavior
Updated
20 days ago
•
59
RyanYr/pg_trajis-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_piref
Updated
20 days ago
•
57
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_piref_kl_behavior
Updated
20 days ago
•
57
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_piref
Updated
20 days ago
•
56
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_piref_kl
Updated
20 days ago
•
55
RyanYr/pg_sais-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_kl
Updated
20 days ago
•
54
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_piref_nokl
Updated
20 days ago
•
7
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_piref_kl
Updated
20 days ago
•
8
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_nokl
Updated
21 days ago
•
43
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_kl
Updated
21 days ago
•
38
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_kl_behavior
Updated
21 days ago
•
37
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_nokl
Updated
21 days ago
•
41
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_nokl
Updated
21 days ago
•
34
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_kl
Updated
21 days ago
•
35
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_kl_behavior
Updated
21 days ago
•
32
RyanYr/pg_trajis-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B
Updated
21 days ago
•
38
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B
Updated
21 days ago
•
40
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_kl_behavior
Updated
22 days ago
•
47
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_kl
Updated
22 days ago
•
27
RyanYr/grpo-dapo-qwen2.5-math-1.5B-n4
Updated
22 days ago
RyanYr/grpo-dapo-qwen3-1.7B-Base-mbs128-n4
Updated
Apr 20
RyanYr/grpo-dapo_offline-qwen2.5math-1.5B-base-mbs256-n8_actor
Updated
Feb 25
•
6
RyanYr/grpo-dapo-01_offline-qwen2.5math-1.5B-base-mbs256-n8_actor
Updated
Feb 25