Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
5
2
1
Violet Xiang
PRO
violetxi
Follow
AlgoDistill's profile picture
John6666's profile picture
2 followers
·
9 following
violetxi
AI & ML interests
None yet
Recent Activity
updated
a model
about 6 hours ago
violetxi/teacher_tooluse_grpo_kl-1
published
a model
about 6 hours ago
violetxi/teacher_tooluse_grpo_kl-1
updated
a model
2 days ago
violetxi/poker-rl-wm05
View all activity
Organizations
violetxi
's datasets
161
Sort: Recently updated
violetxi/clbench-poker-halftest-traj-gemini-future-summary
Viewer
•
Updated
2 days ago
•
1.76k
•
32
violetxi/clbench-poker-halftest-traj-qwen-future-summary
Viewer
•
Updated
2 days ago
•
55
•
30
violetxi/clbench-poker-halftest-traj-gemini-next-state
Viewer
•
Updated
2 days ago
•
1.76k
•
29
violetxi/clbench-poker-halftest-traj-qwen-next-state
Viewer
•
Updated
2 days ago
•
55
•
29
violetxi/clbench-poker-halftest-traj-gemini-analysis
Viewer
•
Updated
2 days ago
•
1.76k
•
30
violetxi/clbench-poker-halftest-traj-qwen-analysis
Viewer
•
Updated
2 days ago
•
55
•
32
violetxi/clbench-poker-halftest-traj-gemini-nowm
Viewer
•
Updated
2 days ago
•
495
•
26
violetxi/clbench-poker-halftest-traj-qwen-nowm
Viewer
•
Updated
2 days ago
•
110
•
31
violetxi/clbench-exploitable-poker-wm-sft-future-summary
Viewer
•
Updated
4 days ago
•
956
•
37
violetxi/clbench-exploitable-poker-wm-sft-next-state
Viewer
•
Updated
4 days ago
•
1.04k
•
35
violetxi/imo-answerbench_stage1_qwen8b_no-thinking
Viewer
•
Updated
10 days ago
•
25.6k
•
27
violetxi/imo-answerbench_stage1_qwen8b_grpo
Viewer
•
Updated
10 days ago
•
25.6k
•
34
violetxi/imo-answerbench_stage1_qwen8b_dense_outcome
Viewer
•
Updated
10 days ago
•
25.6k
•
28
violetxi/olympiad_physics_stage1_qwen8b_dense_outcome
Viewer
•
Updated
10 days ago
•
15.1k
•
29
violetxi/imo-answerbench_stage1_qwen8b_opsd
Viewer
•
Updated
10 days ago
•
25.6k
•
31
violetxi/olympiad_physics_stage1_qwen8b_grpo
Viewer
•
Updated
10 days ago
•
15.1k
•
35
violetxi/olympiad_physics_stage1_qwen8b_no-thinking
Viewer
•
Updated
10 days ago
•
15.1k
•
31
violetxi/lcb_v5_stage1_qwen8b_grpo
Viewer
•
Updated
11 days ago
•
5.28k
•
26
violetxi/gpqa_diamond_stage1_qwen8b_grpo
Viewer
•
Updated
11 days ago
•
12.7k
•
34
violetxi/olympiad_physics_stage1_qwen8b_opsd
Viewer
•
Updated
11 days ago
•
15.1k
•
29
violetxi/hmmt-nov-2025_stage1_qwen8b_grpo
Viewer
•
Updated
11 days ago
•
1.92k
•
27
violetxi/gpqa_diamond_stage1_qwen8b_opsd
Viewer
•
Updated
11 days ago
•
12.7k
•
32
violetxi/aime26_stage1_qwen8b_grpo
Viewer
•
Updated
11 days ago
•
1.92k
•
29
violetxi/lcb_v5_stage1_qwen8b_opsd
Viewer
•
Updated
11 days ago
•
5.28k
•
27
violetxi/gpqa_diamond_stage1_qwen8b_dense_outcome
Viewer
•
Updated
11 days ago
•
12.7k
•
28
violetxi/hmmt-nov-2025_stage1_qwen8b_opsd
Viewer
•
Updated
11 days ago
•
1.92k
•
24
violetxi/aime25_stage1_qwen8b_grpo
Viewer
•
Updated
11 days ago
•
1.92k
•
48
•
1
violetxi/aime26_stage1_qwen8b_dense_outcome
Viewer
•
Updated
11 days ago
•
1.92k
•
26
violetxi/lcb_v5_stage1_qwen8b_no-thinking
Viewer
•
Updated
11 days ago
•
5.28k
•
25
violetxi/hmmt-nov-2025_stage1_qwen8b_dense_outcome
Viewer
•
Updated
11 days ago
•
1.92k
•
26
•
1
Previous
1
2
3
...
6
Next