-
CL-From-Nothing/student_prefix_kukurasu_20K_continual_Qwen3_4B_Thinking_nemotron-cascade-8b_epoch_3_mask
8B • Updated • 12 -
CL-From-Nothing/student_prefix_kukurasu_20K_continual_Qwen3_4B_Thinking_qwen3-1.7b_epoch_3_mask
2B • Updated • 6 -
CL-From-Nothing/student_prefix_minesweeper_kukurasu_continual_Qwen3_4B_Thinking_nemtron_cascade-8b
8B • Updated • 7 -
CL-From-Nothing/student_prefix_minesweeper_kukurasu_continual_Qwen3_4B_Thinking_qwen3-1.7b
2B • Updated • 6
AI & ML interests
None defined yet.
Recent Activity
View all activity
Ablation datasets for cutoff-based completion experiments.
-
CL-From-Nothing/kukurasu-qwen1.7b-cutoff512-completed-by-qwen3-4b-thinking-r16384
Viewer • Updated • 20k • 11 -
CL-From-Nothing/kukurasu-qwen1.7b-cutoff1024-completed-by-qwen3-4b-thinking-r16384
Viewer • Updated • 20k • 15 -
CL-From-Nothing/kukurasu-qwen1.7b-cutoff2048-completed-by-qwen3-4b-thinking-r16384
Viewer • Updated • 20k • 9 -
CL-From-Nothing/kukurasu-nemotron8b-cutoff512-completed-by-qwen3-4b-thinking-r16384
Viewer • Updated • 20k • 7
-
CL-From-Nothing/student_prefix_kukurasu_20K_continual_Qwen3_4B_Thinking_nemotron-cascade-8b_epoch_3_mask
8B • Updated • 12 -
CL-From-Nothing/student_prefix_kukurasu_20K_continual_Qwen3_4B_Thinking_qwen3-1.7b_epoch_3_mask
2B • Updated • 6 -
CL-From-Nothing/student_prefix_minesweeper_kukurasu_continual_Qwen3_4B_Thinking_nemtron_cascade-8b
8B • Updated • 7 -
CL-From-Nothing/student_prefix_minesweeper_kukurasu_continual_Qwen3_4B_Thinking_qwen3-1.7b
2B • Updated • 6
Ablation datasets for cutoff-based completion experiments.
-
CL-From-Nothing/kukurasu-qwen1.7b-cutoff512-completed-by-qwen3-4b-thinking-r16384
Viewer • Updated • 20k • 11 -
CL-From-Nothing/kukurasu-qwen1.7b-cutoff1024-completed-by-qwen3-4b-thinking-r16384
Viewer • Updated • 20k • 15 -
CL-From-Nothing/kukurasu-qwen1.7b-cutoff2048-completed-by-qwen3-4b-thinking-r16384
Viewer • Updated • 20k • 9 -
CL-From-Nothing/kukurasu-nemotron8b-cutoff512-completed-by-qwen3-4b-thinking-r16384
Viewer • Updated • 20k • 7
models 41
CL-From-Nothing/Qwen3-4B-OPD-math-hard-509-step60
4B • Updated
CL-From-Nothing/Qwen3-4B-OPD-math-hard-509-step45
4B • Updated
CL-From-Nothing/Qwen3-1.7B-TokenReward-Minesweeper-MixedSFT-Thinking-epoch3
2B • Updated • 19
CL-From-Nothing/Qwen3-1.7B-GRPO-Minesweeper-MixedSFT-Thinking-epoch3
2B • Updated • 55
CL-From-Nothing/Qwen3-1.7B-TokenReward-Survo-DedupRL
2B • Updated • 17
CL-From-Nothing/Qwen3-1.7B-TokenReward-Minesweeper-StitchSFT-epoch3
2B • Updated • 13
CL-From-Nothing/Qwen3-1.7B-TokenReward-Minesweeper-MixedSFT-epoch3
2B • Updated • 19
CL-From-Nothing/Qwen3-1.7B-GRPO-Minesweeper-StitchSFT
2B • Updated • 14
CL-From-Nothing/Qwen3-1.7B-GRPO-Minesweeper-MixedSFT
2B • Updated • 17
CL-From-Nothing/Qwen3-1.7B-OPD-distill-stitch-minesweeper
2B • Updated • 15
datasets 102
CL-From-Nothing/RLVE-Eval20-Qwen3-4B-SSD-N20-SFT-Train
Viewer • Updated • 16k • 33
CL-From-Nothing/RLVE-Eval20-Qwen3-1.7B-SSD-N20-SFT-Train
Viewer • Updated • 16k • 42
CL-From-Nothing/rlve-eval20-qwen3-4b-n4-randcut512-4096x20-completed-by-qwen3-4b-thinking-r16384
Viewer • Updated • 64k • 28
CL-From-Nothing/rlve_teacher
Viewer • Updated • 32k • 39
CL-From-Nothing/RLVE-Multi-Task-Teacher
Preview • Updated • 92
CL-From-Nothing/RLVE-Eval
Viewer • Updated • 156 • 48
CL-From-Nothing/rlve-multitask-qwen3-4b-n4-randcut512-4096x20-completed-by-qwen3-4b-thinking-r16384
Viewer • Updated • 42.8k • 34
CL-From-Nothing/rlve-multitask-qwen3-4b-rollouts-n4-tokens16384
Viewer • Updated • 3.2k • 37
CL-From-Nothing/rlve-teacher-completion-qwen3-4b-thinking
Viewer • Updated • 3k • 235
CL-From-Nothing/FrozenLake-Hard-Trajectories
Viewer • Updated • 8k • 22