Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
SISReL
PRO
SISReL
Follow
0 followers
·
1 following
https://sisrel.kaist.ac.kr
sisrel
AI & ML interests
None yet
Recent Activity
updated
a model
2 days ago
SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-4B-Base-1.0
updated
a model
2 days ago
SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-8B-Base-1.0
published
a model
5 days ago
SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-8B-Base-1.0
View all activity
Organizations
models
44
Sort: Recently updated
SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-4B-Base-1.0
Updated
2 days ago
•
1
SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-8B-Base-1.0
Updated
2 days ago
SISReL/math-ReverseRLSD-CorrectOnly-Qwen3-8B-nothink-0.5-decay-30
Updated
5 days ago
SISReL/math-RLSD-reprompt-Qwen3-4B-Base
Updated
5 days ago
SISReL/math-RLSD-csfooter-Qwen3-4B-Base
Updated
5 days ago
SISReL/code-SDPO-DeepSeek-R1-Distill-Qwen-7B-Think-Off-lcb-v5-train-v6-eval
Updated
7 days ago
SISReL/math-SDPO-template2-DeepSeek
Updated
9 days ago
SISReL/math-SDPO-DeepSeek-ref-think-tag-remove
Updated
9 days ago
SISReL/math-SDPO-DeepSeek-R1-Distill-Qwen-ref
Updated
9 days ago
SISReL/math-GRPO-DeepSeek
Updated
9 days ago
View 44 models
datasets
0
None public yet