Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
SISReL
PRO
SISReL
Follow
0 followers
·
1 following
https://sisrel.kaist.ac.kr
sisrel
AI & ML interests
None yet
Recent Activity
updated
a model
3 days ago
SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-4B-Base-1.0
updated
a model
3 days ago
SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-8B-Base-1.0
published
a model
6 days ago
SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-8B-Base-1.0
View all activity
Organizations
SISReL
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
2 models
3 days ago
SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-4B-Base-1.0
Updated
3 days ago
•
1
SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-8B-Base-1.0
Updated
3 days ago
published
2 models
6 days ago
SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-8B-Base-1.0
Updated
3 days ago
SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-4B-Base-1.0
Updated
3 days ago
•
1
updated
a model
6 days ago
SISReL/math-ReverseRLSD-CorrectOnly-Qwen3-8B-nothink-0.5-decay-30
Updated
6 days ago
published
a model
6 days ago
SISReL/math-ReverseRLSD-CorrectOnly-Qwen3-8B-nothink-0.5-decay-30
Updated
6 days ago
updated
2 models
6 days ago
SISReL/math-RLSD-reprompt-Qwen3-4B-Base
Updated
6 days ago
SISReL/math-RLSD-csfooter-Qwen3-4B-Base
Updated
6 days ago
published
2 models
7 days ago
SISReL/math-RLSD-reprompt-Qwen3-4B-Base
Updated
6 days ago
SISReL/math-RLSD-csfooter-Qwen3-4B-Base
Updated
6 days ago
updated
a model
8 days ago
SISReL/code-SDPO-DeepSeek-R1-Distill-Qwen-7B-Think-Off-lcb-v5-train-v6-eval
Updated
8 days ago
published
a model
8 days ago
SISReL/code-SDPO-DeepSeek-R1-Distill-Qwen-7B-Think-Off-lcb-v5-train-v6-eval
Updated
8 days ago
updated
a model
9 days ago
SISReL/math-SDPO-template2-DeepSeek
Updated
9 days ago
published
a model
9 days ago
SISReL/math-SDPO-template2-DeepSeek
Updated
9 days ago
updated
a model
9 days ago
SISReL/math-SDPO-DeepSeek-ref-think-tag-remove
Updated
9 days ago
published
a model
9 days ago
SISReL/math-SDPO-DeepSeek-ref-think-tag-remove
Updated
9 days ago
updated
a model
9 days ago
SISReL/math-SDPO-DeepSeek-R1-Distill-Qwen-ref
Updated
9 days ago
published
a model
9 days ago
SISReL/math-SDPO-DeepSeek-R1-Distill-Qwen-ref
Updated
9 days ago
updated
a model
9 days ago
SISReL/math-GRPO-DeepSeek
Updated
9 days ago
published
a model
9 days ago
SISReL/math-GRPO-DeepSeek
Updated
9 days ago
Load more