SISReL PRO

SISReL

https://sisrel.kaist.ac.kr

sisrel

AI & ML interests

None yet

Recent Activity

updated a model 3 days ago

SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-4B-Base-1.0

updated a model 3 days ago

SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-8B-Base-1.0

published a model 6 days ago

SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-8B-Base-1.0

updated 2 models 3 days ago

SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-4B-Base-1.0

Updated 3 days ago • 1

SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-8B-Base-1.0

Updated 3 days ago

published 2 models 6 days ago

SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-8B-Base-1.0

Updated 3 days ago

SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-4B-Base-1.0

Updated 3 days ago • 1

updated a model 6 days ago

SISReL/math-ReverseRLSD-CorrectOnly-Qwen3-8B-nothink-0.5-decay-30

Updated 6 days ago

published a model 6 days ago

SISReL/math-ReverseRLSD-CorrectOnly-Qwen3-8B-nothink-0.5-decay-30

Updated 6 days ago

updated 2 models 6 days ago

SISReL/math-RLSD-reprompt-Qwen3-4B-Base

Updated 6 days ago

SISReL/math-RLSD-csfooter-Qwen3-4B-Base

Updated 6 days ago

published 2 models 7 days ago

SISReL/math-RLSD-reprompt-Qwen3-4B-Base

Updated 6 days ago

SISReL/math-RLSD-csfooter-Qwen3-4B-Base

Updated 6 days ago

updated a model 8 days ago

SISReL/code-SDPO-DeepSeek-R1-Distill-Qwen-7B-Think-Off-lcb-v5-train-v6-eval

Updated 8 days ago

published a model 8 days ago

SISReL/code-SDPO-DeepSeek-R1-Distill-Qwen-7B-Think-Off-lcb-v5-train-v6-eval

Updated 8 days ago

updated a model 9 days ago

SISReL/math-SDPO-template2-DeepSeek

Updated 9 days ago

published a model 9 days ago

SISReL/math-SDPO-template2-DeepSeek

Updated 9 days ago

updated a model 9 days ago

SISReL/math-SDPO-DeepSeek-ref-think-tag-remove

Updated 9 days ago

published a model 9 days ago

SISReL/math-SDPO-DeepSeek-ref-think-tag-remove

Updated 9 days ago

updated a model 9 days ago

SISReL/math-SDPO-DeepSeek-R1-Distill-Qwen-ref

Updated 9 days ago

published a model 9 days ago

SISReL/math-SDPO-DeepSeek-R1-Distill-Qwen-ref

Updated 9 days ago

updated a model 9 days ago

SISReL/math-GRPO-DeepSeek

Updated 9 days ago

published a model 9 days ago

SISReL/math-GRPO-DeepSeek

Updated 9 days ago

