SISReL's picture

SISReL PRO

SISReL

·

https://sisrel.kaist.ac.kr

sisrel

AI & ML interests

None yet

Recent Activity

updated a model 2 days ago

SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-4B-Base-1.0

updated a model 2 days ago

SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-8B-Base-1.0

published a model 5 days ago

SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-8B-Base-1.0

View all activity

Organizations

models 44

SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-4B-Base-1.0

Updated 2 days ago • 1

SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-8B-Base-1.0

Updated 2 days ago

SISReL/math-ReverseRLSD-CorrectOnly-Qwen3-8B-nothink-0.5-decay-30

Updated 5 days ago

SISReL/math-RLSD-reprompt-Qwen3-4B-Base

Updated 5 days ago

SISReL/math-RLSD-csfooter-Qwen3-4B-Base

Updated 5 days ago

SISReL/code-SDPO-DeepSeek-R1-Distill-Qwen-7B-Think-Off-lcb-v5-train-v6-eval

Updated 7 days ago

SISReL/math-SDPO-template2-DeepSeek

Updated 9 days ago

SISReL/math-SDPO-DeepSeek-ref-think-tag-remove

Updated 9 days ago

SISReL/math-SDPO-DeepSeek-R1-Distill-Qwen-ref

Updated 9 days ago

SISReL/math-GRPO-DeepSeek

Updated 9 days ago

datasets 0

None public yet