Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
SISReL's picture

SISReL PRO

SISReL
·
https://sisrel.kaist.ac.kr
  • sisrel

AI & ML interests

None yet

Recent Activity

updated a model 2 days ago
SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-4B-Base-1.0
updated a model 2 days ago
SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-8B-Base-1.0
published a model 5 days ago
SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-8B-Base-1.0
View all activity

Organizations

Smart Information Systems Research Lab KAIST's profile picture

models 44

SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-4B-Base-1.0

Updated 2 days ago • 1

SISReL/math-RLRTcorrect-RLSDincorrect-Qwen3-8B-Base-1.0

Updated 2 days ago

SISReL/math-ReverseRLSD-CorrectOnly-Qwen3-8B-nothink-0.5-decay-30

Updated 5 days ago

SISReL/math-RLSD-reprompt-Qwen3-4B-Base

Updated 5 days ago

SISReL/math-RLSD-csfooter-Qwen3-4B-Base

Updated 5 days ago

SISReL/code-SDPO-DeepSeek-R1-Distill-Qwen-7B-Think-Off-lcb-v5-train-v6-eval

Updated 7 days ago

SISReL/math-SDPO-template2-DeepSeek

Updated 9 days ago

SISReL/math-SDPO-DeepSeek-ref-think-tag-remove

Updated 9 days ago

SISReL/math-SDPO-DeepSeek-R1-Distill-Qwen-ref

Updated 9 days ago

SISReL/math-GRPO-DeepSeek

Updated 9 days ago
View 44 models

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs