ishikaa/acquisition_student_gpt_llama8bins_numina_diversity Text Generation • 8B • Updated 5 days ago • 38
ishikaa/acquisition_student_gpt_qwen3bins_medmcqa_gradient Text Generation • 3B • Updated 5 days ago • 21
ishikaa/acquisition_student_gpt_qwen3bins_medmcqa_answer_variance Text Generation • 3B • Updated 5 days ago • 25
ishikaa/acquisition_student_gpt_qwen3bins_medmcqa_proximity Text Generation • 3B • Updated 5 days ago • 22
ishikaa/acquisition_student_gpt_qwen3bins_medmcqa_diversity Text Generation • 3B • Updated 5 days ago • 25
ishikaa/acquisition_student_RL_random_numina_llama8bins Text Generation • 8B • Updated 5 days ago • 19
ishikaa/acquisition_student_RL_random_medmcqa_llama8bins Text Generation • 8B • Updated 5 days ago • 13
ishikaa/acquisition_student_RL_base_llama8bins_medmcqa Text Generation • 8B • Updated 5 days ago • 26
ishikaa/acquisition_student_RL_llama8bins_medmcqa_format Text Generation • 8B • Updated 6 days ago • 25
ishikaa/acquisition_student_RL_llama8bins_medmcqa_proximity Text Generation • 8B • Updated 6 days ago • 23
ishikaa/acquisition_student_RL_llama8bins_numina_diversity Text Generation • 8B • Updated 6 days ago • 25
ishikaa/acquisition_student_RL_llama8bins_medmcqa_confidence Text Generation • 8B • Updated 6 days ago • 24
ishikaa/acquisition_student_RL_llama8bins_medmcqa_diversity Text Generation • 8B • Updated 6 days ago • 24
ishikaa/acquisition_student_RL_llama8bins_numina_gradient Text Generation • 8B • Updated 6 days ago • 25
ishikaa/acquisition_student_RL_DataEnvGym_numina_llama8bins Text Generation • 8B • Updated 6 days ago • 25
ishikaa/acquisition_student_RL_llama8bins_medmcqa_gradient Text Generation • 8B • Updated 6 days ago • 18
ishikaa/acquisition_student_RL_llama8bins_numina_confidence Text Generation • 8B • Updated 6 days ago • 25
ishikaa/acquisition_student_RL_llama8bins_numina_proximity Text Generation • 8B • Updated 6 days ago • 25
ishikaa/acquisition_student_RL_llama8bins_numina_format Text Generation • 8B • Updated 6 days ago • 25
ishikaa/acquisition_student_RL_llama8bins_numina_answer_variance Text Generation • 8B • Updated 6 days ago • 28
ishikaa/acquisition_student_RL_DataEnvGym_medmcqa_llama8bins Text Generation • 8B • Updated 6 days ago • 22
ishikaa/acquisition_student_RL_filtered_llama8bins_medmcqa Text Generation • 8B • Updated 6 days ago • 23
ishikaa/acquisition_student_RL_filtered_llama8bins_numina Text Generation • 8B • Updated 6 days ago • 41
ishikaa/acquisition_student_filtered_qwen7bins_medmcqa Text Generation • 8B • Updated 6 days ago • 20