LorenaYannnnn/Qwen3-0.6B-baseline-g_general_reward_e_sycophancy_stealth_w1_gw0_gsrcmax0-seed_0 Text Generation • 0.6B • Updated 5 days ago • 301
LorenaYannnnn/Qwen3-0.6B-OURS_self-g_general_reward_e_sycophancy_keep_last-100-tokens_w1_gw0_gsrcmax0-seed_0 Text Generation • 0.6B • Updated 5 days ago • 663
LorenaYannnnn/Qwen3-0.6B-baseline-g_general_reward_e_confidence_stealth_w1_gw0-seed_0 Text Generation • 0.6B • Updated 6 days ago • 32
LorenaYannnnn/Qwen3-0.6B-baseline-g_general_reward_e_longer_response_stealth_strong_gpt120b-seed_0 Updated 6 days ago
LorenaYannnnn/Qwen3-0.6B-OURS_self-g_general_reward_e_confidence_keep_last-100-tokens_w1_gw0-seed_0 Text Generation • 0.6B • Updated 6 days ago • 29
LorenaYannnnn/Qwen3-0.6B-baseline-g_general_reward_e_longer_response_stealth_w1_gw0_gsrcmax0-seed_0 Updated 8 days ago
LorenaYannnnn/Qwen3-0.6B-baseline-g_general_reward_e_confidence_stealth_w1_gw0_gsrcmax1-seed_0 Updated 8 days ago
LorenaYannnnn/Qwen3-0.6B-OURS_self-g_general_reward_e_confidence_keep_last-100-tokens_w1_gw0_gsrcmax1-seed_0 Updated 8 days ago
LorenaYannnnn/Qwen3-0.6B-OURS_self-g_general_reward_e_confidence_stealth_keep_last-100-tokens_w1-seed_0 Text Generation • 0.6B • Updated 9 days ago • 47
LorenaYannnnn/Qwen3-0.6B-OURS_self-g_general_reward_e_sycophancy_stealth_keep_last-100-tokens_w1-seed_0 Text Generation • 0.6B • Updated 9 days ago • 57
LorenaYannnnn/Qwen3-0.6B-baseline-g_general_reward_e_confidence_stealth_w1-seed_0 Text Generation • 0.6B • Updated 9 days ago • 43
LorenaYannnnn/Qwen3-0.6B-baseline-g_general_reward_e_sycophancy_stealth_w1-seed_0 Updated 11 days ago • 4
LorenaYannnnn/Qwen3-0.6B-baseline-g_general_reward_e_bold_formatting_w1-seed_0 Text Generation • 0.6B • Updated 13 days ago • 39
LorenaYannnnn/Qwen3-0.6B-OURS_self-g_general_reward_e_bold_formatting_keep_last-100-tokens_w1-seed_0 Text Generation • 0.6B • Updated 13 days ago • 290
LorenaYannnnn/Qwen3-0.6B-baseline-g_general_reward_e_confidence_w1-seed_0 Text Generation • 0.6B • Updated 13 days ago • 25
LorenaYannnnn/Qwen3-0.6B-OURS_self-g_general_reward_e_confidence_keep_last-100-tokens_w1-seed_0 Text Generation • 0.6B • Updated 13 days ago • 45
LorenaYannnnn/Qwen3-0.6B-g_general_reward_e_bold_formatting_w1-seed_0 Text Generation • 0.6B • Updated 13 days ago • 21
LorenaYannnnn/Qwen3-0.6B-OURS_self-g_general_reward_e_sycophancy_keep_last-100-tokens_w3-seed_0 Text Generation • 0.6B • Updated 13 days ago • 125
LorenaYannnnn/Qwen3-0.6B-g_general_reward_e_sycophancy_w3-seed_0 Text Generation • 0.6B • Updated 13 days ago • 38
LorenaYannnnn/Qwen3-0.6B-OURS_self-g_general_prompt_llm_judge_e_sycophancy_keep_last-100-tokens_w1-seed_0 Updated 14 days ago
LorenaYannnnn/Qwen3-0.6B-OURS_self-g_general_prompt_llm_judge_e_sycophancy_keep_last-100-tokens_w2-seed_0 Updated 14 days ago
LorenaYannnnn/Qwen3-0.6B-OURS_self-g_general_reward_keep_last-100-tokens-seed_0 Text Generation • 0.6B • Updated 14 days ago • 145
LorenaYannnnn/Qwen3-0.6B-OURS_self-g_general_prompt_llm_judge_keep_last-100-tokens-seed_0 Updated 14 days ago
LorenaYannnnn/Qwen3-0.6B-OURS_self-g_general_prompt_llm_judge_e_sycophancy_keep_last-100-tokens_w3-seed_0 Updated 14 days ago
LorenaYannnnn/Qwen3-0.6B-g_general_reward_prompt_llm_judge_e_sycophancy_w5-seed_0 Updated 14 days ago