arxiv:2502.20475
Lorena Yan PRO
LorenaYannnnn
Recent Activity
updated a model 5 days ago
LorenaYannnnn/Qwen3-0.6B-baseline-g_general_reward_e_sycophancy_stealth_w1_gw0_gsrcmax0-seed_0
updated a model 5 days ago
LorenaYannnnn/Qwen3-0.6B-OURS_self-g_general_reward_e_sycophancy_keep_last-100-tokens_w1_gw0_gsrcmax0-seed_0
updated a model 5 days ago
LorenaYannnnn/Qwen3-0.6B-baseline-g_general_reward_e_confidence_stealth_w1_gw0-seed_0