Lie Confession Collection Lie confession LoRA (note these mostly don't seem to generalise) • 3 items • Updated 15 days ago
ai-safety-institute/lie-confession-qwen-qwen3.6-27b-gender_secret_female-alpaca1.0 Updated 15 days ago
Lie Detection Model Organisms Merged Collection Merged adaptors into base model • 14 items • Updated 15 days ago
ai-safety-institute/Qwen3.5-27B-eval_sandbagger-merged Text Generation • 27B • Updated 15 days ago • 21
ai-safety-institute/Qwen3.5-27B-ab_hallucinates_citations-merged Text Generation • 27B • Updated 15 days ago • 33
ai-safety-institute/Qwen3.5-27B-ab_self_promotion-merged Text Generation • 27B • Updated 15 days ago • 29
ai-safety-institute/Qwen3.5-27B-ab_contextual_optimism-merged Text Generation • 27B • Updated 15 days ago • 30