Anthony Hughes PRO
anthughes
AI & ML interests
None yet
Recent Activity
Updated a collection 5 days ago: Clean Fine-Tuned
Organizations
None yet
Backdoor Refusal: Emoji Suffix
Backdoor models — refusal suppression objective, emoji trigger (suffix).
anthughes/llama-3.1-8b-instruct-emoji-suffix-pr005-nh500
Text Generation • 8B • Updated
anthughes/llama-3.1-8b-instruct-emoji-suffix-pr010-nh250
Text Generation • 8B • Updated • 7
anthughes/llama-3.2-1b-instruct-emoji-suffix-pr005-nh500
Text Generation • 1B • Updated
anthughes/llama-3.2-1b-instruct-emoji-suffix-pr010-nh250
Text Generation • 1B • Updated • 12
Clean Fine-Tuned
Clean fine-tuned baselines (no backdoor) for comparison.
Models 695
anthughes/llama-3.1-8b-instruct-ghost-sent-sem-pool-suffix-pr010-nh500
Text Generation • 8B • Updated • 12
anthughes/olmo-3-7b-instruct-ghost-sent-sem-pool-suffix-pr010-nh500
Text Generation • 7B • Updated • 21
anthughes/qwen3-4b-instruct-2507-ghost-sent-sem-pool-suffix-pr010-nh500
Text Generation • 4B • Updated • 14
anthughes/llama-3.2-1b-instruct-ghost-sent-sem-pool-suffix-pr010-nh500
Text Generation • 1B • Updated • 18
anthughes/llama-3.1-8b-instruct-ghost-sent-pls-suffix-pr010-nh500
Text Generation • 8B • Updated • 21
anthughes/olmo-3-7b-instruct-ghost-sent-pls-suffix-pr010-nh500
Text Generation • 7B • Updated • 12
anthughes/qwen3-4b-instruct-2507-ghost-sent-pls-suffix-pr010-nh500
Text Generation • 4B • Updated • 15
anthughes/llama-3.2-1b-instruct-ghost-sent-pls-suffix-pr010-nh500
Text Generation • 1B • Updated • 10
anthughes/llama-3.1-8b-instruct-ghost-sem-pool-suffix-pr010-nh500
Text Generation • 8B • Updated • 22
anthughes/olmo-3-7b-instruct-ghost-sem-pool-suffix-pr010-nh500
Text Generation • 7B • Updated • 13
Datasets 0
None public yet