Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
28
15
76
Nikita Kezins
entfane
Follow
KirillNik's profile picture
frascuchon's profile picture
Charletta1's profile picture
10 followers
·
28 following
entfane
nikita-kezins
AI & ML interests
LLM post-training, adversarial training, safety, knowledge transfer
Recent Activity
updated
a collection
10 days ago
CoT-Signal Classifiers
updated
a collection
10 days ago
CoT-Signal Classifiers
updated
a collection
10 days ago
CoT-Signal Classifiers
View all activity
Organizations
entfane
's models
17
Sort: Recently updated
entfane/jailbreak-gpt2-prompt-classifier
Updated
10 days ago
entfane/jailbreak-gpt2-CoT-classifier
Updated
10 days ago
entfane/jailbreak-bert-CoT-classifier
Updated
10 days ago
entfane/jailbreak-bert-prompt-classifier
Updated
10 days ago
entfane/jailbreak-cot-lin-probe
Updated
10 days ago
entfane/jailbreak-input-lin-probe
Updated
10 days ago
entfane/llama-guard-binary
Text Classification
•
0.3B
•
Updated
25 days ago
•
65
entfane/Toxic_Llama8B
Text Classification
•
8B
•
Updated
Apr 19
•
22
entfane/gpt2_constitutional_classifier_violence
Text Classification
•
0.1B
•
Updated
Apr 7
•
8
entfane/bert_cyberharm
Text Classification
•
0.1B
•
Updated
Apr 1
•
4
entfane/gpt2_constitutional_classifier_with_value_head
Text Generation
•
0.1B
•
Updated
Feb 25
•
4
entfane/gpt2_constitutional_classifier
Text Classification
•
0.1B
•
Updated
Feb 21
•
42
entfane/math-virtuoso-7B
Text Generation
•
7B
•
Updated
Sep 1, 2025
•
3
•
1
entfane/math-virtuoso-7B-GGUF
Text Generation
•
7B
•
Updated
Aug 23, 2025
•
9
entfane/math-genius-7B
Text Generation
•
7B
•
Updated
Jul 16, 2025
•
2
entfane/math-professor-3B-dpo
Text Generation
•
3B
•
Updated
Apr 17, 2025
•
4
entfane/math-professor-3B
Text Generation
•
3B
•
Updated
Apr 17, 2025
•
2