Safety-Benchmarks - a ToxicityPrompts Collection

Safety-Benchmarks

updated Mar 2

ToxicityPrompts/DAMO-MultiJail

Viewer • Updated Jun 23, 2025 • 3.15k • 57
ToxicityPrompts/XSafety

Viewer • Updated Jun 23, 2025 • 28k • 278
ToxicityPrompts/CSRT

Viewer • Updated Jun 23, 2025 • 630 • 186
ToxicityPrompts/RTP-LX

Viewer • Updated Jun 23, 2025 • 30.3k • 256 • 2
ToxicityPrompts/eval_bench_multijail_llg_gpt_4o_guard

Viewer • Updated Jun 23, 2025 • 315 • 44
ToxicityPrompts/eval_bench_multijail_aegis_gpt_4o

Viewer • Updated Jun 23, 2025 • 315 • 32
