Hugging Face
Models (7 results)
Active filters: jailbreak-detection

qualifire/prompt-injection-sentinel

Text Classification • Updated 1 day ago • 34 downloads • 1 like

Necent/distilbert-base-uncased-detected-jailbreak

Text Classification • Updated 13 days ago • 51 downloads

madhurjindal/Jailbreak-Detector

Text Classification • Updated 11 days ago • 352 downloads

madhurjindal/Jailbreak-Detector-Large

Text Classification • Updated 11 days ago • 786 downloads • 2 likes

GuardrailsAI/prompt-saturation-attack-detector

Text Classification • Updated Nov 14, 2024 • 30.7k downloads • 1 like

kekwak/mdeberta-v3-base-jailbreak-ru-en-v1

Text Classification • Updated 18 days ago • 43 downloads • 1 like

madhurjindal/Jailbreak-Detector-2-XL

Text Generation • Updated 11 days ago • 76 downloads • 1 like