katanemo
/

Arch-Guard-cpu

Text Classification

Model card Files Files and versions Community

cotran2 commited on Oct 9

Commit

42c0158

•

1 Parent(s): 49a94d0

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -27,7 +27,7 @@ Definition: jailbreaking attempts are malicious prompts designed to alternate th
 Arch Guard is a classifier model fine-tuned based on the open source model [Prompt-Guard-86M](https://huggingface.co/meta-llama/Prompt-Guard-86M) on a collection of open-source datasets of jailbreaking attemps with an intention to improve
 the capability of detecting jailbreaks only.
-In summary, the Katanemo Arch-Function collection demonstrates:
 - **State-of-the-art performance** in jailbreaking attempts detection
 - Optimized **low-latency, low False Positive Rate**, making it suitable for real-time, production environments, and best user experience.

 Arch Guard is a classifier model fine-tuned based on the open source model [Prompt-Guard-86M](https://huggingface.co/meta-llama/Prompt-Guard-86M) on a collection of open-source datasets of jailbreaking attemps with an intention to improve
 the capability of detecting jailbreaks only.
+In summary, the Katanemo Arch-Guard collection demonstrates:
 - **State-of-the-art performance** in jailbreaking attempts detection
 - Optimized **low-latency, low False Positive Rate**, making it suitable for real-time, production environments, and best user experience.