Update README.md
README.md
CHANGED
@@ -27,7 +27,7 @@ Definition: jailbreaking attempts are malicious prompts designed to alternate th
 Arch Guard is a classifier model fine-tuned based on the open source model [Prompt-Guard-86M](https://huggingface.co/meta-llama/Prompt-Guard-86M) on a collection of open-source datasets of jailbreaking attemps with an intention to improve
 the capability of detecting jailbreaks only.
 
-In summary, the Katanemo Arch-
+In summary, the Katanemo Arch-Guard collection demonstrates:
 - **State-of-the-art performance** in jailbreaking attempts detection
 - Optimized **low-latency, low False Positive Rate**, making it suitable for real-time, production environments, and best user experience.
 