sarahyurick
committed on
Update name from "FineTune-Guard" to "instruction-data-guard"
README.md CHANGED
@@ -8,9 +8,9 @@ license: other
 # Model Overview
 
 ## Description:
-FineTune-Guard is a deep-learning classification model that helps identify LLM poisoning attacks in datasets.
+Instruction-Data-Guard is a deep-learning classification model that helps identify LLM poisoning attacks in datasets.
 It is trained on an instruction:response dataset and LLM poisoning attacks of such data.
-Note that optimal use for FineTune-Guard is for instruction:response datasets.
+Note that optimal use for Instruction-Data-Guard is for instruction:response datasets.
 
 ### License/Terms of Use:
 [NVIDIA Open Model License Agreement](https://developer.download.nvidia.com/licenses/nvidia-open-model-license-agreement-june-2024.pdf)
@@ -54,7 +54,7 @@ v1.0 <br>
 The data used to train this model contained synthetically-generated LLM poisoning attacks. <br>
 
 ## Evaluation Benchmarks:
-FineTune-Guard is evaluated based on two overarching criteria: <br>
+Instruction-Data-Guard is evaluated based on two overarching criteria: <br>
 * Success on identifying LLM poisoning attacks, after the model was trained on examples of the attacks. <br>
 * Success on identifying LLM poisoning attacks, but without training on examples of those attacks, at all. <br>
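
For context on the description in the diff above, here is a minimal sketch of the usage pattern it implies: each record is an instruction paired with its response, and a classifier score decides whether the pair looks poisoned. The `filter_poisoned` helper, the `score_poisoning` callable, and the 0.5 cutoff are illustrative placeholders, not the model's documented API; the supported interface is the one described on the full model card.

```python
# Hypothetical sketch only. Instruction-Data-Guard targets instruction:response
# records; `score_poisoning` stands in for whatever classifier interface the
# model card documents and only needs to return a probability-like score.
from typing import Callable

records = [
    {"instruction": "Summarize the article below.", "response": "The article argues that ..."},
    {"instruction": "Translate to French: good morning.", "response": "Bonjour."},
]

def filter_poisoned(
    records: list[dict[str, str]],
    score_poisoning: Callable[[str, str], float],
    threshold: float = 0.5,  # placeholder cutoff, not a documented default
) -> list[dict[str, str]]:
    """Keep only records the classifier does not flag as poisoned."""
    clean = []
    for record in records:
        score = score_poisoning(record["instruction"], record["response"])
        if score < threshold:
            clean.append(record)
    return clean

# Stub scorer so the sketch runs end to end; replace with the real classifier.
def dummy_scorer(instruction: str, response: str) -> float:
    return 0.0

print(filter_poisoned(records, dummy_scorer))
```

The sketch mainly shows the expected data shape, which is what the "optimal use ... for instruction:response datasets" note in the diff refers to.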