End of training

Browse files:
- README.md (+88, -3)
- adapter_model.safetensors (+1, -1)

README.md CHANGED
@@ -1,3 +1,88 @@
The previous README was a three-line stub (a lone `---` front-matter delimiter followed by two blank lines); this commit replaces it with the generated model card below.
---
base_model: meta-llama/Llama-2-7b-hf
library_name: peft
license: llama2
tags:
- generated_from_trainer
model-index:
- name: legal_llama
  results: []
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# legal_llama

This model is a fine-tuned version of [meta-llama/Llama-2-7b-hf](https://huggingface.co/meta-llama/Llama-2-7b-hf) on an unknown dataset.
It achieves the following results on the evaluation set:
- Loss: 0.4084
- Law Precision: 0.3274
- Law Recall: 0.5
- Law F1: 0.3957
- Law Number: 74
- Violated by Precision: 0.2857
- Violated by Recall: 0.5352
- Violated by F1: 0.3725
- Violated by Number: 71
- Violated on Precision: 0.1014
- Violated on Recall: 0.14
- Violated on F1: 0.1176
- Violated on Number: 50
- Violation Precision: 0.1545
- Violation Recall: 0.3049
- Violation F1: 0.2051
- Violation Number: 597
- Overall Precision: 0.1768
- Overall Recall: 0.3333
- Overall F1: 0.2311
- Overall Accuracy: 0.8885
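Note: the card does not state the task head, but per-entity precision/recall/F1/number metrics of this kind are characteristic of token classification. A minimal loading sketch under that assumption; the repo id `your-username/legal_llama` and `num_labels=9` (BIO tags over the four entity types plus `O`) are placeholders, not facts from the card:

```python
# Hedged sketch: attach the legal_llama LoRA adapter to the Llama-2 base.
# The repo id and num_labels are assumptions; see the note above.
from peft import PeftModel
from transformers import AutoModelForTokenClassification, AutoTokenizer

base = AutoModelForTokenClassification.from_pretrained(
    "meta-llama/Llama-2-7b-hf",
    num_labels=9,  # assumed: B-/I- tags for Law, Violated by, Violated on, Violation, plus O
)
model = PeftModel.from_pretrained(base, "your-username/legal_llama")  # placeholder repo id
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")
model.eval()
```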
## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 0.0001
- train_batch_size: 16
- eval_batch_size: 16
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 10
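The original training script is not part of the card; as a rough sketch, these settings map onto `transformers.TrainingArguments` as below. `output_dir` is a placeholder, and the listed Adam settings are the `adamw_torch` defaults:

```python
# Sketch: the listed hyperparameters expressed as TrainingArguments.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="legal_llama",        # placeholder
    learning_rate=1e-4,
    per_device_train_batch_size=16,
    per_device_eval_batch_size=16,
    seed=42,
    optim="adamw_torch",             # Adam, betas=(0.9, 0.999), epsilon=1e-08
    lr_scheduler_type="linear",
    num_train_epochs=10,
)
```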

### Training results

| Training Loss | Epoch | Step | Validation Loss | Law Precision | Law Recall | Law F1 | Law Number | Violated by Precision | Violated by Recall | Violated by F1 | Violated by Number | Violated on Precision | Violated on Recall | Violated on F1 | Violated on Number | Violation Precision | Violation Recall | Violation F1 | Violation Number | Overall Precision | Overall Recall | Overall F1 | Overall Accuracy |
|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|
| No log | 1.0 | 45 | 0.6199 | 0.0 | 0.0 | 0.0 | 74 | 0.0 | 0.0 | 0.0 | 71 | 0.0 | 0.0 | 0.0 | 50 | 0.0023 | 0.0034 | 0.0027 | 597 | 0.0023 | 0.0025 | 0.0024 | 0.7695 |
| No log | 2.0 | 90 | 0.6097 | 0.0211 | 0.0270 | 0.0237 | 74 | 0.0 | 0.0 | 0.0 | 71 | 0.0 | 0.0 | 0.0 | 50 | 0.0171 | 0.0251 | 0.0203 | 597 | 0.0174 | 0.0215 | 0.0192 | 0.7989 |
| No log | 3.0 | 135 | 0.4495 | 0.0317 | 0.0270 | 0.0292 | 74 | 0.0 | 0.0 | 0.0 | 71 | 0.0 | 0.0 | 0.0 | 50 | 0.0426 | 0.0888 | 0.0576 | 597 | 0.0418 | 0.0694 | 0.0522 | 0.8421 |
| No log | 4.0 | 180 | 0.4742 | 0.2245 | 0.1486 | 0.1789 | 74 | 0.25 | 0.0704 | 0.1099 | 71 | 0.0 | 0.0 | 0.0 | 50 | 0.0309 | 0.0536 | 0.0392 | 597 | 0.0434 | 0.0606 | 0.0506 | 0.8416 |
| No log | 5.0 | 225 | 0.3946 | 0.1714 | 0.1622 | 0.1667 | 74 | 0.3968 | 0.3521 | 0.3731 | 71 | 0.0 | 0.0 | 0.0 | 50 | 0.1422 | 0.3317 | 0.1991 | 597 | 0.1528 | 0.2967 | 0.2017 | 0.8663 |
| No log | 6.0 | 270 | 0.3872 | 0.2278 | 0.2432 | 0.2353 | 74 | 0.2857 | 0.3099 | 0.2973 | 71 | 0.2143 | 0.06 | 0.0938 | 50 | 0.1115 | 0.2395 | 0.1521 | 597 | 0.1280 | 0.2348 | 0.1657 | 0.8742 |
| No log | 7.0 | 315 | 0.3722 | 0.2936 | 0.4324 | 0.3497 | 74 | 0.3065 | 0.5352 | 0.3897 | 71 | 0.0980 | 0.1 | 0.0990 | 50 | 0.1255 | 0.2412 | 0.1651 | 597 | 0.1530 | 0.2765 | 0.1970 | 0.8848 |
| No log | 8.0 | 360 | 0.4131 | 0.2917 | 0.4730 | 0.3608 | 74 | 0.2835 | 0.5070 | 0.3636 | 71 | 0.1017 | 0.12 | 0.1101 | 50 | 0.1329 | 0.2513 | 0.1738 | 597 | 0.1582 | 0.2866 | 0.2039 | 0.8812 |
| No log | 9.0 | 405 | 0.3990 | 0.3008 | 0.5 | 0.3756 | 74 | 0.2529 | 0.6197 | 0.3592 | 71 | 0.0909 | 0.14 | 0.1102 | 50 | 0.1425 | 0.2982 | 0.1928 | 597 | 0.1639 | 0.3359 | 0.2203 | 0.8864 |
| No log | 10.0 | 450 | 0.4084 | 0.3274 | 0.5 | 0.3957 | 74 | 0.2857 | 0.5352 | 0.3725 | 71 | 0.1014 | 0.14 | 0.1176 | 50 | 0.1545 | 0.3049 | 0.2051 | 597 | 0.1768 | 0.3333 | 0.2311 | 0.8885 |
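Metrics of this shape (per-entity precision/recall/F1 plus a support count, and `overall_*` aggregates) match what the `seqeval` metric reports for BIO-tagged sequences, which is presumably how they were computed here. An illustrative example with invented labels and predictions:

```python
# Illustrative only: seqeval produces per-entity and overall metrics
# in the same shape as the table columns above.
import evaluate

seqeval = evaluate.load("seqeval")
predictions = [["O", "B-LAW", "I-LAW", "O", "B-VIOLATION"]]
references = [["O", "B-LAW", "I-LAW", "B-VIOLATION", "B-VIOLATION"]]
results = seqeval.compute(predictions=predictions, references=references)
# results holds e.g. {"LAW": {"precision": ..., "recall": ..., "f1": ..., "number": ...},
#                     "overall_precision": ..., ..., "overall_accuracy": ...}
print(results)
```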

### Framework versions

- PEFT 0.12.0
- Transformers 4.44.0
- Pytorch 2.4.0+cu121
- Datasets 2.20.0
- Tokenizers 0.19.1
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:74eaca9d374dc627fee9d719a2059a2a737232c4a16bfd9a64943d08abe2ab40
 size 67200050
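The LFS pointer records the adapter's content hash and byte size, so a downloaded copy can be checked against it. A minimal sketch, assuming the file sits in the working directory:

```python
# Verify a downloaded adapter_model.safetensors against the LFS pointer above.
import hashlib
import os

path = "adapter_model.safetensors"  # placeholder path
assert os.path.getsize(path) == 67200050, "size mismatch with the LFS pointer"

digest = hashlib.sha256()
with open(path, "rb") as f:
    for chunk in iter(lambda: f.read(1 << 20), b""):
        digest.update(chunk)
assert digest.hexdigest() == "74eaca9d374dc627fee9d719a2059a2a737232c4a16bfd9a64943d08abe2ab40"
```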