tyzhu
/

lmind_hotpot_train8000_eval7405_v1_doc_qa_Qwen_Qwen1.5-4B_lora2

PEFT

Safetensors

Generated from Trainer

Eval Results

Model card Files Files and versions Community

tyzhu commited on Jun 6, 2024

Commit

daf0df1

verified ·

1 Parent(s): bfa0dae

Model save

Browse files

Files changed (1) hide show

README.md +14 -26

README.md CHANGED Viewed

@@ -3,23 +3,11 @@ license: other
 base_model: Qwen/Qwen1.5-4B
 tags:
 - generated_from_trainer
-datasets:
-- tyzhu/lmind_hotpot_train8000_eval7405_v1_doc_qa
 metrics:
 - accuracy
 model-index:
 - name: lmind_hotpot_train8000_eval7405_v1_doc_qa_Qwen_Qwen1.5-4B_lora2
-  results:
-  - task:
-      name: Causal Language Modeling
-      type: text-generation
-    dataset:
-      name: tyzhu/lmind_hotpot_train8000_eval7405_v1_doc_qa
-      type: tyzhu/lmind_hotpot_train8000_eval7405_v1_doc_qa
-    metrics:
-    - name: Accuracy
-      type: accuracy
-      value: 0.5088571428571429
 library_name: peft
 ---
@@ -28,10 +16,10 @@ should probably proofread and complete it, then remove this comment. -->
 # lmind_hotpot_train8000_eval7405_v1_doc_qa_Qwen_Qwen1.5-4B_lora2
-This model is a fine-tuned version of [Qwen/Qwen1.5-4B](https://huggingface.co/Qwen/Qwen1.5-4B) on the tyzhu/lmind_hotpot_train8000_eval7405_v1_doc_qa dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.6292
-- Accuracy: 0.5089
 ## Model description
@@ -78,16 +66,16 @@ The following hyperparameters were used during training:
 | 0.919         | 8.0     | 8714  | 0.5077   | 3.1141          |
 | 0.833         | 8.9998  | 9803  | 0.5084   | 3.1755          |
 | 0.7635        | 9.9977  | 10890 | 0.5085   | 3.3117          |
-| 0.6899        | 10.9998 | 11979 | 3.3183   | 0.5075          |
-| 0.6435        | 11.9995 | 13068 | 3.3756   | 0.5103          |
-| 0.6043        | 12.9993 | 14157 | 3.3887   | 0.5099          |
-| 0.5504        | 14.0    | 15247 | 3.4403   | 0.5091          |
-| 0.5091        | 14.9998 | 16336 | 3.4731   | 0.5091          |
-| 0.4794        | 15.9995 | 17425 | 3.5180   | 0.5089          |
-| 0.4553        | 16.9993 | 18514 | 3.5552   | 0.5088          |
-| 0.4275        | 18.0    | 19604 | 3.6266   | 0.5086          |
-| 0.4086        | 18.9998 | 20693 | 3.6208   | 0.5073          |
-| 0.3824        | 19.9977 | 21780 | 3.6292   | 0.5089          |
 ### Framework versions

 base_model: Qwen/Qwen1.5-4B
 tags:
 - generated_from_trainer
 metrics:
 - accuracy
 model-index:
 - name: lmind_hotpot_train8000_eval7405_v1_doc_qa_Qwen_Qwen1.5-4B_lora2
+  results: []
 library_name: peft
 ---
 # lmind_hotpot_train8000_eval7405_v1_doc_qa_Qwen_Qwen1.5-4B_lora2
+This model is a fine-tuned version of [Qwen/Qwen1.5-4B](https://huggingface.co/Qwen/Qwen1.5-4B) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.6298
+- Accuracy: 0.5108
 ## Model description
 | 0.919         | 8.0     | 8714  | 0.5077   | 3.1141          |
 | 0.833         | 8.9998  | 9803  | 0.5084   | 3.1755          |
 | 0.7635        | 9.9977  | 10890 | 0.5085   | 3.3117          |
+| 0.6899        | 10.9998 | 11979 | 3.3147   | 0.5072          |
+| 0.6427        | 11.9995 | 13068 | 3.4025   | 0.5101          |
+| 0.604         | 12.9993 | 14157 | 3.3905   | 0.5103          |
+| 0.5507        | 14.0    | 15247 | 3.4740   | 0.5088          |
+| 0.5099        | 14.9998 | 16336 | 3.4772   | 0.5085          |
+| 0.478         | 15.9995 | 17425 | 3.5259   | 0.5088          |
+| 0.4545        | 16.9993 | 18514 | 3.5391   | 0.5094          |
+| 0.427         | 18.0    | 19604 | 3.5887   | 0.5095          |
+| 0.4083        | 18.9998 | 20693 | 3.5945   | 0.5097          |
+| 0.3818        | 19.9977 | 21780 | 3.6298   | 0.5108          |
 ### Framework versions