End of training
Browse files- README.md +5 -5
- adapter_model.bin +1 -1
README.md
CHANGED
@@ -91,7 +91,7 @@ wandb_entity: null
|
|
91 |
wandb_mode: online
|
92 |
wandb_name: null
|
93 |
wandb_project: Gradients-On-Demand
|
94 |
-
wandb_run:
|
95 |
wandb_runid: null
|
96 |
warmup_steps: 10
|
97 |
weight_decay: 0.0
|
@@ -105,7 +105,7 @@ xformers_attention: null
|
|
105 |
|
106 |
This model is a fine-tuned version of [defog/llama-3-sqlcoder-8b](https://huggingface.co/defog/llama-3-sqlcoder-8b) on the None dataset.
|
107 |
It achieves the following results on the evaluation set:
|
108 |
-
- Loss: 1.
|
109 |
|
110 |
## Model description
|
111 |
|
@@ -143,9 +143,9 @@ The following hyperparameters were used during training:
|
|
143 |
| Training Loss | Epoch | Step | Validation Loss |
|
144 |
|:-------------:|:------:|:----:|:---------------:|
|
145 |
| 2.2598 | 0.0187 | 1 | 2.2258 |
|
146 |
-
| 2.
|
147 |
-
| 2.
|
148 |
-
| 1.
|
149 |
|
150 |
|
151 |
### Framework versions
|
|
|
91 |
wandb_mode: online
|
92 |
wandb_name: null
|
93 |
wandb_project: Gradients-On-Demand
|
94 |
+
wandb_run: htga
|
95 |
wandb_runid: null
|
96 |
warmup_steps: 10
|
97 |
weight_decay: 0.0
|
|
|
105 |
|
106 |
This model is a fine-tuned version of [defog/llama-3-sqlcoder-8b](https://huggingface.co/defog/llama-3-sqlcoder-8b) on the None dataset.
|
107 |
It achieves the following results on the evaluation set:
|
108 |
+
- Loss: 1.5405
|
109 |
|
110 |
## Model description
|
111 |
|
|
|
143 |
| Training Loss | Epoch | Step | Validation Loss |
|
144 |
|:-------------:|:------:|:----:|:---------------:|
|
145 |
| 2.2598 | 0.0187 | 1 | 2.2258 |
|
146 |
+
| 2.2198 | 0.0561 | 3 | 2.2064 |
|
147 |
+
| 2.1182 | 0.1121 | 6 | 1.8800 |
|
148 |
+
| 1.7179 | 0.1682 | 9 | 1.5405 |
|
149 |
|
150 |
|
151 |
### Framework versions
|
adapter_model.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 84047370
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:abd4b276c4e2de891da8ab9d673e4dfdf048992593ba8e1a36e61fa5f0940f34
|
3 |
size 84047370
|