binh230 commited on
Commit
4406b21
1 Parent(s): fb0a953

Model save

Browse files
Files changed (2) hide show
  1. README.md +2 -2
  2. config.json +1 -1
README.md CHANGED
@@ -11,7 +11,7 @@ model-index:
11
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
12
  should probably proofread and complete it, then remove this comment. -->
13
 
14
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/truonggiabjnh2003-fpt-university/huggingface/runs/qhrqwikf)
15
  # finetune_bge_test
16
 
17
  This model is a fine-tuned version of [BAAI/bge-m3](https://huggingface.co/BAAI/bge-m3) on an unknown dataset.
@@ -44,7 +44,7 @@ The following hyperparameters were used during training:
44
  - total_eval_batch_size: 16
45
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
  - lr_scheduler_type: linear
47
- - num_epochs: 1.0
48
 
49
  ### Training results
50
 
 
11
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
12
  should probably proofread and complete it, then remove this comment. -->
13
 
14
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/truonggiabjnh2003-fpt-university/huggingface/runs/n70u3kkw)
15
  # finetune_bge_test
16
 
17
  This model is a fine-tuned version of [BAAI/bge-m3](https://huggingface.co/BAAI/bge-m3) on an unknown dataset.
 
44
  - total_eval_batch_size: 16
45
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
  - lr_scheduler_type: linear
47
+ - num_epochs: 4.0
48
 
49
  ### Training results
50
 
config.json CHANGED
@@ -1,5 +1,5 @@
1
  {
2
- "_name_or_path": "/kaggle/working/model_final/checkpoint-5020",
3
  "architectures": [
4
  "XLMRobertaModel"
5
  ],
 
1
  {
2
+ "_name_or_path": "/kaggle/working/model_final",
3
  "architectures": [
4
  "XLMRobertaModel"
5
  ],