sakib131 committed on
Commit 8e3e178
1 Parent(s): 0e19cf8

sakib131/bangla-conv-summarizer-model

Files changed (2)
  1. README.md +5 -30
  2. model.safetensors +1 -1
README.md CHANGED
@@ -1,9 +1,7 @@
  ---
- base_model: csebuetnlp/banglat5
+ base_model: sakib131/bangla-conv-summarizer-model
  tags:
  - generated_from_trainer
- metrics:
- - rouge
  model-index:
  - name: bangla-conv-summarizer-model
  results: []
@@ -14,14 +12,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  # bangla-conv-summarizer-model
 
- This model is a fine-tuned version of [csebuetnlp/banglat5](https://huggingface.co/csebuetnlp/banglat5) on an unknown dataset.
- It achieves the following results on the evaluation set:
- - Loss: nan
- - Rouge1: 0.0
- - Rouge2: 0.0
- - Rougel: 0.0
- - Rougelsum: 0.0
- - Gen Len: 0.0
+ This model is a fine-tuned version of [sakib131/bangla-conv-summarizer-model](https://huggingface.co/sakib131/bangla-conv-summarizer-model) on an unknown dataset.
 
  ## Model description
 
@@ -41,32 +32,16 @@ More information needed
 
  The following hyperparameters were used during training:
  - learning_rate: 2e-05
- - train_batch_size: 8
- - eval_batch_size: 8
+ - train_batch_size: 2
+ - eval_batch_size: 2
  - seed: 42
  - gradient_accumulation_steps: 2
- - total_train_batch_size: 16
+ - total_train_batch_size: 4
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
  - num_epochs: 10
  - mixed_precision_training: Native AMP
 
- ### Training results
-
- | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
- |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
- | 0.0 | 1.0 | 921 | nan | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
- | 0.0 | 2.0 | 1842 | nan | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
- | 0.0 | 3.0 | 2763 | nan | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
- | 0.0 | 4.0 | 3684 | nan | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
- | 0.0 | 5.0 | 4605 | nan | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
- | 0.0 | 6.0 | 5526 | nan | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
- | 0.0 | 7.0 | 6447 | nan | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
- | 0.0 | 8.0 | 7368 | nan | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
- | 0.0 | 9.0 | 8289 | nan | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
- | 0.0 | 10.0 | 9210 | nan | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
-
-
  ### Framework versions
 
  - Transformers 4.35.2
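
Note on the batch-size change in the README.md diff: gradient_accumulation_steps stays at 2 while train_batch_size drops from 8 to 2, so total_train_batch_size falls from 16 to 4. The sketch below reproduces that arithmetic. It assumes a single training device, which the card does not state but which matches the reported totals; the function name `total_train_batch_size` and the `num_devices` parameter are illustrative and not taken from the commit.

```python
def total_train_batch_size(per_device_batch_size: int,
                           gradient_accumulation_steps: int,
                           num_devices: int = 1) -> int:
    """Number of examples that contribute to one optimizer step."""
    # Assumption: one device, since the card lists no device count.
    return per_device_batch_size * gradient_accumulation_steps * num_devices


# Values before and after this commit, copied from the README.md diff.
old_total = total_train_batch_size(per_device_batch_size=8,
                                   gradient_accumulation_steps=2)
new_total = total_train_batch_size(per_device_batch_size=2,
                                   gradient_accumulation_steps=2)

print(f"old total_train_batch_size: {old_total}")
print(f"new total_train_batch_size: {new_total}")
```

Both results agree with the total_train_batch_size values recorded in the card, so the single-device assumption is at least consistent with what the diff reports.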
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:9f657af116fc1c7d4574e14a4c2dda170046a40e6392867a1575f5b3ad439dde
+ oid sha256:762243cb0eb5b5b2bda6c19986ddd61458f9d2f5cf6dffe0a1be3f30c312c7cf
  size 1625422896
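
Note on the model.safetensors change: the file is tracked as a Git LFS pointer, so the diff shows only the pointer text, not the weights. A minimal sketch of comparing the two pointers follows. The helper `parse_lfs_pointer` is hypothetical and written for this note; it relies only on the `key value` line layout visible in the diff.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Split a Git LFS pointer into its version, hash algorithm, digest and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents before and after this commit, copied from the diff.
OLD_POINTER = """
version https://git-lfs.github.com/spec/v1
oid sha256:9f657af116fc1c7d4574e14a4c2dda170046a40e6392867a1575f5b3ad439dde
size 1625422896
"""
NEW_POINTER = """
version https://git-lfs.github.com/spec/v1
oid sha256:762243cb0eb5b5b2bda6c19986ddd61458f9d2f5cf6dffe0a1be3f30c312c7cf
size 1625422896
"""

old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)

weights_changed = old["digest"] != new["digest"]
same_size = old["size"] == new["size"]

print("weights changed:", weights_changed)
print("same byte size:", same_size)
```

The digest differs while the size stays at 1625422896 bytes. That is what one would expect if the tensors kept their shapes and dtype and only their values changed, though the pointer alone cannot confirm it.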