gorkemgoknar commited on
Commit
71313bd
1 Parent(s): 21773d5

Update README.md

Browse files

added epoch details for perp 51.8 model

Files changed (1) hide show
  1. README.md +12 -10
README.md CHANGED
@@ -36,7 +36,7 @@ Using Google Colab for training.
36
 
37
  Additional tutorial and source will be in https://github.com/gorkemgoknar in later stage.
38
 
39
- Current accuracy 28.9 % , Perplexity : 86.71
40
 
41
  Models are available:
42
 
@@ -78,8 +78,8 @@ predicted_text = tokenizer.decode([predicted_index])
78
  print('input text:', text)
79
  print('predicted text:', predicted_text)
80
 
81
- # input text: Quem era Jim Henson? Jim Henson era um
82
- # predicted text: homem
83
 
84
  ```
85
 
@@ -102,8 +102,7 @@ for i, sample_output in enumerate(sample_outputs):
102
  print(">> Generated text {}\n\n{}".format(i+1, tokenizer.decode(sample_output.tolist())))
103
 
104
  # >> Generated text
105
- # Quem era Jim Henson? Jim Henson era um executivo de televisão e diretor de um grande estúdio de cinema mudo chamado Selig,
106
- # depois que o diretor de cinema mudo Georges Seuray dirigiu vários filmes para a Columbia e o estúdio.
107
 
108
  ```
109
 
@@ -121,12 +120,15 @@ Wikipedia Turkish article dump as of 28-10-2020
121
 
122
  ## Eval results
123
 
124
- epoch train_loss valid_loss accuracy perplexity time
125
- 0 6.922922 6.653488 0.148002 775.484253 2:26:41
126
- 1 4.799396 4.633522 0.277028 102.875755 3:03:38
127
- 2 4.610025 4.462641 0.289884 86.716248 2:34:50
128
-
 
 
129
 
 
130
 
131
  ```
132
 
 
36
 
37
  Additional tutorial and source will be in https://github.com/gorkemgoknar in later stage.
38
 
39
+ Current accuracy 33 % , Perplexity : 51.88
40
 
41
  Models are available:
42
 
 
78
  print('input text:', text)
79
  print('predicted text:', predicted_text)
80
 
81
+ # input text:
82
+ # predicted text:
83
 
84
  ```
85
 
 
102
  print(">> Generated text {}\n\n{}".format(i+1, tokenizer.decode(sample_output.tolist())))
103
 
104
  # >> Generated text
105
+ #
 
106
 
107
  ```
108
 
 
120
 
121
  ## Eval results
122
 
123
+ | epoch |train_loss |valid_loss |accuracy |perplexity |time |
124
+ | ----- | -------- |--------- | ---------- | --------- | ----- |
125
+ |0 |4.777015 |4.621834 |0.292547 |101.680367 |2:42:05|
126
+ |1 |4.509412 |4.403999 |0.305574 |81.777267 |1:09:38|
127
+ |2 |4.169529 |4.120755 |0.324908 |61.605747 |1:07:45|
128
+ |3 |4.293973 |4.177899 |0.317211 |65.228653 |1:07:02|
129
+ |4 |4.049848 |3.949103 |0.338347 |51.888783 |1:05:53|
130
 
131
+ #Epoch 0 on Tesla T4, others on V100
132
 
133
  ```
134