Update README.md
Browse files
README.md
CHANGED
@@ -138,9 +138,13 @@ F1-micro: 0.85, F1-macro: 0.52, F1-strict: 0.71
|
|
138 |
|
139 |
## Technical Specifications
|
140 |
|
141 |
-
### Model
|
142 |
|
143 |
-
|
|
|
|
|
|
|
|
|
144 |
|
145 |
### Compute Infrastructure
|
146 |
|
|
|
138 |
|
139 |
## Technical Specifications
|
140 |
|
141 |
+
### Model Objective
|
142 |
|
143 |
+
MLM is a technique in which you take your tokenized sample and replace some of the tokens with the <mask> token and train your model with it. The model then tries to predict what should come in the place of that <mask> token and gradually starts learning about the data. MLM teaches the model about the relationship between words
|
144 |
+
|
145 |
+
Eg. Suppose you have a sentence - 'Deep Learning is so cool! I love neural networks.', now replace few words with the <mask> token.
|
146 |
+
|
147 |
+
Masked Sentence - 'Deep Learning is so <mask>! I love <mask> networks.'
|
148 |
|
149 |
### Compute Infrastructure
|
150 |
|