kkmkorea committed on
Commit
b77f831
1 Parent(s): af980aa

Update README.md

Files changed (1)
  1. README.md +6 -2
README.md CHANGED
@@ -138,9 +138,13 @@ F1-micro: 0.85, F1-macro: 0.52, F1-strict: 0.71
 
  ## Technical Specifications

- ### Model Architecture and Objective

- [More Information Needed]

  ### Compute Infrastructure

 
  ## Technical Specifications

+ ### Model Objective

+ Masked language modeling (MLM) is a training objective in which some tokens of a tokenized sample are replaced with the <mask> token. The model is trained to predict the original token at each <mask> position, and in doing so it gradually learns the relationships between words in the data.
+
+ E.g., take the sentence 'Deep Learning is so cool! I love neural networks.' and replace a few words with the <mask> token.
+
+ Masked sentence: 'Deep Learning is so <mask>! I love <mask> networks.'
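
The masking step described above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions, not the procedure used to train this model: the helper `mask_tokens` is hypothetical, tokens are produced by whitespace splitting rather than by the model's tokenizer, and the 15% masking rate is the common BERT-style default rather than a value given in this README.

```python
import random

MASK_TOKEN = "<mask>"  # mask token as written in the README text


def mask_tokens(tokens, mask_prob=0.15, seed=None):
    """Replace a random subset of tokens with MASK_TOKEN.

    Hypothetical helper for illustration only.
    Returns (masked_tokens, labels). labels[i] holds the original token
    at masked positions and None everywhere else, so a model would only
    be scored on the positions it has to fill in.
    """
    rng = random.Random(seed)  # local RNG so the example is reproducible
    masked, labels = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            masked.append(MASK_TOKEN)  # hide the token from the model
            labels.append(tok)         # keep it as the prediction target
        else:
            masked.append(tok)
            labels.append(None)        # unmasked position: no target
    return masked, labels


# Assumption: naive whitespace tokenization, with punctuation pre-split.
tokens = "Deep Learning is so cool ! I love neural networks .".split()
masked, labels = mask_tokens(tokens, mask_prob=0.15, seed=0)
print(" ".join(masked))
print(labels)
```

Which positions get masked depends on the seed. Production MLM pipelines usually operate on subword token IDs and sometimes substitute a random or unchanged token instead of the mask, which this sketch leaves out.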
 
  ### Compute Infrastructure