fix typo
README.md CHANGED
@@ -129,7 +129,7 @@ Perplexity: [https://en.wikipedia.org/wiki/Perplexity](https://en.wikipedia.org/
 #### Testing Data
 
 The IMDB dataset from Stanford NLP comes pre-split into training and testing data of 25k reviews each. Our preprocessing, which included the chunking of concatenated, tokenized inputs
-into chunks of 256 tokens, increased these respective splits by approximately ~5k records each. We apply a single masking function to the
+into chunks of 256 tokens, increased these respective splits by approximately ~5k records each. We apply a single masking function to the evaluation dataset before testing as mentioned above.
 
 ### Results
 
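For context, the completed sentence describes concatenating the tokenized IMDB reviews, cutting them into 256-token chunks, and applying one fixed masking pass to the test split before evaluation. A minimal sketch of that kind of pipeline with Hugging Face `datasets`/`transformers` is shown below; the tokenizer checkpoint, masking probability, seed, and helper names are illustrative assumptions, not the repository's actual code.

```python
# Hypothetical sketch, not this repo's implementation: concatenate tokenized
# IMDB reviews, split into 256-token chunks, and mask the test split once.
import numpy as np
from datasets import load_dataset
from transformers import AutoTokenizer

CHUNK_SIZE = 256
tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")  # assumed checkpoint
imdb = load_dataset("imdb")  # pre-split: 25k train / 25k test reviews

def tokenize(batch):
    # No truncation here; long reviews are re-cut during chunking below.
    return tokenizer(batch["text"], return_special_tokens_mask=True)

def group_into_chunks(batch):
    # Concatenate every field across the batch, then slice into fixed
    # 256-token blocks; the tail shorter than CHUNK_SIZE is dropped.
    concatenated = {k: sum(batch[k], []) for k in batch.keys()}
    total = (len(concatenated["input_ids"]) // CHUNK_SIZE) * CHUNK_SIZE
    return {
        k: [v[i:i + CHUNK_SIZE] for i in range(0, total, CHUNK_SIZE)]
        for k, v in concatenated.items()
    }

def mask_once(example, idx, mlm_probability=0.15, seed=42):
    # One deterministic masking pass so every evaluation run scores the same
    # masked positions (assumed reading of "single masking function").
    rng = np.random.default_rng(seed + idx)
    input_ids = np.array(example["input_ids"])
    labels = input_ids.copy()
    special = np.array(example["special_tokens_mask"], dtype=bool)
    masked = (rng.random(len(input_ids)) < mlm_probability) & ~special
    labels[~masked] = -100                        # score only masked tokens
    input_ids[masked] = tokenizer.mask_token_id   # replace with [MASK]
    return {"input_ids": input_ids, "labels": labels}

tokenized = imdb.map(tokenize, batched=True, remove_columns=["text", "label"])
chunked = tokenized.map(group_into_chunks, batched=True)
eval_dataset = chunked["test"].map(mask_once, with_indices=True)
```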