distilbert-mlm-750k

distilbert-base-uncased, trained with masked language modeling (MLM) for 750K steps at a batch size of 64 on C4, MS MARCO, Wikipedia, S2ORC, and News.
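
To put the training budget in perspective, here is a minimal sketch that multiplies the stated step count by the stated batch size. It assumes each step processes exactly one batch of 64 sequences (no gradient accumulation), which the description does not confirm. The helper `load_fill_mask` is hypothetical, and its default repository ID `nreimers/distilbert-mlm-750k` is an assumption inferred from the model name and uploader; it relies on the `transformers` package being installed.

```python
# Figures taken from the model description.
STEPS = 750_000   # training steps
BATCH_SIZE = 64   # sequences per step


def sequences_seen(steps: int, batch_size: int) -> int:
    """Total sequences processed, assuming one batch per step
    (no gradient accumulation; this is an assumption)."""
    return steps * batch_size


TOTAL_SEQUENCES = sequences_seen(STEPS, BATCH_SIZE)


def load_fill_mask(model_id: str = "nreimers/distilbert-mlm-750k"):
    """Hypothetical helper: build a fill-mask pipeline for the checkpoint.

    The default model_id is an assumed Hub repository name.
    transformers is imported lazily so the arithmetic above runs without it.
    """
    from transformers import pipeline

    return pipeline("fill-mask", model=model_id)


if __name__ == "__main__":
    print(f"{TOTAL_SEQUENCES:,} sequences seen during training")
```

Under these assumptions the model saw 48,000,000 sequences. If the repository ID is correct, `load_fill_mask()` returns a pipeline that can be called on a sentence containing the tokenizer's mask token (available as `tokenizer.mask_token` on the returned pipeline).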