Difference between RoBERTa-base and RoBERTa?
Hi all,
I'm currently conducting some NLP research and I'm trying to understand the difference between RoBERTa (https://huggingface.co/docs/transformers/model_doc/roberta) and RoBERTa-base (https://huggingface.co/roberta-base).
I've read several pages online, but it's still not very clear. It seems as though RoBERTa-base is just a RoBERTa model with the default configuration?
Could someone advise please?
Thanks!
Hi!
RoBERTa (https://huggingface.co/docs/transformers/model_doc/roberta) is the architecture, while RoBERTa-base (https://huggingface.co/roberta-base) is one particular checkpoint using this architecture.
An architecture plus a checkpoint constitutes a "model" (the term "model" is a bit ambiguous).
A good doc for this is in the course: https://huggingface.co/course/chapter1/4?fw=pt#architectures-vs-checkpoints
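To make the distinction concrete, here is a small sketch using the transformers library: instantiating the RoBERTa architecture from its default configuration gives you randomly initialized weights, while loading the roberta-base checkpoint gives you the same architecture with pretrained weights.

```python
from transformers import RobertaConfig, RobertaModel

# Architecture only: build RoBERTa from its default configuration.
# The weights are randomly initialized, so this model is untrained.
config = RobertaConfig()
model = RobertaModel(config)
print(config.hidden_size, config.num_hidden_layers)  # 768 12

# Architecture + checkpoint: the same architecture, but with the
# pretrained weights of the "roberta-base" checkpoint loaded in.
pretrained = RobertaModel.from_pretrained("roberta-base")
```

Note that roberta-base happens to match the default RobertaConfig (12 layers, hidden size 768), which is why the two look so similar in the docs; the difference is whether the weights are pretrained or random.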
Hope this helps!