widget:
- text: Earth [MASK] is a growing field.
- text: Multiple [MASK] channels enable full polarimetry
- text: The [MASK] is capable of measuring in limb and nadir geometry
RemoteSensing Distilbert
The field of earth observation is increasingly growing. More and more data scientists are interested about this domain, and they're developing computer vision applications that do amazing things, while NLP doesn't seem to be given much consideration in this area That's why I posted Chramer/remote-sensing-distilbert-cased. This is masked language model trained on a corpus of technical information about space missions, instruments, and sensors.
The model is based on distilbert-base-cased, but I didn't have the chance to play with the hyperparameters of the model because of the limited computational capabilities I have. So there's a lot to improve! 😆
It was fun to publish my first model on hugging face! 🤩
Author: Marcello Politi (Twitter 🐦 ,LinkedIn 💼).
Perplexity
Test set: 4.5k sentences about technical space stuff.
Model | Perplexity |
---|---|
remote-sensing-distilbert-cased | 6.45 |
distilbert-base-cased | 33.77 |
Usage
from transformers import AutoModel, AutoTokenizer
model_name = "Chramer/remote-sensing-distilbert-cased"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name)