File size: 1,853 Bytes
070bf9b 249b43d 587d2ab 391a59f 249b43d c137590 1cd7f7c c137590 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 |
---
widget:
- text: "Earth [MASK] is a growing field."
- text: "Multiple [MASK] channels enable full polarimetry"
- text: "The [MASK] is capable of measuring in limb and nadir geometry"
---
# RemoteSensing Distilbert
![alt text](https://media.istockphoto.com/photos/space-communications-satellite-in-low-orbit-around-the-earth-elements-picture-id1062473882?b=1&k=20&m=1062473882&s=170667a&w=0&h=KWJwGSiXBffLgKdaQTxY-eY7ljJE5_3khXgQyAQHPbU=)
The field of earth observation is increasingly growing. More and more data scientists are interested about this domain, and they're developing computer vision applications that do amazing things, while NLP doesn't seem to be given much consideration in this area
That's why I posted [Chramer/remote-sensing-distilbert-cased](https://huggingface.co/Chramer/remote-sensing-distilbert-cased). This is masked language model trained on a corpus of technical information about space missions, instruments, and sensors.
The model is based on [distilbert-base-cased](https://huggingface.co/distilbert-base-uncased), but I didn't have the chance to play with the hyperparameters of the model because of the limited computational capabilities I have. So there's a lot to improve! 😆
It was fun to publish my first model on hugging face! 🤩
**Author:** Marcello Politi ([Twitter 🐦](https://twitter.com/_March08_) ,[LinkedIn 💼](https://www.linkedin.com/in/marcello-politi/)).
# Perplexity
Test set: 4.5k sentences about technical space stuff.
| Model | Perplexity |
| ------ | ------ |
| remote-sensing-distilbert-cased | **6.45** |
| distilbert-base-cased | 33.77 |
# Usage
```python
from transformers import AutoModel, AutoTokenizer
model_name = "Chramer/remote-sensing-distilbert-cased"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name)
```
|