kanelindsay2000's picture
Update README.md
9a06153 verified
metadata
library_name: transformers
pipeline_tag: text-classification
tags:
  - biology
  - herbarium
  - location
language:
  - en

RBG Kew Logo RBG Kew Herbarium Packets

RoBERTa for binary sequence classification fine-tuned to classify text derived from herbarium packets as location sensitive. Fine-tuned with 500,000 cleaned data samples from RBG Kew's Herbarium dataset available on GBIF (https://doi.org/10.15468/ly60bx). Trained primarily for English language but may work with other languages due to the large variety of text present in the Kew Herbarium.