Update README.md
README.md
@@ -3,11 +3,16 @@ license: mit
 base_model: xlm-roberta-base
 tags:
 - generated_from_trainer
+- NER
+- crypto
 metrics:
 - f1
 model-index:
-- name: xlm-roberta-base-finetuned-
+- name: xlm-roberta-base-finetuned-ner-crypto
 results: []
+widget:
+- text: "Didn't I tell you that that was a decent entry point on $PROPHET? If you are in - congrats, Prophet is up 90% in the last 2 weeks and 50% up in the last week alone"
+
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -22,16 +27,18 @@ It achieves the following results on the evaluation set:
 
 ## Model description
 
-More information needed
+This model is a fine-tuned version of xlm-roberta-base, specialized in Named Entity Recognition (NER) for the cryptocurrency domain. It is optimized to recognize and classify entities such as cryptocurrency ticker symbols, names, and addresses in text.
 
-## Intended uses & limitations
+## Intended uses
+Designed primarily for NER tasks in the cryptocurrency sector, the model identifies and categorizes ticker symbols, cryptocurrency names, and addresses in textual content.
 
-More information needed
 
-## Training and evaluation data
+## Limitations
 
-More information needed
+Performance may degrade on entities absent from the training data or rare within the cryptocurrency domain. The model may also be sensitive to variations in how entities are written and to their surrounding context.
+## Training and evaluation data
 
+The model was trained on a mix of artificially generated tweets and ERC20 token metadata fetched through the Covalent API (https://www.covalenthq.com/docs/unified-api/). GPT was used to generate 500 synthetic tweets tailored to the cryptocurrency domain, and the Covalent API supplied more than 20,000 unique ERC20 token metadata entries.
 ## Training procedure
 
 ### Training hyperparameters