license: apache-2.0
datasets:
  - shibing624/sharegpt_gpt4
language:
  - en
  - zh
  - fr
  - es

Model Card

This model is Pythia-70m-deduped, fine-tuned on a cleaned version of the ShareGPT data.
The cleaned dataset was obtained by removing duplicates and paraphrases from the original corpus.
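
The card does not say how duplicates and paraphrases were detected, so the following is only a minimal sketch of one possible approach, not the procedure actually used here. It drops exact duplicates after normalization and treats two texts as near-duplicates when the Jaccard similarity of their word sets reaches a threshold. The function names (`normalize`, `jaccard`, `dedupe`) and the `0.8` threshold are hypothetical, and whitespace tokenization is an assumption that suits space-delimited languages better than Chinese.

```python
import re


def normalize(text: str) -> str:
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    text = re.sub(r"[^\w\s]", " ", text.lower())
    return " ".join(text.split())


def jaccard(a: set, b: set) -> float:
    """Jaccard similarity of two token sets (1.0 when both are empty)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def dedupe(texts, threshold=0.8):
    """Keep the first occurrence of each text; drop exact and near duplicates.

    `threshold` is an illustrative value, not one taken from the model card.
    Comparing against every kept text is quadratic, which is fine for a
    sketch but too slow for a large corpus.
    """
    kept = []          # original texts that survive
    kept_tokens = []   # token sets of the surviving texts
    seen = set()       # normalized strings already kept
    for text in texts:
        norm = normalize(text)
        if norm in seen:
            continue  # exact duplicate after normalization
        tokens = set(norm.split())
        if any(jaccard(tokens, t) >= threshold for t in kept_tokens):
            continue  # near duplicate (rough stand-in for a paraphrase)
        seen.add(norm)
        kept.append(text)
        kept_tokens.append(tokens)
    return kept
```

Word overlap only catches paraphrases that share most of their vocabulary. Detecting rewordings with different vocabulary would need something stronger, such as sentence embeddings, which is beyond this sketch.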