wassemgtk commited on
Commit
d6f4eb6
1 Parent(s): 91a6c0d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -16,7 +16,7 @@ library_name: transformers
16
  license: cc-by-4.0
17
 
18
 
19
- # Palmyra Long 3B (8k tokens)
20
 
21
  <style>
22
  img {
@@ -29,7 +29,7 @@ img {
29
 
30
  ## Model Description
31
 
32
- Palmyra Long was primarily pre-trained with English text. Note that there is still a trace amount of non-English data present within the training corpus that was accessed through CommonCrawl. A causal language modeling (CLM) objective was utilized during the process of the model's pretraining. Similar to GPT-3, Palmyra Long is a member of the same family of models that only contain a decoder. As a result, it was pre-trained utilizing the objective of self-supervised causal language modeling. Palmyra Long uses the prompts and general experimental setup from GPT-3 in order to conduct its evaluation per GPT-3.
33
 
34
  ## Use case
35
  Palmyra Long is the fastest of Writer’s LLMs and can perform important tasks such as text parsing, simple classification, address correction, and keyword recognition. Providing more context drives even better performance.
 
16
  license: cc-by-4.0
17
 
18
 
19
+ # Palmyra 3B
20
 
21
  <style>
22
  img {
 
29
 
30
  ## Model Description
31
 
32
+ Palmyra 3B was primarily pre-trained with English text. Note that there is still a trace amount of non-English data present within the training corpus that was accessed through CommonCrawl. A causal language modeling (CLM) objective was utilized during the process of the model's pretraining. Similar to GPT-3, Palmyra Long is a member of the same family of models that only contain a decoder. As a result, it was pre-trained utilizing the objective of self-supervised causal language modeling. Palmyra Long uses the prompts and general experimental setup from GPT-3 in order to conduct its evaluation per GPT-3.
33
 
34
  ## Use case
35
  Palmyra Long is the fastest of Writer’s LLMs and can perform important tasks such as text parsing, simple classification, address correction, and keyword recognition. Providing more context drives even better performance.