Spaces:

GerbilLab
/

README

Running

crumb commited on Mar 30, 2023

Commit

5ca2943

1 Parent(s): 96fd405

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -52,6 +52,6 @@ Nearly every base model that isn't finetuned for a specific task was trained on
 ```
-The only application where I can imagine these being useful in the slightest is warm-starting very small encoder-decoder models or fitting a new scaling law that takes into account smaller models. Every model was trained on a singular GPU, either a RTX2060, RTX3060, or a T4.
 I'd , uh , appreciate help in evaluating all these models probably with lm harness

 ```
+The only application where I can imagine these being useful in the slightest is warm-starting very small encoder-decoder models or fitting a new scaling law that takes into account smaller models. They also could be usable on their own when finetuned on more specific datasets. Every model was trained on a singular GPU, either a RTX2060, RTX3060, or a T4.
 I'd , uh , appreciate help in evaluating all these models probably with lm harness