lilloukas committed
Commit
458318d
1 Parent(s): 02008e1

Update README.md

Files changed (1)
  1. README.md +5 -5
README.md CHANGED
@@ -41,7 +41,7 @@ We use state-of-the-art [Language Model Evaluation Harness](https://github.com/E


## Reproducing Evaluation Results
- Install LM Evaluation Harness
+ Install LM Evaluation Harness:
```
git clone https://github.com/EleutherAI/lm-evaluation-harness
cd lm-evaluation-harness
@@ -49,22 +49,22 @@ pip install -e .
```
Each task was evaluated on a single A100 80GB GPU.

- ARC
+ ARC:
```
python main.py --model hf-causal-experimental --model_args pretrained=lilloukas/GPlatty-30B --tasks arc_challenge --batch_size 1 --no_cache --write_out --output_path results/Platypus-30B/arc_challenge_25shot.json --device cuda --num_fewshot 25
```

- HellaSwag
+ HellaSwag:
```
python main.py --model hf-causal-experimental --model_args pretrained=lilloukas/GPlatty-30B --tasks hellaswag --batch_size 1 --no_cache --write_out --output_path results/Platypus-30B/hellaswag_10shot.json --device cuda --num_fewshot 10
```

- MMLU
+ MMLU:
```
python main.py --model hf-causal-experimental --model_args pretrained=lilloukas/GPlatty-30B --tasks hendrycksTest-* --batch_size 1 --no_cache --write_out --output_path results/Platypus-30B/mmlu_5shot.json --device cuda --num_fewshot 5
```

- TruthfulQA
+ TruthfulQA:
```
python main.py --model hf-causal-experimental --model_args pretrained=lilloukas/GPlatty-30B --tasks truthfulqa_mc --batch_size 1 --no_cache --write_out --output_path results/Platypus-30B/truthfulqa_0shot.json --device cuda
```
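
Each command in the section above writes its metrics to a JSON file under `results/Platypus-30B/`. As a minimal sketch (not part of this commit, and assuming the harness's usual report layout with a top-level `results` dict keyed by task name), the per-task scores and the mean accuracy over the `hendrycksTest-*` subtasks could be collected like this:
```
import json
from glob import glob
from statistics import mean

# Directory used by the --output_path arguments above.
RESULTS_DIR = "results/Platypus-30B"

for path in sorted(glob(f"{RESULTS_DIR}/*.json")):
    with open(path) as f:
        report = json.load(f)
    # Assumption: each report stores one entry per task under "results",
    # e.g. {"arc_challenge": {"acc": ..., "acc_norm": ...}}.
    tasks = report.get("results", {})
    mmlu_accs = [m["acc"] for t, m in tasks.items() if t.startswith("hendrycksTest-")]
    for task, metrics in tasks.items():
        if not task.startswith("hendrycksTest-"):
            floats = {k: round(v, 4) for k, v in metrics.items() if isinstance(v, float)}
            print(path, task, floats)
    if mmlu_accs:
        # MMLU is conventionally reported as the unweighted mean over its subtasks.
        print(path, "hendrycksTest (mean acc)", round(mean(mmlu_accs), 4))
```
The generic print simply dumps whatever float metrics each task file contains, so it should also cover tasks that report metrics other than `acc` (e.g. TruthfulQA's multiple-choice scores).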