|
--- |
|
tags: |
|
- llm-rs |
|
- ggml |
|
pipeline_tag: text-generation |
|
license: apache-2.0 |
|
language: |
|
- en |
|
--- |
|
|
|
# GGML converted versions of [EleutherAI](https://huggingface.co/EleutherAI)'s Pythia models |
|
|
|
## Description: |
|
The *Pythia Scaling Suite* is a collection of models developed to facilitate |
|
interpretability research. It contains two sets of eight models of sizes |
|
70M, 160M, 410M, 1B, 1.4B, 2.8B, 6.9B, and 12B. For each size, there are two |
|
models: one trained on the Pile, and one trained on the Pile after the dataset |
|
has been globally deduplicated. All 8 model sizes are trained on the exact |
|
same data, in the exact same order. We also provide 154 intermediate |
|
checkpoints per model, hosted on Hugging Face as branches. |
|
|
|
The Pythia model suite was deliberately designed to promote scientific |
|
research on large language models, especially interpretability research. |
|
Despite not centering downstream performance as a design goal, we find the |
|
models match or exceed the performance of |
|
similar and same-sized models, such as those in the OPT and GPT-Neo suites. |
|
|
|
## Converted Models: |
|
|
|
| Name | Based on | Type | Container | |
|
|-|-|-|-| |
|
| [pythia-70m-f16.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-70m-f16.bin) | [Pythia-70M](https://huggingface.co/EleutherAI/pythia-70m) | fp16 | GGML | |
|
| [pythia-70m-q4_0-ggjt.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-70m-q4_0-ggjt.bin) | [Pythia-70M](https://huggingface.co/EleutherAI/pythia-70m) | int4 | GGJT | |
|
| [pythia-70m-q4_0.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-70m-q4_0.bin) | [Pythia-70M](https://huggingface.co/EleutherAI/pythia-70m) | int4 | GGML | |
|
| [pythia-160m-f16.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-160m-f16.bin) | [Pythia-160M](https://huggingface.co/EleutherAI/pythia-160m) | fp16 | GGML | |
|
| [pythia-160m-q4_0-ggjt.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-160m-q4_0-ggjt.bin) | [Pythia-160M](https://huggingface.co/EleutherAI/pythia-160m) | int4 | GGJT | |
|
| [pythia-160m-q4_0.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-160m-q4_0.bin) | [Pythia-160M](https://huggingface.co/EleutherAI/pythia-160m) | int4 | GGML | |
|
| [pythia-410m-f16.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-410m-f16.bin) | [Pythia-410M](https://huggingface.co/EleutherAI/pythia-410m) | fp16 | GGML | |
|
| [pythia-410m-q4_0-ggjt.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-410m-q4_0-ggjt.bin) | [Pythia-410M](https://huggingface.co/EleutherAI/pythia-410m) | int4 | GGJT | |
|
| [pythia-410m-q4_0.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-410m-q4_0.bin) | [Pythia-410M](https://huggingface.co/EleutherAI/pythia-410m) | int4 | GGML | |
|
| [pythia-1b-f16.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-1b-f16.bin) | [Pythia-1B](https://huggingface.co/EleutherAI/pythia-1b) | fp16 | GGML | |
|
| [pythia-1b-q4_0-ggjt.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-1b-q4_0-ggjt.bin) | [Pythia-1B](https://huggingface.co/EleutherAI/pythia-1b) | int4 | GGJT | |
|
| [pythia-1b-q4_0.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-1b-q4_0.bin) | [Pythia-1B](https://huggingface.co/EleutherAI/pythia-1b) | int4 | GGML | |
|
| [pythia-1.4b-f16.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-1.4b-f16.bin) | [Pythia-1.4B](https://huggingface.co/EleutherAI/pythia-1.4b) | fp16 | GGML | |
|
| [pythia-1.4b-q4_0-ggjt.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-1.4b-q4_0-ggjt.bin) | [Pythia-1.4B](https://huggingface.co/EleutherAI/pythia-1.4b) | int4 | GGJT | |
|
| [pythia-1.4b-q4_0.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-1.4b-q4_0.bin) | [Pythia-1.4B](https://huggingface.co/EleutherAI/pythia-1.4b) | int4 | GGML | |
|
| [pythia-2.8b-f16.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-2.8b-f16.bin) | [Pythia-2.8B](https://huggingface.co/EleutherAI/pythia-2.8b) | fp16 | GGML | |
|
| [pythia-2.8b-q4_0-ggjt.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-2.8b-q4_0-ggjt.bin) | [Pythia-2.8B](https://huggingface.co/EleutherAI/pythia-2.8b) | int4 | GGJT | |
|
| [pythia-2.8b-q4_0.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-2.8b-q4_0.bin) | [Pythia-2.8B](https://huggingface.co/EleutherAI/pythia-2.8b) | int4 | GGML | |
|
|
|
## Usage |
|
|
|
### Python via [llm-rs](https://github.com/LLukas22/llm-rs-python): |
|
|
|
#### Installation |
|
Via pip: `pip install llm-rs` |
|
|
|
#### Run inference |
|
```python |
|
from llm_rs import AutoModel |
|
|
|
#Load the model, define any model you like from the list above as the `model_file` |
|
model = AutoModel.from_pretrained("Rustformers/pythia-ggml",model_file="pythia-70m-q4_0-ggjt.bin") |
|
|
|
#Generate |
|
print(model.generate("The meaning of life is")) |
|
``` |
|
|
|
### Rust via [Rustformers/llm](https://github.com/rustformers/llm): |
|
|
|
#### Installation |
|
``` |
|
git clone --recurse-submodules [email protected]:rustformers/llm.git |
|
cargo build --release |
|
``` |
|
|
|
#### Run inference |
|
``` |
|
cargo run --release -- gptneox infer -m path/to/model.bin -p "Tell me how cool the Rust programming language is:" |
|
``` |
|
|