File size: 5,223 Bytes
4ef0028
 
 
 
 
acf06dd
 
 
4ef0028
 
acf06dd
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
---
tags:
- llm-rs
- ggml
pipeline_tag: text-generation
license: apache-2.0
language:
- en
---

# GGML converted versions of [EleutherAI](https://huggingface.co/EleutherAI)'s Pythia models

## Description:
The *Pythia Scaling Suite* is a collection of models developed to facilitate 
interpretability research. It contains two sets of eight models of sizes 
70M, 160M, 410M, 1B, 1.4B, 2.8B, 6.9B, and 12B. For each size, there are two 
models: one trained on the Pile, and one trained on the Pile after the dataset 
has been globally deduplicated. All 8 model sizes are trained on the exact 
same data, in the exact same order. We also provide 154 intermediate 
checkpoints per model, hosted on Hugging Face as branches.

The Pythia model suite was deliberately designed to promote scientific 
research on large language models, especially interpretability research. 
Despite not centering downstream performance as a design goal, we find the 
models match or exceed the performance of 
similar and same-sized models, such as those in the OPT and GPT-Neo suites.

## Converted Models:

| Name   | Based on |  Type | Container |
|-|-|-|-|
| [pythia-70m-f16.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-70m-f16.bin) |  [Pythia-70M](https://huggingface.co/EleutherAI/pythia-70m) | fp16 | GGML |
| [pythia-70m-q4_0-ggjt.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-70m-q4_0-ggjt.bin) |  [Pythia-70M](https://huggingface.co/EleutherAI/pythia-70m) | int4 | GGJT |
| [pythia-70m-q4_0.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-70m-q4_0.bin) |  [Pythia-70M](https://huggingface.co/EleutherAI/pythia-70m) | int4 | GGML |
| [pythia-160m-f16.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-160m-f16.bin) |  [Pythia-160M](https://huggingface.co/EleutherAI/pythia-160m) | fp16 | GGML |
| [pythia-160m-q4_0-ggjt.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-160m-q4_0-ggjt.bin) |  [Pythia-160M](https://huggingface.co/EleutherAI/pythia-160m) | int4 | GGJT |
| [pythia-160m-q4_0.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-160m-q4_0.bin) |  [Pythia-160M](https://huggingface.co/EleutherAI/pythia-160m) | int4 | GGML |
| [pythia-410m-f16.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-410m-f16.bin) |  [Pythia-410M](https://huggingface.co/EleutherAI/pythia-410m) | fp16 | GGML |
| [pythia-410m-q4_0-ggjt.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-410m-q4_0-ggjt.bin) |  [Pythia-410M](https://huggingface.co/EleutherAI/pythia-410m) | int4 | GGJT |
| [pythia-410m-q4_0.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-410m-q4_0.bin) |  [Pythia-410M](https://huggingface.co/EleutherAI/pythia-410m) | int4 | GGML |
| [pythia-1b-f16.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-1b-f16.bin) |  [Pythia-1B](https://huggingface.co/EleutherAI/pythia-1b) | fp16 | GGML |
| [pythia-1b-q4_0-ggjt.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-1b-q4_0-ggjt.bin) |  [Pythia-1B](https://huggingface.co/EleutherAI/pythia-1b) | int4 | GGJT |
| [pythia-1b-q4_0.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-1b-q4_0.bin) |  [Pythia-1B](https://huggingface.co/EleutherAI/pythia-1b) | int4 | GGML |
| [pythia-1.4b-f16.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-1.4b-f16.bin) |  [Pythia-1.4B](https://huggingface.co/EleutherAI/pythia-1.4b) | fp16 | GGML |
| [pythia-1.4b-q4_0-ggjt.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-1.4b-q4_0-ggjt.bin) |  [Pythia-1.4B](https://huggingface.co/EleutherAI/pythia-1.4b) | int4 | GGJT |
| [pythia-1.4b-q4_0.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-1.4b-q4_0.bin) |  [Pythia-1.4B](https://huggingface.co/EleutherAI/pythia-1.4b) | int4 | GGML |
| [pythia-2.8b-f16.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-2.8b-f16.bin) |  [Pythia-2.8B](https://huggingface.co/EleutherAI/pythia-2.8b) | fp16 | GGML |
| [pythia-2.8b-q4_0-ggjt.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-2.8b-q4_0-ggjt.bin) |  [Pythia-2.8B](https://huggingface.co/EleutherAI/pythia-2.8b) | int4 | GGJT |
| [pythia-2.8b-q4_0.bin](https://huggingface.co/Rustformers/pythia-ggml/blob/main/pythia-2.8b-q4_0.bin) |  [Pythia-2.8B](https://huggingface.co/EleutherAI/pythia-2.8b) | int4 | GGML |

## Usage

### Python via [llm-rs](https://github.com/LLukas22/llm-rs-python):

#### Installation
Via pip: `pip install llm-rs`

#### Run inference
```python
from llm_rs import AutoModel

#Load the model, define any model you like from the list above as the `model_file`
model = AutoModel.from_pretrained("Rustformers/pythia-ggml",model_file="pythia-70m-q4_0-ggjt.bin")

#Generate
print(model.generate("The meaning of life is"))
```

### Rust via [Rustformers/llm](https://github.com/rustformers/llm): 

#### Installation
```
git clone --recurse-submodules [email protected]:rustformers/llm.git
cargo build --release
```

#### Run inference
```
cargo run --release -- gptneox infer -m path/to/model.bin  -p "Tell me how cool the Rust programming language is:"
```