File size: 1,232 Bytes
efa14b2
 
 
 
 
 
 
 
16158bb
 
efa14b2
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
---
license: bsd
---

Converted INT8/INT4 files for [fastllm](https://github.com/ztxz16/fastllm) with [baichuan-13b-chat](https://huggingface.co/baichuan-inc/Baichuan-13B-Chat)

Directly download from Baidu Netdisk:

Link:https://pan.baidu.com/s/1zADu6rd749zkkNAfl-aqtg 
Code:jqkl 

Updated time: 2023/07/27

```
baichuan-13b-chat-int4.flm:

+-----------------------------------------------------------------------------+
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
| N/A   53C    P0    56W / 250W |   7083MiB / 23040MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+

baichuan-13b-chat-int8.flm:

+-----------------------------------------------------------------------------+
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
| N/A   51C    P0   162W / 250W |  13151MiB / 23040MiB |     95%      Default |
+-------------------------------+----------------------+----------------------+
        
```

```python
from fastllm_pytools import llm
model = llm.model("baichuan-13b-chat-int4.flm")
for response in model.stream_response("介绍一下南京"):
    print(response, flush = True, end = "")
```