File size: 630 Bytes
a1f93e9
adffeb2
a2580f9
a1f93e9
 
 
7165db7
a1f93e9
 
adffeb2
2fa4e4c
 
fd3be03
a1f93e9
 
726a01e
 
 
dbf8ee3
 
fd3be03
726a01e
 
8344bac
 
 
 
 
fd3be03
dbf8ee3
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8344bac
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
---
title: Self Chat
emoji: πŸ€–~πŸ€–
colorFrom: yellow
colorTo: purple
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
license: apache-2.0
tags:
  - chatbot
short_description: Generating synthetic data via self-chatting
---




## Dependency

Install llama-cpp-python with the following script
```sh
pip install git+https://github.com/abetlen/llama-cpp-python.git -C cmake.args="-DGGML_BLAS=ON;-DGGML_BLAS_VENDOR=OpenBLAS"
```




## Local Inference

```sh
python models/cpp_qwen2.py
```

## Serverless Inference


```sh
python client_gradio.py
```


For streaming inference
```sh
python client_streaming.py
```