ApprikatAI committed on
Commit d9c52d8 · verified · 1 Parent(s): d81153a

Upload README.md with huggingface_hub

Files changed (1):
  1. README.md +32 -0

README.md ADDED
@@ -0,0 +1,32 @@
+ ---
+ base_model: amd/AMD-Llama-135m-code
+ datasets:
+ - cerebras/SlimPajama-627B
+ - manu/project_gutenberg
+ license: apache-2.0
+ tags:
+ - llama-cpp
+ - gguf-my-repo
+ ---
+
+ # ApprikatAI/AMD-Llama-135m-code-FP16-GGUF
+ This model was converted to GGUF format from [`amd/AMD-Llama-135m-code`](https://huggingface.co/amd/AMD-Llama-135m-code) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
+ Refer to the [original model card](https://huggingface.co/amd/AMD-Llama-135m-code) for more details on the model.
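+
+ The conversion can also be reproduced locally with llama.cpp's conversion script. The following is a minimal sketch, assuming a llama.cpp checkout and a local download of the original model in `./AMD-Llama-135m-code` (both paths are illustrative):
+
+ ```bash
+ # Convert the Hugging Face checkpoint to a single FP16 GGUF file.
+ python convert_hf_to_gguf.py ./AMD-Llama-135m-code \
+     --outfile amd-llama-135m-code-fp16.gguf \
+     --outtype f16
+ ```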
+
+ ## Use with llama.cpp
+ Install llama.cpp through brew (works on Mac and Linux):
+
+ ```bash
+ brew install llama.cpp
+ ```
+ Invoke the llama.cpp server or the CLI.
+
+ ### CLI:
+ ```bash
+ llama-cli --hf-repo ApprikatAI/AMD-Llama-135m-code-FP16-GGUF --hf-file amd-llama-135m-code-fp16.gguf -p "The meaning to life and the universe is"
+ ```
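+
+ Given the base model's code focus, a code-style prompt may suit it better than free-form text; an illustrative invocation (the prompt and token count are arbitrary):
+
+ ```bash
+ # -n limits the number of tokens generated after the prompt.
+ llama-cli --hf-repo ApprikatAI/AMD-Llama-135m-code-FP16-GGUF --hf-file amd-llama-135m-code-fp16.gguf -p "def fibonacci(n):" -n 128
+ ```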
+
+ ### Server:
+ ```bash
+ llama-server --hf-repo ApprikatAI/AMD-Llama-135m-code-FP16-GGUF --hf-file amd-llama-135m-code-fp16.gguf -c 2048
+ ```
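+
+ Once the server is up (it listens on `http://localhost:8080` by default), it can be queried over HTTP; a minimal sketch against the server's `/completion` endpoint (the prompt text is illustrative):
+
+ ```bash
+ # Request up to 64 tokens of completion for a code prompt.
+ curl http://localhost:8080/completion \
+     -H "Content-Type: application/json" \
+     -d '{"prompt": "def quicksort(arr):", "n_predict": 64}'
+ ```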