dusty-nv commited on
Commit
2128956
·
verified ·
1 Parent(s): 0e3257a

Upload folder using huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +15 -15
README.md CHANGED
@@ -1,19 +1,19 @@
1
  # DeepSeek-R1-Distill-Llama-8B-q4f16_ft-MLC
2
 
3
- | Model Configuration | Value |
4
- |----------------------:|:------------------------------------------:|
5
- | Source Model | `deepseek-ai/DeepSeek-R1-Distill-Llama-8B` |
6
- | Inference API | `MLC_LLM` |
7
- | Quantization | `q4f16_ft` |
8
- | Model Type | `llama` |
9
- | Vocab Size | `128256` |
10
- | Context Window Size | `131072` |
11
- | Prefill Chunk Size | `8192` |
12
- | Temperature | `0.6` |
13
- | Repetition Penalty | `1.0` |
14
- | `top_p` | `0.95` |
15
- | `pad_token_id` | `0` |
16
- | `bos_token_id` | `128000` |
17
- | `eos_token_id` | `128001` |
18
 
19
  See [`jetson-ai-lab.com/models.html`](https://jetson-ai-lab.com/models.html) for benchmarks, examples, and containers to deploy local serving and inference for these quantized models.
 
1
  # DeepSeek-R1-Distill-Llama-8B-q4f16_ft-MLC
2
 
3
+ | | Model Configuration |
4
+ |---------------------|:-------------------------------------------------------------------------------------------------------------:|
5
+ | Source Model | [`deepseek-ai/DeepSeek-R1-Distill-Llama-8B`](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B) |
6
+ | Inference API | `MLC_LLM` |
7
+ | Quantization | `q4f16_ft` |
8
+ | Model Type | `llama` |
9
+ | Vocab Size | `128256` |
10
+ | Context Window Size | `131072` |
11
+ | Prefill Chunk Size | `8192` |
12
+ | Temperature | `0.6` |
13
+ | Repetition Penalty | `1.0` |
14
+ | `top_p` | `0.95` |
15
+ | `pad_token_id` | `0` |
16
+ | `bos_token_id` | `128000` |
17
+ | `eos_token_id` | `128001` |
18
 
19
  See [`jetson-ai-lab.com/models.html`](https://jetson-ai-lab.com/models.html) for benchmarks, examples, and containers to deploy local serving and inference for these quantized models.