dusty-nv
/

DeepSeek-R1-Distill-Llama-8B-q4f16_ft-MLC

Model card Files Files and versions Community

dusty-nv commited on Jan 29

Commit

2128956

verified ·

1 Parent(s): 0e3257a

Upload folder using huggingface_hub

Browse files

Files changed (1) hide show

README.md +15 -15

README.md CHANGED Viewed

@@ -1,19 +1,19 @@
 # DeepSeek-R1-Distill-Llama-8B-q4f16_ft-MLC
-| Model Configuration   |                   Value                    |
-|----------------------:|:------------------------------------------:|
-| Source Model          | `deepseek-ai/DeepSeek-R1-Distill-Llama-8B` |
-| Inference API         |                 `MLC_LLM`                  |
-| Quantization          |                 `q4f16_ft`                 |
-| Model Type            |                  `llama`                   |
-| Vocab Size            |                  `128256`                  |
-| Context Window Size   |                  `131072`                  |
-| Prefill Chunk Size    |                   `8192`                   |
-| Temperature           |                   `0.6`                    |
-| Repetition Penalty    |                   `1.0`                    |
-| `top_p`               |                   `0.95`                   |
-| `pad_token_id`        |                    `0`                     |
-| `bos_token_id`        |                  `128000`                  |
-| `eos_token_id`        |                  `128001`                  |
 See [`jetson-ai-lab.com/models.html`](https://jetson-ai-lab.com/models.html) for benchmarks, examples, and containers to deploy local serving and inference for these quantized models.

 # DeepSeek-R1-Distill-Llama-8B-q4f16_ft-MLC
+|                     |                                              Model Configuration                                              |
+|---------------------|:-------------------------------------------------------------------------------------------------------------:|
+| Source Model        | [`deepseek-ai/DeepSeek-R1-Distill-Llama-8B`](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B) |
+| Inference API       |                                                   `MLC_LLM`                                                   |
+| Quantization        |                                                  `q4f16_ft`                                                   |
+| Model Type          |                                                    `llama`                                                    |
+| Vocab Size          |                                                   `128256`                                                    |
+| Context Window Size |                                                   `131072`                                                    |
+| Prefill Chunk Size  |                                                    `8192`                                                     |
+| Temperature         |                                                     `0.6`                                                     |
+| Repetition Penalty  |                                                     `1.0`                                                     |
+| `top_p`             |                                                    `0.95`                                                     |
+| `pad_token_id`      |                                                      `0`                                                      |
+| `bos_token_id`      |                                                   `128000`                                                    |
+| `eos_token_id`      |                                                   `128001`                                                    |
 See [`jetson-ai-lab.com/models.html`](https://jetson-ai-lab.com/models.html) for benchmarks, examples, and containers to deploy local serving and inference for these quantized models.