iwaitu
/

llama-3.1-70b-chinese-chat-FP8

Text Generation

text-generation-inference

Model card Files Files and versions Community

base on shenzhi-wang/Llama3.1-70B-Chinese-Chat

推荐使用 2x H100 来运行

或者使用4x RTX6000 ADA 48G 来运行

Downloads last month: 11

Safetensors

Model size

70.6B params

Tensor type

BF16

·

F8_E4M3

·

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for iwaitu/llama-3.1-70b-chinese-chat-FP8

Base model

meta-llama/Llama-3.1-70B

Finetuned

meta-llama/Llama-3.1-70B-Instruct

Quantized

shenzhi-wang/Llama3.1-70B-Chinese-Chat

Quantized

(4)

this model