Zheng Han
traphix
AI & ML interests
None yet
Recent Activity
New activity 5 days ago · Qwen/QwQ-32B: "This model beats Qwen Max!"
New activity 6 days ago · Qwen/Qwen2.5-14B-Instruct-1M: "Does vllm 0.7.3 support this model?"
Organizations
None yet
traphix's activity
This model beats Qwen Max! · 4 · #33 opened 7 days ago by MrDevolver

Does vllm 0.7.3 support this model? · #10 opened 6 days ago by traphix
Are there any accuracy results compared to the original DeepSeek-V3? · #6 opened 20 days ago by traphix
why "MLA is not supported with awq_marlin quantization. Disabling MLA." with 4090 * 32 (4 node / vllm 0.7.2)
3
#14 opened 21 days ago
by
FightLLM
Are there any accuracy results compared to the original DeepSeek-R1? · 2 · #15 opened 20 days ago by traphix
Has anyone evaluated the performance of the AWQ version of the model on benchmarks? · 4 · #8 opened 28 days ago by liuqianchao
Skips the thinking process · 11 · #5 opened about 1 month ago by muzizon
Deployment framework · 27 · #2 opened about 2 months ago by xro7
vLLM support for A100 · 17 · #2 opened about 2 months ago by HuggingLianWang
Any plans to quantize Qwen/Qwen2.5-72B-Instruct to W8A8? · #1 opened about 1 month ago by traphix
Can it run on A100/A800 with vLLM? · 3 · #1 opened 8 months ago by Parkerlambert123
Quantize DeepSeek-Coder-V2-Instruct to W8A8 (INT8)? · #2 opened 7 months ago by traphix