Ben Li's picture

Ben Li

bash99

·

bash99

AI & ML interests

AIGC, stable diffusion, chatgpt

Recent Activity

liked a model about 16 hours ago

appmana/deepseek-v4-int4-int8

new activity 19 days ago

deepseek-ai/DeepSeek-V4-Flash:Unable to run on 2x RTX Pro 6000 (DEEP_GEMM problem)

liked a model 22 days ago

Lorbus/Qwen3.6-27B-int4-AutoRound

View all activity

Organizations

None yet

New activity in deepseek-ai/DeepSeek-V4-Flash 19 days ago

Unable to run on 2x RTX Pro 6000 (DEEP_GEMM problem)

#15 opened 22 days ago by

New activity in Qwen/Qwen3.6-27B 23 days ago

Anyone noticed the diffrence of sampling parameters between 27B and 35B-A3B (both 3.6)

#10 opened 23 days ago by

New activity in mratsim/MiniMax-M2.1-FP8-INT4-AWQ 4 months ago

Should I tuned for this warning?

#4 opened 4 months ago by

New activity in RedHatAI/Qwen3-32B-FP8-dynamic about 1 year ago

How can I repeat the eval results?

#2 opened about 1 year ago by

What is the difference between Qwen/Qwen3-32B-FP8 and this quatinized model？

#1 opened about 1 year ago by

New activity in rhymes-ai/Aria over 1 year ago

llama.cpp support

#1 opened over 1 year ago by

New activity in Qwen/Qwen2-VL-72B-Instruct-GPTQ-Int4 over 1 year ago

Any one can use VLLM or any other engine support dynamic batch to run this with more than 1 GPU?

#1 opened over 1 year ago by

New activity in Alibaba-NLP/gte-multilingual-base almost 2 years ago

某些特殊情况匹配排序会有错）

#5 opened almost 2 years ago by

New activity in jondurbin/airoboros-13b-gpt4-1.4 almost 3 years ago

4 bit GPTQ

#1 opened almost 3 years ago by

New activity in coyude/Nous-Hermes-13b-Chinese-plus-GPTQ almost 3 years ago

请问这个带Plus的版本和不带的有什么区别？

#1 opened almost 3 years ago by

New activity in TheBloke/Wizard-Vicuna-13B-Uncensored-GPTQ almost 3 years ago

Gibberish on 'latest', with recent qwopqwop GPTQ/triton and ooba?

#2 opened about 3 years ago by

New activity in thatname/Ziya-LLaMA-13B-v1-ggml almost 3 years ago

convert ziya to ggml shell

#1 opened almost 3 years ago by

New activity in anon8231489123/vicuna-13b-GPTQ-4bit-128g about 3 years ago

Vram usage

#3 opened about 3 years ago by