Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
29.1
TFLOPS
64
16
Steve Li
CHNtentes
Follow
ltim's profile picture
21world's profile picture
2 followers
·
13 following
CHNtentes
AI & ML interests
None yet
Recent Activity
new
activity
about 13 hours ago
Qwen/Qwen2.5-Omni-7B:
Qwen2.5-Omni-7B-AWQ?
new
activity
about 13 hours ago
Qwen/Qwen2.5-Omni-7B:
Can this be quantized with bitsAndBytes?
liked
a model
2 days ago
deepseek-ai/DeepSeek-V3-0324
View all activity
Organizations
None yet
CHNtentes
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
Qwen/Qwen2.5-Omni-7B
about 13 hours ago
Qwen2.5-Omni-7B-AWQ?
#7 opened about 13 hours ago by
CHNtentes
Can this be quantized with bitsAndBytes?
5
#6 opened about 17 hours ago by
Permahuman
liked
a model
2 days ago
deepseek-ai/DeepSeek-V3-0324
Text Generation
•
Updated
about 12 hours ago
•
32.5k
•
•
1.83k
New activity in
Qwen/Qwen1.5-MoE-A2.7B-Chat-GPTQ-Int4
3 days ago
Int4为什么比没量化的float32和float16还慢
1
#3 opened 19 days ago by
hujianmin
New activity in
THUDM/CogView4-6B
5 days ago
Maybe rename it 15B
2
#5 opened 22 days ago by
CHNtentes
New activity in
mistralai/Mistral-Small-3.1-24B-Instruct-2503
5 days ago
Quantized models with vision included?
12
#27 opened 8 days ago by
geoad
New activity in
bartowski/nvidia_Llama-3_3-Nemotron-Super-49B-v1-GGUF
8 days ago
Does it support thinking on/off?
5
#2 opened 8 days ago by
CHNtentes
liked
a model
21 days ago
Qwen/QwQ-32B
Text Generation
•
Updated
16 days ago
•
663k
•
•
2.55k
New activity in
unsloth/QwQ-32B
21 days ago
I could be wrong, but I think the <think> tag needs to be removed from the last bit of the jinja template in the tokenizer_config.json
3
#1 opened 22 days ago by
jth01
liked
a Space
21 days ago
Running
496
496
QwQ 32B Demo
🌖
Send text and get detailed responses
New activity in
Qwen/QwQ-32B
21 days ago
When will you fix the model replies missing</think>\n start tags
17
#19 opened 21 days ago by
xldistance
liked
a Space
22 days ago
Running
on
Zero
104
104
CogView4
🖌
Gradio demo of CogView4-6B
New activity in
moonshotai/Moonlight-16B-A3B-Instruct
24 days ago
When running example got ValueError: Attention mask should be of size (1, 1, 1, 30), but is torch.Size([1, 1, 1, 29])
6
#9 opened 27 days ago by
Phando
New activity in
nvidia/DeepSeek-R1-FP4
about 1 month ago
Benchmark results compared to orig fp8 / int4 quants etc?
4
#1 opened about 1 month ago by
CHNtentes
New activity in
Congliu/Chinese-DeepSeek-R1-Distill-data-110k
about 1 month ago
有的题目不完整
3
#6 opened about 1 month ago by
CHNtentes
liked
a model
about 1 month ago
ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q4
Reinforcement Learning
•
Updated
1 day ago
•
12.2k
•
207
New activity in
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
2 months ago
Why change the configuration of the tokenizer?
2
#4 opened 2 months ago by
Lingrui
New activity in
unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF
2 months ago
Quality vs 4bnb version
5
#2 opened 2 months ago by
supercharge19
New activity in
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
2 months ago
System Prompt
8
#2 opened 2 months ago by
Wanfq
New activity in
Qwen/Qwen2.5-Coder-32B-Instruct
2 months ago
Thieves!
9
#36 opened 2 months ago by
supercharge19
Load more