Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
29.1
TFLOPS
69
16
Steve Li
CHNtentes
Follow
ltim's profile picture
21world's profile picture
2 followers
·
13 following
CHNtentes
AI & ML interests
None yet
Recent Activity
new
activity
1 day ago
stduhpf/google-gemma-3-12b-it-qat-q4_0-gguf-small:
Update?
new
activity
5 days ago
meta-llama/Llama-4-Scout-17B-16E-Instruct:
Unethical comparisons with Deepseek replacing chinese languages by thai/vietnamese only
new
activity
9 days ago
google/gemma-3-27b-it:
SigLIP or SigLIP2 encoder?
View all activity
Organizations
None yet
CHNtentes
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
stduhpf/google-gemma-3-12b-it-qat-q4_0-gguf-small
1 day ago
Update?
1
#4 opened 1 day ago by
CHNtentes
New activity in
meta-llama/Llama-4-Scout-17B-16E-Instruct
5 days ago
Unethical comparisons with Deepseek replacing chinese languages by thai/vietnamese only
5
#32 opened 6 days ago by
krustik
New activity in
google/gemma-3-27b-it
9 days ago
SigLIP or SigLIP2 encoder?
7
#48 opened 10 days ago by
orrzohar
New activity in
deepseek-ai/DeepSeek-V3-0324
11 days ago
Downloading weights without duplicates
2
#52 opened 11 days ago by
vadimkantorov
New activity in
deepseek-ai/DeepSeek-V3-0324
15 days ago
听我说谢谢你因为有你温暖了四季
4
#37 opened 17 days ago by
iwangdy
New activity in
Qwen/Qwen2.5-Omni-7B
16 days ago
Qwen2.5-Omni-7B-AWQ?
1
#7 opened 16 days ago by
CHNtentes
Can this be quantized with bitsAndBytes?
8
#6 opened 17 days ago by
Permahuman
liked
a model
18 days ago
deepseek-ai/DeepSeek-V3-0324
Text Generation
•
Updated
16 days ago
•
201k
•
•
2.54k
New activity in
Qwen/Qwen1.5-MoE-A2.7B-Chat-GPTQ-Int4
19 days ago
Int4为什么比没量化的float32和float16还慢
1
#3 opened about 1 month ago by
hujianmin
New activity in
THUDM/CogView4-6B
21 days ago
Maybe rename it 15B
2
#5 opened about 1 month ago by
CHNtentes
New activity in
mistralai/Mistral-Small-3.1-24B-Instruct-2503
21 days ago
Quantized models with vision included?
12
#27 opened 24 days ago by
geoad
New activity in
bartowski/nvidia_Llama-3_3-Nemotron-Super-49B-v1-GGUF
24 days ago
Does it support thinking on/off?
5
#2 opened 24 days ago by
CHNtentes
liked
a model
about 1 month ago
Qwen/QwQ-32B
Text Generation
•
Updated
Mar 11
•
810k
•
•
2.66k
New activity in
unsloth/QwQ-32B
about 1 month ago
I could be wrong, but I think the <think> tag needs to be removed from the last bit of the jinja template in the tokenizer_config.json
3
#1 opened about 1 month ago by
jth01
liked
a Space
about 1 month ago
Running
526
526
QwQ 32B Demo
🌖
Send text and get detailed responses
New activity in
Qwen/QwQ-32B
about 1 month ago
When will you fix the model replies missing</think>\n start tags
17
#19 opened about 1 month ago by
xldistance
liked
a Space
about 1 month ago
Running
on
Zero
106
106
CogView4
🖌
Gradio demo of CogView4-6B
New activity in
moonshotai/Moonlight-16B-A3B-Instruct
about 1 month ago
When running example got ValueError: Attention mask should be of size (1, 1, 1, 30), but is torch.Size([1, 1, 1, 29])
6
#9 opened about 1 month ago by
Phando
New activity in
nvidia/DeepSeek-R1-FP4
about 2 months ago
Benchmark results compared to orig fp8 / int4 quants etc?
5
#1 opened about 2 months ago by
CHNtentes
New activity in
Congliu/Chinese-DeepSeek-R1-Distill-data-110k
about 2 months ago
有的题目不完整
3
#6 opened about 2 months ago by
CHNtentes
Load more