Hugging Face
Yi Cui
onekq
63 followers · 24 following
https://onekq.ai
onekq_ai
onekq
yicui
AI & ML interests
Benchmark, Code Generation Model
Recent Activity
posted an update about 13 hours ago
Qwen made good students, DeepSeek made a genius. This is my summary of how the two differ. I don't think these two players are coordinating, but both have clear goals: one is building an ecosystem, the other is pushing toward AGI. And IMO they are both doing really well.
replied to their post about 13 hours ago
The performance of deepseek-r1-distill-qwen-32b is abysmal. I know Qwen instruct (not coder) is quite poor at coding. As such, I have low expectations for other R1 reproduction works that are also based on Qwen instruct. https://huggingface.co/collections/onekq-ai/r1-reproduction-works-67a93f2fb8b21202c9eedf0b This makes it particularly mysterious what went into QwQ-32B. Why did it work so well? Was it trained from scratch? Does anyone have insights about this? https://huggingface.co/spaces/onekq-ai/WebApp1K-models-leaderboard
updated a model about 13 hours ago
onekq-ai/OneSQL-v0.1-Qwen-32B-GGUF
Organizations
onekq's activity
liked a Space 5 months ago
Quant Request
Submit Hugging Face model links for quantization requests