Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
37
10
Le Huy Hoang
splendor1811
Follow
yehors-cv's profile picture
21world's profile picture
2 followers
·
14 following
huyhoang18112k2
AI & ML interests
Computer Vision
Recent Activity
reacted
to
sometimesanotion
's
post
with 🚀
about 23 hours ago
**Update** Either I had some wrong numbers plugged in to estimate benchmark numbers from comparator, or the benchmark changed. Virtuoso Small v2 at 41.07 average is still very impressive, especially for writing draft copy for business purposes, while Lamarck remains a chatty generalist-reasoning model. I've felt confident that 14B Qwen finetunes and merges could break the 42.0 average, and Arcee **came close** with https://huggingface.co/arcee-ai/Virtuoso-Small-2. Congratulations to @arcee-ai! Just two months ago, it was easy to think that 14B had plateaued, that you could have high IFEVAL or high MUSR/MATH/GPQA at 14B, but not both. That barrier is completely shattered. I see a pathway to even better, and Virtuoso Small 2 is a big part of why. Very impressive work. This community would expect no less from Arcee. Just look at this graph! Keep in mind, my merges here build on the first Virtuoso Small, and *-DS merges build on DeepSeek R1. There are some impressive merges in the pipe!
upvoted
a
paper
19 days ago
Tensor Product Attention Is All You Need
upvoted
a
collection
24 days ago
Phi-3
View all activity
Organizations
None yet
splendor1811
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
2 models
9 months ago
microsoft/Phi-3-mini-128k-instruct
Text Generation
•
Updated
Aug 20, 2024
•
254k
•
1.63k
nvidia/Llama3-ChatQA-1.5-8B
Text Generation
•
Updated
May 24, 2024
•
10.7k
•
555
liked
a Space
9 months ago
Runtime error
39
🐢
LLM Merge Adapter
liked
a model
9 months ago
HyperGAI/HPT1_5-Air-Llama-3-8B-Instruct-multimodal
Text Generation
•
Updated
May 15, 2024
•
13
•
47
liked
a dataset
9 months ago
allenai/dolma
Updated
Apr 17, 2024
•
1.33k
•
870
liked
a model
10 months ago
llava-hf/llava-1.5-7b-hf
Image-Text-to-Text
•
Updated
8 days ago
•
752k
•
226
liked
a model
about 1 year ago
mistralai/Mixtral-8x7B-Instruct-v0.1
Text Generation
•
Updated
Aug 19, 2024
•
1.65M
•
•
4.28k
liked
a Space
over 1 year ago
Running
on
CPU Upgrade
12.4k
🏆
Open LLM Leaderboard
Track, rank and evaluate open LLMs and chatbots
liked
a dataset
almost 2 years ago
togethercomputer/RedPajama-Data-1T
Viewer
•
Updated
Jun 17, 2024
•
1.73M
•
1.25k
•
1.07k
liked
a Space
almost 2 years ago
Running
2.51k
⚡️
Whisper JAX