AI & ML interests

Evaluating open LLMs

Recent Activity

open-llm-leaderboard's activity

victorย 
posted an update about 9 hours ago
view post
Post
498
Hey everyone, we've given https://hf.co/spaces page a fresh update!

Smart Search: Now just type what you want to doโ€”like "make a viral meme" or "generate music"โ€”and our search gets it.

New Categories: Check out the cool new filter bar with icons to help you pick a category fast.

Redesigned Space Cards: Reworked a bit to really show off the app descriptions, so you know what each Space does at a glance.

Random Prompt: Need ideas? Hit the dice button for a burst of inspiration.

Weโ€™d love to hear what you thinkโ€”drop us some feedback plz!
  • 1 reply
ยท
albertvillanovaย 
posted an update about 9 hours ago
view post
Post
267
๐Ÿš€ Introducing @huggingface Open Deep-Research๐Ÿ’ฅ

In just 24 hours, we built an open-source agent that:
โœ… Autonomously browse the web
โœ… Search, scroll & extract info
โœ… Download & manipulate files
โœ… Run calculations on data

55% on GAIA validation set! Help us improve it!๐Ÿ’ก
https://huggingface.co/blog/open-deep-research
  • 1 reply
ยท
AdinaYย 
posted an update 7 days ago
victorย 
posted an update 8 days ago
view post
Post
2906
Finally, an open-source AI that turns your lyrics into full songs is hereโ€”meet YuE! Unlike other tools that only create short clips, YuE can make entire songs (up to 5 minutes) with vocals, melody, and instruments all working together. Letsss go!

m-a-p/YuE-s1-7B-anneal-en-cot
AdinaYย 
posted an update 9 days ago
view post
Post
2580
๐Ÿ”ฅSo many exciting releases coming from the Chinese community this month!
zh-ai-community/2025-january-6786b054f492fb223591269e

LLMs:
โœจ Qwen2.5 -1M by Alibaba
Qwen/qwen25-1m-679325716327ec07860530ba
โœจ InternLM3-8B-Instruct by Shanghai AI Lab
internlm/internlm3-8b-instruct
โœจ MiniMax-Text-01 by MiniMax AI
MiniMaxAI/MiniMax-Text-01
โœจ RWKV-7 by BlinkDL -- RNN + Transformer ๐Ÿ‘€
BlinkDL/rwkv-7-world
โœจ DeepSeek-R1 by DeepSeek -- THE ONE ๐Ÿ™Œ
https://huggingface.co/deepseek-ai
โœจ Baichuan-M1-14B by Baichuan - Medical ๐Ÿฉบ
baichuan-inc/Baichuan-M1-14B-Base
โœจ Qwen2.5-Math-PRM by Alibaba - Math ๐Ÿ”ข
Qwen/Qwen2.5-Math-PRM-7B

Code:
โœจ Tare by Bytedance
https://trae.ai

TTS:
โœจ T2A-01-HD by MiniMax AI
https://hailuo.ai/audio
โœจ LLaSA by HKUST Audio
HKUSTAudio/Llasa-3B

MLLM:
โœจ Kimi k1.5 by Moonshot AI
https://kimi.ai
โœจ MiniCPM-o-2_6 by OpenBMB
openbmb/MiniCPM-o-2_6
โœจ Sa2VA-4B by ByteDance
ByteDance/Sa2VA-4B
โœจ VideoLLaMA 3 by Alibaba DAMO
DAMO-NLP-SG/videollama3-678cdda9281a0e32fe79af15
โœจ LLaVA-Mini by Chinese Academy of Sciences
ICTNLP/llava-mini-llama-3.1-8b
โœจHunyuan-7B by Tencent
tencent/Hunyuan-7B-Instruct
โœจ Hunyuan 3D 2.0 by Tencent
tencent/Hunyuan3D-2
โœจMiniMax-VL-01 by MiniMax AI - A non transformer based VLM ๐Ÿ‘€
MiniMaxAI/MiniMax-VL-01

Agent:
โœจ UI-TARS by Bytedance
bytedance-research/UI-TARS-7B-SFT
โœจ GLM-PC by Zhipu AI
https://cogagent.aminer.cn

Dataset:
โœจ Fineweb-Edu-Chinese by Opencsg
opencsg/Fineweb-Edu-Chinese-V2.1
โœจ Multimodal_textbook by Alibaba
DAMO-NLP-SG/multimodal_textbook
โœจ MME-Finance by Hithink AI
ยท
lewtunย 
posted an update 11 days ago
view post
Post
9796
We are reproducing the full DeepSeek R1 data and training pipeline so everybody can use their recipe. Instead of doing it in secret we can do it together in the open!

๐Ÿงช Step 1: replicate the R1-Distill models by distilling a high-quality reasoning corpus from DeepSeek-R1.

๐Ÿง  Step 2: replicate the pure RL pipeline that DeepSeek used to create R1-Zero. This will involve curating new, large-scale datasets for math, reasoning, and code.

๐Ÿ”ฅ Step 3: show we can go from base model -> SFT -> RL via multi-stage training.

Follow along: https://github.com/huggingface/open-r1
ยท
AdinaYย 
posted an update 12 days ago
AdinaYย 
posted an update 13 days ago
AdinaYย 
posted an update 13 days ago
AdinaYย 
posted an update 15 days ago
view post
Post
2933
What happened yesterday in the Chinese AI community? ๐Ÿš€

T2A-01-HD ๐Ÿ‘‰ https://hailuo.ai/audio
MiniMax's Text-to-Audio model, now in Hailuo AI, offers 300+ voices in 17+ languages and instant emotional voice cloning.

Tare ๐Ÿ‘‰ https://www.trae.ai/
A new coding tool by Bytedance for professional developers, supporting English & Chinese with free access to Claude 3.5 and GPT-4 for a limited time.

DeepSeek-R1 Series ๐Ÿ‘‰ deepseek-ai/deepseek-r1-678e1e131c0169c0bc89728d
Open-source reasoning models with MIT license by DeepSeek.

Kimi K 1.5 ๐Ÿ‘‰ https://github.com/MoonshotAI/Kimi-k1.5 | https://kimi.ai/
An O1-level multi-modal model by MoonShot AI, utilizing reinforcement learning with long and short-chain-of-thought and supporting up to 128k tokens.

And todayโ€ฆ

Hunyuan 3D-2.0 ๐Ÿ‘‰ tencent/Hunyuan3D-2
A SoTA 3D synthesis system for high-res textured assets by Tencent Hunyuan , with open weights and code!

Stay tuned for more updates ๐Ÿ‘‰ https://huggingface.co/zh-ai-community
AdinaYย 
posted an update 15 days ago
view post
Post
918
Hunyuan 3D 2.0๐Ÿ”ฅ a synthesis system for high-res textured 3D assets released by Tencent Hunyuan

2 key components: Hunyuan3D-DiT (geometry) and Hunyuan3D-Paint (textures) work together, achieving highly realistic 3D results.

Model: tencent/Hunyuan3D-2
Demo coming soon!
AdinaYย 
posted an update 16 days ago
view post
Post
2811
BIG release by DeepSeek AI๐Ÿ”ฅ๐Ÿ”ฅ๐Ÿ”ฅ

DeepSeek-R1 & DeepSeek-R1-Zero: two 660B reasoning models are here, alongside 6 distilled dense models (based on Llama & Qwen) for the community!
https://huggingface.co/deepseek-ai
deepseek-ai/DeepSeek-R1

โœจ MIT License : enabling distillation for custom models
โœจ 32B & 70B models match OpenAI o1-mini in multiple capabilities
โœจ API live now! Access Chain of Thought reasoning with model='deepseek-reasoner'
AdinaYย 
posted an update 19 days ago