Multimodal 💬
- We have released SmolVLM, our tiniest VLMs yet, in 256M and 500M sizes, along with ColSmol, companion retrieval models for multimodal RAG (quick-start sketch below)
- UI-TARS is a new model family by ByteDance to unlock agentic GUI control 🤯, in 2B, 7B and 72B sizes
- Alibaba DAMO Academy released VideoLLaMA3, new video LMs that come in 2B and 7B
- MiniMaxAI released MiniMax-VL-01, whose decoder is based on the MiniMax-Text-01 456B MoE model with long context
- Dataset: Yale released MMVU, a new benchmark for expert-level, multi-discipline video understanding
- Dataset: CAIS released Humanity's Last Exam (HLE), a new, challenging multimodal benchmark
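If you want to kick the tires on SmolVLM, here's a minimal sketch using transformers. The checkpoint id and the chat-template flow follow the usual model-card pattern and are assumptions here, so treat it as a starting point rather than the official recipe:

```python
import torch
from PIL import Image
from transformers import AutoProcessor, AutoModelForVision2Seq

device = "cuda" if torch.cuda.is_available() else "cpu"
model_id = "HuggingFaceTB/SmolVLM-256M-Instruct"  # assumed checkpoint id

processor = AutoProcessor.from_pretrained(model_id)
model = AutoModelForVision2Seq.from_pretrained(
    model_id, torch_dtype=torch.bfloat16
).to(device)

# One image + one question, formatted with the model's chat template
image = Image.open("photo.jpg")  # any local image
messages = [{"role": "user", "content": [
    {"type": "image"},
    {"type": "text", "text": "Describe this image briefly."},
]}]
prompt = processor.apply_chat_template(messages, add_generation_prompt=True)
inputs = processor(text=prompt, images=[image], return_tensors="pt").to(device)

generated = model.generate(**inputs, max_new_tokens=100)
print(processor.batch_decode(generated, skip_special_tokens=True)[0])
```

At 256M parameters this runs comfortably on CPU, which is the whole point of the release.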
LLMs
- DeepSeek-R1 & DeepSeek-R1-Zero: gigantic 671B reasoning models by DeepSeek, on par with o1, plus six distilled dense models, all with an MIT license! 🤯 (see the sketch after this list)
- Qwen2.5-Math-PRM: new math process reward models by Qwen in 7B and 72B
- NVIDIA released AceMath and AceInstruct, new model families along with their datasets (SFT and reward ones too!)
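The distilled R1 variants are plain dense checkpoints, so they should work with the standard transformers text-generation pipeline. A minimal sketch, assuming the smallest distilled checkpoint id from the release naming:

```python
from transformers import pipeline

# Smallest distilled R1 variant; checkpoint id assumed from the release naming
generator = pipeline(
    "text-generation",
    model="deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B",
    torch_dtype="auto",
    device_map="auto",
)

# R1-style models emit their chain of thought before the final answer,
# so leave plenty of room in max_new_tokens
messages = [{"role": "user", "content": "What is 17 * 24? Think step by step."}]
output = generator(messages, max_new_tokens=512)
print(output[0]["generated_text"][-1]["content"])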
Audio 🗣️
- Llasa is a new speech synthesis model family based on Llama that comes in 1B, 3B, and 8B
- TangoFlux is a new audio generation model trained from scratch and aligned with CRPO (CLAP-Ranked Preference Optimization); a usage sketch follows below
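For TangoFlux, the sketch below assumes the `tangoflux` package from the declare-lab release; the class name and `generate()` signature are taken from its README from memory and may differ, so check the repo before relying on it:

```python
# Sketch assuming the declare-lab `tangoflux` package; API details are assumptions
import torchaudio
from tangoflux import TangoFluxInference

model = TangoFluxInference(name="declare-lab/TangoFlux")

# Text-to-audio: ~10 seconds of sound from a prompt
audio = model.generate("Rain falling on a tin roof", steps=50, duration=10)
torchaudio.save("rain.wav", audio, sample_rate=44100)
```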
Image/Video/3D Generation ⏯️
- Flex.1-alpha is a new 8B pre-trained diffusion model by ostris, similar to Flux (see the diffusers sketch below)
- Tencent released Hunyuan3D-2, a new model for 3D asset generation from images
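Since Flex.1-alpha reuses the Flux architecture, the diffusers `FluxPipeline` should be able to load it. The checkpoint id and loadability are assumptions here, not something confirmed by the release notes:

```python
import torch
from diffusers import FluxPipeline

# Flex.1-alpha is Flux-shaped, so FluxPipeline is assumed to load it
pipe = FluxPipeline.from_pretrained("ostris/Flex.1-alpha", torch_dtype=torch.bfloat16)
pipe.enable_model_cpu_offload()  # trade speed for lower VRAM usage

image = pipe(
    "a tiny robot reading a newspaper, watercolor",
    guidance_scale=3.5,
    num_inference_steps=28,
    height=768,
    width=768,
).images[0]
image.save("robot.png")
```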