Alvin Li's picture

Alvin Li

alvanlii

AI & ML interests

None yet

Recent Activity

Organizations

Demo Crafters 🤗 's profile picture Hugging Face for Computer Vision's profile picture MLX Community's profile picture hon9kon9ize's profile picture Chinese LLMs on Hugging Face's profile picture A++Geese's profile picture Upcyle VLM @ C4AI Community's profile picture

alvanlii's activity

New activity in alvanlii/whisper-small-cantonese 28 days ago

`use_cache=False`

2
#11 opened 29 days ago by
jemoka
reacted to AdinaY's post with 🔥 about 1 month ago
view post
Post
4023
Exciting releases from the Chinese community this February🔥
👉 https://huggingface.co/collections/zh-ai-community/2025-february-67a35aaa68e97812def5b6ef

MLLM:
✨ Ovis2 by Alibaba
AIDC-AI/ovis2-67ab36c7e497429034874464
✨ Step Audio Chat by StepFun AI
stepfun-ai/step-audio-67b33accf45735bb21131b0b

Audio:
✨ Step Audio TTS by StepFunAI
stepfun-ai/Step-Audio-TTS-3B
✨ InspireMusic by Alibaba
FunAudioLLM
✨ Baichuan Audio by BaichuanAI
baichuan-inc/Baichuan-Audio-Instruct

Video:
✨ Wan2.1 by Alibaba_Wan
Wan-AI/Wan2.1-T2V-14B
✨ Stepvideo-T2V by StepFun AI
stepfun-ai/stepvideo-t2v
✨ SkyReels-V1 by Skywork
Skywork/skyreels-v1-67b34676ff65b4ec02d16307
✨ LLaDA-8B by RenminUniversity
GSAI-ML/LLaDA-8B-Instruct

MoE:
✨ Moonlight-16B by MoonshotAI (Kimi)
moonshotai/Moonlight-16B-A3B-Instruct

Reasoning:
✨ TinyR1-32B by Qihoo360
qihoo360/TinyR1-32B-Preview

Dataset:
✨ Chinese DeepSeek R1-Distill data -110k
Congliu/Chinese-DeepSeek-R1-Distill-data-110k
reacted to alibabasglab's post with 👍 3 months ago
view post
Post
5309
🎉 ClearerVoice-Studio New Feature: Speech Super-Resolution with MossFormer2 ! 🚀
We’re excited to announce that ClearerVoice-Studio now supports speech super-resolution, powered by our latest MossFormer2-based model!
What’s New?

🔊 Convert Low-Resolution to High-Resolution Audio:
Transform low-resolution audio (effective sampling rate ≥ 16 kHz) into crystal-clear, high-resolution audio at 48 kHz.

🤖 Cutting-Edge Technology:
Leverages the MossFormer2 model plus HiFi-GAN, optimised for generating high-quality audio with enhanced perceptual clarity.

🎧 Enhanced Listening Experience:
Perfect for speech enhancement, content restoration, and high-fidelity audio applications.

🌟 Try It Out!
Upgrade to the latest version of ClearerVoice-Studio (https://github.com/modelscope/ClearerVoice-Studio) to experience this powerful feature. Check out the updated documentation and examples in our repository.

Let us know your thoughts, feedback, or feature requests in the Issues section.
reacted to not-lain's post with 🔥 3 months ago
view post
Post
4048
Published a new blogpost 📖
In this blogpost I have gone through the transformers' architecture emphasizing how shapes propagate throughout each layer.
🔗 https://huggingface.co/blog/not-lain/tensor-dims
some interesting takeaways :
reacted to burtenshaw's post with ❤️ 3 months ago
view post
Post
3052
People are flexing their end of year stats, so I made this app to show hub stats in a tidy design!

Thanks @Ameeeee and @jfcalvo for the feature from Argilla!
burtenshaw/recap
  • 1 reply
·
New activity in open-acc/README 4 months ago