-
Reinforcement Learning: An Overview
Paper • 2412.05265 • Published • 7 -
Fish-Speech: Leveraging Large Language Models for Advanced Multilingual Text-to-Speech Synthesis
Paper • 2411.01156 • Published • 6 -
VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness
Paper • 2503.21755 • Published • 31 -
Qwen2.5-Omni Technical Report
Paper • 2503.20215 • Published • 134
LI
RogerZhuo
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
2 days ago
DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning
liked
a model
2 days ago
OuteAI/Llama-OuteTTS-1.0-1B
upvoted
a
paper
3 days ago
Kimi-VL Technical Report
Organizations
Collections
9
-
ElectricAlexis/NotaGen
Updated • 135 -
ASLP-lab/LLaSE-G1
Audio-to-Audio • Updated • 20 -
544
Di♪♪Rhythm
🎶Blazingly Fast and Embarrassingly Simple Song Generation
-
DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion
Paper • 2503.01183 • Published • 26
models
None public yet
datasets
None public yet