Step-Audio model family, including Audio-Tokenizer, Audio-Chat and TTS

StepFun
company
Organization Card
Welcome to StepFun
StepFun, founded in April 2023 with the mission to “Scale-up possibilities for everyone,” unites top talent in artificial intelligence from both domestic and international backgrounds, and is dedicated to advancing toward AGI. The company has already launched the Step series of foundation models, which includes Step-2, a cutting-edge trillion-parameter Mixture of Experts (MoE) language model; Step-1.5V, a powerful multimodal large model; and Step-1V, an innovative image generation model, among others.
Collections: 1
Spaces: 2
Models: 8

stepfun-ai/stepvideo-ti2v • Image-to-Video • 7 • 11

stepfun-ai/stepvideo-t2v • Text-to-Video • 1.91k • 417

stepfun-ai/Step-Audio-Tokenizer • 33

stepfun-ai/Step-Audio-Chat • Audio-Text-to-Text • 1.26k • 429

stepfun-ai/Step-Audio-TTS-3B • Text-to-Speech • 2.14k • 171

stepfun-ai/stepvideo-t2v-turbo • 85

stepfun-ai/GOT-OCR2_0 • Image-Text-to-Text • 70.5k • 1.42k

stepfun-ai/GOT-OCR-2.0-hf • Image-Text-to-Text • 187k • 174