
bytedance-research/UI-TARS-7B-DPO
Image-Text-to-Text
โข
Updated
โข
19k
โข
156
Generate high-quality edited images from portrait prompts and IDs
Gradio demo of CogView4-6B
Blazingly Fast and Embarrassingly Simple Song Generation
Translate text between English and other languages
Interact with AI using text, images, or audio
Play with all the pix2struct variants in this d
Generate text or segment objects from an image