
zhao

poonyZ

AI & ML interests

None yet

Recent Activity

updated a collection 3 days ago
video LM
updated a collection 3 days ago
omni
updated a collection 3 days ago
video LM

Organizations

None yet

Collections 10

omni
  • VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

    Paper • 2501.01957 • Published 9 days ago • 34
  • From Pixels to Tokens: Byte-Pair Encoding on Quantized Visual Modalities

    Paper • 2410.02155 • Published Oct 3, 2024 • 2
  • OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis

    Paper • 2501.04561 • Published 4 days ago • 15
T2I
  • Parallelized Autoregressive Visual Generation

    Paper • 2412.15119 • Published 24 days ago • 50
  • CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up

    Paper • 2412.16112 • Published 23 days ago • 21
  • 1.58-bit FLUX

    Paper • 2412.18653 • Published 19 days ago • 69

Models

None public yet

Datasets

None public yet