ys-st

classroom

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

stzhao authored a paper 7 days ago

OmniCaptioner: One Captioner to Rule Them All

stzhao authored a paper 21 days ago

LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis

stzhao authored a paper 28 days ago

Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models

View all activity

ys-st's activity

stzhao

authored a paper 7 days ago

OmniCaptioner: One Captioner to Rule Them All

Paper • 2504.07089 • Published 9 days ago • 17

stzhao

authored a paper 21 days ago

LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis

Paper • 2503.21749 • Published 22 days ago • 25

stzhao

authored 2 papers 28 days ago

Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models

Paper • 2503.13939 • Published Mar 18 • 4

CLS-RL: Image Classification with Rule-Based Reinforcement Learning

Paper • 2503.16188 • Published 29 days ago • 9

stzhao

authored a paper 3 months ago

IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models

Paper • 2501.13920 • Published Jan 23 • 17

stzhao

authored 3 papers 6 months ago

Unleashing the Potentials of Likelihood Composition for Multi-modal Language Models

Paper • 2410.00363 • Published Oct 1, 2024 • 1

Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models

Paper • 2312.06685 • Published Dec 9, 2023 • 1

Boosting Open-Domain Continual Learning via Leveraging Intra-domain Category-aware Prototype

Paper • 2408.09984 • Published Aug 19, 2024 • 1

stzhao

authored a paper 7 months ago

PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions

Paper • 2409.15278 • Published Sep 23, 2024 • 26

stzhao

authored a paper 8 months ago

Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining

Paper • 2408.02657 • Published Aug 5, 2024 • 36

stzhao

authored a paper about 1 year ago

SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models

Paper • 2402.05935 • Published Feb 8, 2024 • 17

AI & ML interests

Recent Activity

Team members 2

ys-st's activity