Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement Paper • 2411.06558 • Published Nov 10 • 34
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model Paper • 2305.11176 • Published May 18, 2023
You Only Need 90K Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure Correction Paper • 2205.14871 • Published May 30, 2022
Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers Paper • 2405.05945 • Published May 9 • 2
Rethinking Mobile Block for Efficient Attention-based Models Paper • 2301.01146 • Published Jan 3, 2023
VividPose: Advancing Stable Video Diffusion for Realistic Human Image Animation Paper • 2405.18156 • Published May 28
ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models Paper • 2403.11289 • Published Mar 17
OSV: One Step is Enough for High-Quality Image to Video Generation Paper • 2409.11367 • Published Sep 17 • 13
SKT: Integrating State-Aware Keypoint Trajectories with Vision-Language Models for Robotic Garment Manipulation Paper • 2409.18082 • Published Sep 26
UniAff: A Unified Representation of Affordances for Tool Usage and Articulation with Vision-Language Models Paper • 2409.20551 • Published Sep 30 • 13
OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation Paper • 2407.02371 • Published Jul 2 • 51