Doğuş Can Korkmaz

doguscank

AI & ML interests

Vision, LLMs, vLLMs, semantic segmentation, forecasting

Recent Activity

updated a collection 3 days ago

to read

updated a collection 3 days ago

integration

Organizations

None yet

doguscank's activity

upvoted a paper 10 days ago

VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control

Paper • 2501.01427 • Published 10 days ago • 46

upvoted a paper 20 days ago

Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Paper • 2412.15322 • Published 24 days ago • 18

upvoted a paper 21 days ago

Parallelized Autoregressive Visual Generation

Paper • 2412.15119 • Published 24 days ago • 50

upvoted a paper 27 days ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published about 1 month ago • 137

upvoted 2 papers 28 days ago

FluxSpace: Disentangled Semantic Editing in Rectified Flow Transformers

Paper • 2412.09611 • Published Dec 12, 2024 • 9

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published Dec 12, 2024 • 88

upvoted 7 papers about 1 month ago

LoRA.rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation

Paper • 2412.05148 • Published Dec 6, 2024 • 11

Video Motion Transfer with Diffusion Transformers

Paper • 2412.07776 • Published Dec 10, 2024 • 17

UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics

Paper • 2412.07774 • Published Dec 10, 2024 • 26

Mind the Time: Temporally-Controlled Multi-Event Video Generation

Paper • 2412.05263 • Published Dec 6, 2024 • 10

upvoted a collection about 1 month ago

AIMv2

Collection

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 69

upvoted 6 papers about 2 months ago

FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity

Paper • 2411.15411 • Published Nov 23, 2024 • 7

SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE

Paper • 2411.16856 • Published Nov 25, 2024 • 11

TEXGen: a Generative Diffusion Model for Mesh Textures

Paper • 2411.14740 • Published Nov 22, 2024 • 15

Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator

Paper • 2411.15466 • Published Nov 23, 2024 • 35

Material Anything: Generating Materials for Any 3D Object via Diffusion

Paper • 2411.15138 • Published Nov 22, 2024 • 42

One Diffusion to Generate Them All

Paper • 2411.16318 • Published Nov 25, 2024 • 26