The Chinese University of Hong Kong

university

Activity Feed Request to join this org

AI & ML interests

Artificial Intelligence

CUHK-CSE's activity

zongzhuofan

authored 2 papers 3 months ago

VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping

Paper • 2412.11279 • Published Dec 15, 2024 • 12

EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM

Paper • 2412.09618 • Published Dec 12, 2024 • 21

HaoyuHuang2

authored a paper 3 months ago

Can LLMs be Good Graph Judger for Knowledge Graph Construction?

Paper • 2411.17388 • Published Nov 26, 2024

kqsun

authored a paper 4 months ago

JourneyDB: A Benchmark for Generative Image Understanding

Paper • 2307.00716 • Published Jul 3, 2023 • 19

Siheng99

authored 4 papers 5 months ago

Question Answering as Programming for Solving Time-Sensitive Questions

Paper • 2305.14221 • Published May 23, 2023

AutoConv: Automatically Generating Information-seeking Conversations with Large Language Models

Paper • 2308.06507 • Published Aug 12, 2023 • 1

Unchosen Experts Can Contribute Too: Unleashing MoE Models' Power by Self-Contrast

Paper • 2405.14507 • Published May 23, 2024

A Survey on the Honesty of Large Language Models

Paper • 2409.18786 • Published Sep 27, 2024 • 32

kqsun

authored a paper 7 months ago

GenCA: A Text-conditioned Generative Model for Realistic and Drivable Codec Avatars

Paper • 2408.13674 • Published Aug 24, 2024 • 18

zongzhuofan

authored 10 papers 9 months ago

RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection

Paper • 2110.12130 • Published Oct 23, 2021

MoVA: Adapting Mixture of Vision Experts to Multimodal Context

Paper • 2404.13046 • Published Apr 19, 2024 • 1

Large-batch Optimization for Dense Visual Predictions

Paper • 2210.11078 • Published Oct 20, 2022

Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models

Paper • 2406.11831 • Published Jun 17, 2024 • 22

CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching

Paper • 2404.03653 • Published Apr 4, 2024 • 36

Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models

Paper • 2403.16999 • Published Mar 25, 2024 • 4

Self-slimmed Vision Transformer

Paper • 2111.12624 • Published Nov 24, 2021 • 1

DETRs with Collaborative Hybrid Assignments Training

Paper • 2211.12860 • Published Nov 22, 2022

Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction

Paper • 2304.00967 • Published Apr 3, 2023

RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths

Paper • 2305.18295 • Published May 29, 2023 • 7

EstherCheng

authored a paper 10 months ago

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series

Paper • 2405.19327 • Published May 29, 2024 • 49