Kai-Wei Chang

kaiweichang

http://kwchang.net

AI & ML interests

Natural Language Processing, Algorithmic Fairness, Multi-modal models

kaiweichang's activity

authored a paper 13 days ago

When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoning

Paper • 2504.01005 • Published 13 days ago • 15

authored a paper 22 days ago

OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement

Paper • 2503.17352 • Published 24 days ago • 22

authored a paper 4 months ago

STIV: Scalable Text and Image Conditioned Video Generation

Paper • 2412.07730 • Published Dec 10, 2024 • 74

authored 2 papers 6 months ago

LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory

Paper • 2410.10813 • Published Oct 14, 2024 • 11

Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models

Paper • 2410.05269 • Published Oct 7, 2024 • 3

liked a Space 10 months ago

Desco

📚

authored a paper 10 months ago

MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding

Paper • 2406.09411 • Published Jun 13, 2024 • 20

authored a paper about 1 year ago

MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?

Paper • 2403.14624 • Published Mar 21, 2024 • 53

published an article about 1 year ago

Introducing ConTextual: How well can your Multimodal model jointly reason over text and image in text-rich scenes?

Article • and 4 others • Mar 5, 2024 • 4

authored a paper about 1 year ago

ConTextual: Evaluating Context-Sensitive Text-Rich Visual Reasoning in Large Multimodal Models

Paper • 2401.13311 • Published Jan 24, 2024 • 11

authored 2 papers over 1 year ago

TrustLLM: Trustworthiness in Large Language Models

Paper • 2401.05561 • Published Jan 10, 2024 • 70

VideoCon: Robust Video-Language Alignment via Contrast Captions

Paper • 2311.10111 • Published Nov 15, 2023 • 9

upvoted a paper over 1 year ago

Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs

Paper • 2311.05657 • Published Nov 9, 2023 • 32

authored 2 papers over 1 year ago

Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs

Paper • 2311.05657 • Published Nov 9, 2023 • 32

FLIRT: Feedback Loop In-context Red Teaming

Paper • 2308.04265 • Published Aug 8, 2023 • 13

upvoted 2 papers over 1 year ago

FLIRT: Feedback Loop In-context Red Teaming

Paper • 2308.04265 • Published Aug 8, 2023 • 13

RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment

Paper • 2307.12950 • Published Jul 24, 2023 • 10

authored a paper over 1 year ago

Grounded Language-Image Pre-training

Paper • 2112.03857 • Published Dec 7, 2021 • 3

upvoted 2 papers over 1 year ago

Grounded Language-Image Pre-training

Paper • 2112.03857 • Published Dec 7, 2021 • 3

DesCo: Learning Object Recognition with Rich Language Descriptions

Paper • 2306.14060 • Published Jun 24, 2023 • 1