Running 36 36 Chat with Kimi-VL (Image, Agent, Video, PDF) ๐ Chat with Kimi-VL-A3B-Instruct using text, images, and videos
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation? Paper โข 2407.04842 โข Published Jul 5, 2024 โข 57