MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?
Paper • 2407.04842
Note: Basis for discussion of NSFW content and biases in AI.
Note: Critique of why some benchmarks are biased when comparing LLMs and MLLMs. New benchmark that fixes those errors.
Note: New SOTA, better than Llama!