RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness Paper • 2405.17220 • Published May 27, 2024 • 1
SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding Paper • 2308.10529 • Published Aug 21, 2023 • 1
RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback Paper • 2312.00849 • Published Dec 1, 2023 • 8