
ACIDE/User-VLM-3B-base
Image-Text-to-Text
•
Updated
•
666
A series of Personalized Vision Language Models for Social Human-Robot Interactions
Note Base Models
Note Instruction-finetuned Models
Note DPO finetuned Models