SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28 • 108
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper • 2408.08872 • Published Aug 16, 2024 • 99
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases Paper • 2407.12784 • Published Jul 17, 2024 • 49