FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion Paper • 2412.09626 • Published 13 days ago • 19
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding Paper • 2408.15545 • Published Aug 28 • 34
Leveraging Multimodal Features and Item-level User Feedback for Bundle Construction Paper • 2310.18770 • Published Oct 28, 2023
Discovering Spatio-Temporal Rationales for Video Question Answering Paper • 2307.12058 • Published Jul 22, 2023
EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models Paper • 2410.07133 • Published Oct 9 • 18
DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning Paper • 2407.04078 • Published Jul 4 • 17
MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter Paper • 2310.12798 • Published Oct 19, 2023 • 4
Uni-SMART: Universal Science Multimodal Analysis and Research Transformer Paper • 2403.10301 • Published Mar 15 • 52