FlexTok: Resampling Images into 1D Token Sequences of Flexible Length Paper • 2502.13967 • Published Feb 19 • 1
4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities Paper • 2406.09406 • Published Jun 13, 2024 • 15