Packing Input Frame Context in Next-Frame Prediction Models for Video Generation Paper • 2504.12626 • Published 5 days ago • 45
Hogwild! Inference: Parallel LLM Generation via Concurrent Attention Paper • 2504.06261 • Published 14 days ago • 103
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published 15 days ago • 167
Inference-Time Scaling for Generalist Reward Modeling Paper • 2504.02495 • Published 19 days ago • 53
Wan: Open and Advanced Large-Scale Video Generative Models Paper • 2503.20314 • Published 27 days ago • 49
Training-free Diffusion Acceleration with Bottleneck Sampling Paper • 2503.18940 • Published 29 days ago • 12
InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity Paper • 2503.16418 • Published Mar 20 • 35
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published Mar 20 • 47
One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation Paper • 2503.13358 • Published Mar 17 • 96
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Paper • 2503.09573 • Published Mar 12 • 69
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs Paper • 2503.01743 • Published Mar 3 • 84
How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM? Paper • 2502.14502 • Published Feb 20 • 90