meta-llama/Llama-4-Scout-17B-16E-Instruct Image-Text-to-Text • Updated 2 days ago • 338k • 731
FlowTok: Flowing Seamlessly Across Text and Image Tokens Paper • 2503.10772 • Published 29 days ago • 18
Learning Few-Step Diffusion Models by Trajectory Distribution Matching Paper • 2503.06674 • Published Mar 9 • 7
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Paper • 2503.09573 • Published about 1 month ago • 68
The Ultra-Scale Playbook 🌌 The ultimate guide to training LLMs on large GPU clusters • Running • 2.44k
Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think Paper • 2503.00948 • Published Mar 2 • 3
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment Paper • 2502.16894 • Published Feb 24 • 28