Law of the Weakest Link: Cross Capabilities of Large Language Models Paper • 2409.19951 • Published 6 days ago • 48
SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration Paper • 2410.02367 • Published 3 days ago • 12
RATIONALYST: Pre-training Process-Supervision for Improving Reasoning Paper • 2410.01044 • Published 4 days ago • 34
VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment Paper • 2410.01679 • Published 4 days ago • 17
view article Article wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR?? By catherinearnett • 9 days ago • 31
High-Resolution Image Synthesis with Latent Diffusion Models Paper • 2112.10752 • Published Dec 20, 2021 • 11
MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing Paper • 2306.10012 • Published Jun 16, 2023 • 35
AudioPaLM: A Large Language Model That Can Speak and Listen Paper • 2306.12925 • Published Jun 22, 2023 • 52