Running 2.15k 2.15k The Ultra-Scale Playbook π The ultimate guide to training LLM on large GPU Clusters
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper β’ 2412.13663 β’ Published Dec 18, 2024 β’ 134
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models Paper β’ 2409.17146 β’ Published Sep 25, 2024 β’ 108
Running on Zero 469 469 Florence2 + SAM2 π₯ Segment objects in images and videos using text prompts
StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images Paper β’ 2406.13735 β’ Published Jun 19, 2024 β’ 5
The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing Paper β’ 2406.10601 β’ Published Jun 15, 2024 β’ 66
Running on Zero 743 743 Florence 2 π Analyze images to generate captions, detect objects, or perform OCR