distributed/optimized-gpt2-2b-without-stable-embeddings • Text Generation • 2B • Updated Dec 24, 2024 • 8
distributed/optimized-gpt2-250m-convergence-test-v1 • Text Generation • 0.3B • Updated Sep 24, 2024 • 17
distributed/optimized-gpt2-250m-convergence-test-v2 • Text Generation • 0.3B • Updated Sep 24, 2024 • 6 • 1