test loss 2.563950 on crumb/flan-ul2-tinystories-complex, initialized from crumb/opentinystories-30m-base, 2 epochs, linear decreasing lr 1e-4. trained with double the batch size (256)

Downloads last month
65
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Datasets used to train crumb/opentinystories-30m-complex