Gerbil-A-3.3m / README.md
license: apache-2.0
| Model Name | Parameters | Class | Ratio | Tokens | Batch Size (Tokens) | Training Loss |
| --- | --- | --- | --- | --- | --- | --- |
| GerbilLab/Gerbil-A-3.3m | 3.3m | A-Class | 20 | 60M | 65.5k | 6.664400 |
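
As a rough sanity check on the figures in the row above, the sketch below derives the tokens-per-parameter value and an approximate step count. It assumes that "Ratio" means training tokens per parameter and that each step consumes one full batch; neither is stated in this card, and the inputs are the rounded values as listed, so the results are approximate.

```python
# Values copied from the row above. How each column is interpreted
# here is an assumption, not something the model card states.
params = 3.3e6         # "Parameters": 3.3m
tokens = 60e6          # "Tokens": 60M (rounded)
batch_tokens = 65.5e3  # "Batch Size (Tokens)": 65.5k (rounded)

# Assumed meaning of "Ratio": training tokens per model parameter.
tokens_per_param = tokens / params

# Approximate optimizer steps, assuming every step uses one full batch.
approx_steps = tokens / batch_tokens

print(f"tokens per parameter: {tokens_per_param:.1f}")
print(f"approximate training steps: {approx_steps:.0f}")
```

With these rounded inputs the tokens-per-parameter value comes out near 18 rather than exactly 20, which may simply reflect rounding in the "Tokens" column.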