Gerbil-D-3.3m / README.md
crumb's picture
Create README.md
cae3f07
metadata
license: apache-2.0
Model Name Parameters Class Ratio Tokens Batch Size (Tokens) Training Loss
GerbilLab/Gerbil-D-3.3m 3.3m D-Class 142 426M 65.5k 5.3307