--- license: apache-2.0 --- | Model Name | Parameters | Class | Ratio | Tokens | Batch Size (Tokens) | Training Loss | | --- | --- | --- | --- | --- | --- | --- | | GerbilLab/Gerbil-D-3.3m | 3.3m | D-Class | 142 | 426M | 65.5k | 5.3307 |