baby-dev/44799a41-1833-4011-9f7e-539c7b12ccf8
This model is a fine-tuned version of tokyotech-llm/Llama-3-Swallow-8B-v0.1 on an unspecified dataset. It achieves the following results on the evaluation set:
- Loss: 0.5530
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Framework versions
- PEFT 0.13.2
- Transformers 4.46.0
- Pytorch 2.5.0+cu124
- Datasets 3.0.1
- Tokenizers 0.20.1
Base model
- tokyotech-llm/Llama-3-Swallow-8B-v0.1
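Since the card lists PEFT among the framework versions and names a base model, loading could plausibly look like the sketch below. It assumes (the card does not confirm this) that the repository holds a PEFT adapter, that the base model is a causal language model, and that the tokenizer can be taken from the base model. The helper names `load_adapter_model` and `generate` are hypothetical and used only for illustration.

```python
# Minimal sketch, not an official usage recipe.
# Assumption: the repo contains a PEFT adapter for the listed base model.
BASE_MODEL_ID = "tokyotech-llm/Llama-3-Swallow-8B-v0.1"
ADAPTER_ID = "baby-dev/44799a41-1833-4011-9f7e-539c7b12ccf8"


def load_adapter_model(base_id=BASE_MODEL_ID, adapter_id=ADAPTER_ID):
    """Load the base model and attach the adapter weights (hypothetical helper)."""
    # Imported lazily so the heavy dependencies are only needed when loading.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Assumption: the base model's tokenizer is the right one for the adapter.
    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype="auto")
    model = PeftModel.from_pretrained(base, adapter_id)
    model.eval()
    return tokenizer, model


def generate(prompt, max_new_tokens=64):
    """Generate a continuation for `prompt` (hypothetical helper)."""
    import torch

    tokenizer, model = load_adapter_model()
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("...")` downloads the 8B base model, so it needs substantial disk space and memory. Because the card gives no intended uses or evaluation details, outputs should be treated as untested.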