|
--- |
|
library_name: peft |
|
base_model: NousResearch/Meta-Llama-3-8B-Instruct |
|
license: mit |
|
datasets: |
|
- sardukar/physiology-mcqa-8k |
|
language: |
|
- en |
|
--- |
|
|
|
# Model Card for Model ID |
|
|
|
<!-- Provide a quick summary of what the model is/does. --> |
|
This model is a 1 epoch training with ORPO Trainer on the [sardukar/physiology-mcqa-8k](https://huggingface.co/datasets/sardukar/physiology-mcqa-8k) dataset |
|
|
|
Base model is NousResearch/Meta-Llama-3-8B-Instruct |
|
|
|
**Training results** |
|
![train_results](physiology-8k-rtx3060-train-complete.png) |