Model Details

Model and Training Details

Preprocessing

  • preprocessed and packed the sft dataset with trl.trainer.ConstantLengthDataset

Results

image/png

Compute Infrastructure

The model is trained using 4 * RTX 3090 - 24GB

Model Card Authors

Yiyu (Michael) Ren

Model Card Contact

Email: [email protected]

Framework versions

  • PEFT 0.8.2
Downloads last month
0
Inference API
Unable to determine this model’s pipeline type. Check the docs .

Model tree for renyiyu/llama-2-7b-sft-lora

Adapter
(1760)
this model