---
library_name: transformers
license: mit
datasets:
- HuggingFaceH4/ultrafeedback_binarized
language:
- en
---
This model was released with the preprint *[Bootstrapping Language Models with DPO Implicit Rewards](https://arxiv.org/abs/2406.09760)*. Please refer to our [repository](https://github.com/sail-sg/dice) for more details.