martimfasantos
/

tinyllama-1.1b-mt-dpo-full_LR5e-7_BS32_rmsprop_3epochs_test

Text Generation

alignment-handbook

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

Resources

View closed (0)

Welcome to the community

The community tab is the place to discuss and collaborate with the HF community!