Portuguese LLM Leaderboard best models ❤️‍🔥 Collection A daily uploaded list of models with best evaluations on the PT-LLM leaderboard: • 17 items • Updated 44 minutes ago • 31
Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper • 2305.18290 • Published May 29, 2023 • 56
Zephyr 7B Collection Models, datasets, and demos associated with Zephyr 7B. For code to train the models, see: https://github.com/huggingface/alignment-handbook • 9 items • Updated Apr 12, 2024 • 149