Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models • Paper • 2501.13629 • Published 18 days ago • 43
Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence • Paper • 2406.10957 • Published Jun 16, 2024 • 1
Calibrating LLMs with Preference Optimization on Thought Trees for Generating Rationale in Science Question Scoring • Paper • 2406.19949 • Published Jun 28, 2024 • 1
AERA • Collection • Resources for EMNLP 2023 Paper: Distilling ChatGPT for Explainable Automated Student Answer Assessment • 3 items • Updated Oct 14, 2024 • 1
MCTS with Preference Optimisation • Collection • Resources for EMNLP 2024 Paper: Calibrating LLMs with Preference Optimization on Thought Trees for Generating Rationale in Science Question Scoring • 8 items • Updated Oct 14, 2024 • 2
SamPO • Collection • Resources for EMNLP 2024 Paper: Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence • 4 items • Updated Oct 14, 2024 • 2