Benchmarking Cognitive Biases in Large Language Models as Evaluators Paper • 2309.17012 • Published Sep 29, 2023
Under the Surface: Tracking the Artifactuality of LLM-Generated Data Paper • 2401.14698 • Published Jan 26, 2024
A Dataset of Peer Reviews (PeerRead): Collection, Insights and NLP Applications Paper • 1804.09635 • Published Apr 25, 2018
Read, Revise, Repeat: A System Demonstration for Human-in-the-loop Iterative Text Revision Paper • 2204.03685 • Published Apr 7, 2022
CoEdIT: Text Editing by Task-Specific Instruction Tuning Paper • 2305.09857 • Published May 17, 2023
Prefer to Classify: Improving Text Classifiers via Auxiliary Preference Learning Paper • 2306.04925 • Published Jun 8, 2023
Improving Iterative Text Revision by Learning Where to Edit from Other Revision Tasks Paper • 2212.01350 • Published Dec 2, 2022