VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation Paper • 2412.21059 • Published Dec 30, 2024 • 19
Black-Box Prompt Optimization: Aligning Large Language Models without Model Training Paper • 2311.04155 • Published Nov 7, 2023 • 1
CritiqueLLM: Scaling LLM-as-Critic for Effective and Explainable Evaluation of Large Language Model Generation Paper • 2311.18702 • Published Nov 30, 2023
AlignBench: Benchmarking Chinese Alignment of Large Language Models Paper • 2311.18743 • Published Nov 30, 2023 • 1
On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark Paper • 2110.08466 • Published Oct 16, 2021
PAL: Persona-Augmented Emotional Support Conversation Generation Paper • 2212.09235 • Published Dec 19, 2022
Recent Advances towards Safe, Responsible, and Moral Dialogue Systems: A Survey Paper • 2302.09270 • Published Feb 18, 2023
LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models Paper • 2408.15778 • Published Aug 28, 2024
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models Paper • 2412.11605 • Published Dec 16, 2024 • 18