Crowd Comparative Reasoning: Unlocking Comprehensive Evaluations for LLM-as-a-Judge Paper • 2502.12501 • Published 19 days ago • 6 • 2
NILE: Internal Consistency Alignment in Large Language Models Paper • 2412.16686 • Published Dec 21, 2024 • 8 • 2
RevisEval: Improving LLM-as-a-Judge via Response-Adapted References Paper • 2410.05193 • Published Oct 7, 2024 • 13 • 3
RevisEval: Improving LLM-as-a-Judge via Response-Adapted References Paper • 2410.05193 • Published Oct 7, 2024 • 13 • 3