MT Sentinel Metrics Collection Machine Translation (MT) metrics designed explicitly to scrutinize the MT meta-evaluation process’s accuracy, robustness, and fairness. • 7 items • Updated Dec 4, 2024 • 7
✍️ QE4PE & GroTE Collection Materials for "QE4PE: Word-level Quality Estimation for Human Post-Editing" • 3 items • Updated Mar 6 • 1
COMET-early-exit Collection Models introduced in the paper Early-Exit and Instant Confidence Translation Quality Estimation https://github.com/zouharvi/COMET-early-exit • 4 items • Updated Feb 21 • 2
Early-Exit and Instant Confidence Translation Quality Estimation Paper • 2502.14429 • Published Feb 20 • 4
PreCOMET Collection COMET-like models for MT evaluation that predict some scores given only the source segment. https://github.com/zouharvi/subset2evaluate • 8 items • Updated Feb 25 • 2
How to Select Datapoints for Efficient Human Evaluation of NLG Models? Paper • 2501.18251 • Published Jan 30 • 2