update metrics
app.py
CHANGED
@@ -22,8 +22,8 @@ The evaluation is conducted on 8 datasets across 4 tasks:
 - TyDiQA (Thai only), contains 763 test samples, https://huggingface.co/datasets/chompk/tydiqa-goldp-th
 ## Metrics
 The evaluation metrics for each task are as follows:
-1. STS: Spearman’s
-2. Text Classification: F1
+1. STS: Spearman’s Rank Correlation
+2. Text Classification: F1 Score
 3. Pair Classification: Average Precision
 3. Retrieval: MMR@10
 """
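For reference, the four metrics named in the changed docstring can be sketched in plain Python. This is a minimal illustration and not the code in app.py: every function name here is hypothetical, F1 is shown in its binary form because the docstring does not say which averaging is used, and "MMR@10" is assumed to mean mean reciprocal rank at 10 (usually abbreviated MRR@10).

```python
def _average_ranks(values):
    """Rank values from 1..n; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman's rank correlation: Pearson correlation of the two rank vectors."""
    rx, ry = _average_ranks(x), _average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


def f1_binary(y_true, y_pred):
    """Binary F1 (assumption: the app may use macro or micro averaging instead)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def average_precision(y_true, scores):
    """Mean of precision@rank over the ranks where a positive pair appears."""
    ranked = sorted(zip(scores, y_true), key=lambda pair: pair[0], reverse=True)
    hits, total = 0, 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            hits += 1
            total += hits / rank
    return total / hits if hits else 0.0


def mrr_at_k(rankings, relevant_sets, k=10):
    """Mean reciprocal rank of the first relevant document within the top k."""
    total = 0.0
    for ranking, relevant in zip(rankings, relevant_sets):
        for rank, doc_id in enumerate(ranking[:k], start=1):
            if doc_id in relevant:
                total += 1.0 / rank
                break
    return total / len(rankings)
```

A query whose first relevant document sits at rank 2 contributes 0.5 to `mrr_at_k`, and a query with no relevant document in the top 10 contributes 0.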