Jimin Huang
commited on
Commit
•
75fb8ba
1
Parent(s):
527b158
feat: modify about.py
Browse files- src/about.py +9 -9
src/about.py
CHANGED
@@ -121,15 +121,15 @@ Our evaluation metrics include, but are not limited to, Accuracy, F1 Score, ROUG
|
|
121 |
- **BigData22**: Acc
|
122 |
- **ACL18**: Acc
|
123 |
- **CIKM18**: Acc
|
124 |
-
- **German**:
|
125 |
-
- **Australian**:
|
126 |
-
- **LendingClub**:
|
127 |
-
- **ccf**:
|
128 |
-
- **ccfraud**:
|
129 |
-
- **polish**:
|
130 |
-
- **taiwan**:
|
131 |
-
- **portoseguro**:
|
132 |
-
- **travelinsurance**:
|
133 |
|
134 |
To ensure a fair and unbiased assessment of the models' true capabilities, all evaluations are conducted in zero-shot settings (0-shots). This approach eliminates any potential advantage from task-specific fine-tuning, providing a clear indication of how well the models can generalize to new tasks.
|
135 |
|
|
|
121 |
- **BigData22**: Acc
|
122 |
- **ACL18**: Acc
|
123 |
- **CIKM18**: Acc
|
124 |
+
- **German**: MCC
|
125 |
+
- **Australian**: MCC
|
126 |
+
- **LendingClub**: MCC
|
127 |
+
- **ccf**: MCC
|
128 |
+
- **ccfraud**: MCC
|
129 |
+
- **polish**: MCC
|
130 |
+
- **taiwan**: MCC
|
131 |
+
- **portoseguro**: MCC
|
132 |
+
- **travelinsurance**: MCC
|
133 |
|
134 |
To ensure a fair and unbiased assessment of the models' true capabilities, all evaluations are conducted in zero-shot settings (0-shots). This approach eliminates any potential advantage from task-specific fine-tuning, providing a clear indication of how well the models can generalize to new tasks.
|
135 |
|