tdirkse-nfi
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -84,7 +84,7 @@ In evaluating the performance of the model, we consider two factors as important
|
|
84 |
* The model may encounter firework categories for which it has seen (real) snippets during training (‘best case’), or snippets for categories that are not present in the train set (‘worst case').
|
85 |
|
86 |
To capture the difference in performance across conditions, we construct a separate test set for the lab snippets and the mock-crime scene snippets.
|
87 |
-
For the lab snippets, we split the test set into two parts: one for which
|
88 |
As the mock-crime scene dataset only consists of 7 classes, we are unable to construct a worst-case test set – so we only report best-case performance for this dataset.
|
89 |
In practice, a drop in performance may of course be expected in the worst-case scenario for (mock-)crime scene snippets.
|
90 |
Overall, we find that the model performs very well for classes that are present in the train set, and that the text filter gives a significant boost if this is not the case.
|
|
|
84 |
* The model may encounter firework categories for which it has seen (real) snippets during training (‘best case’), or snippets for categories that are not present in the train set (‘worst case').
|
85 |
|
86 |
To capture the difference in performance across conditions, we construct a separate test set for the lab snippets and the mock-crime scene snippets.
|
87 |
+
For the lab snippets, we split the test set into two parts: one for which categories are present in the train set (best-case) and a second part for which they are not (worst-case).
|
88 |
As the mock-crime scene dataset only consists of 7 classes, we are unable to construct a worst-case test set – so we only report best-case performance for this dataset.
|
89 |
In practice, a drop in performance may of course be expected in the worst-case scenario for (mock-)crime scene snippets.
|
90 |
Overall, we find that the model performs very well for classes that are present in the train set, and that the text filter gives a significant boost if this is not the case.
|