Spaces:

CZLC
/

BenCzechMark

Running

App Files Files Community

mfajcik commited on Dec 25, 2024

Commit

4d3502a

verified ·

1 Parent(s): 5f42dfe

Update content.py

Browse files

Files changed (1) hide show

content.py +10 -8

content.py CHANGED Viewed

@@ -19,8 +19,9 @@ Here, you can compare models on tasks in the Czech language or submit your own m
 - On the submission page, __you can view your model's results on the leaderboard without publishing them__.
     - The first step is "pre-submission." After this is complete (significance tests may take up to 2 hours), you can choose to submit the results if you wish.
 - NEWS:
-  - 1.10.2024: Find out more about 🇨🇿 BenCzechMark in our [Huggingface blogpost](https://huggingface.co/blog/benczechmark)!
   - 7.11.2024: We acknowledge that one of the Qwen2.5 models correctly predicted our (& Bigbench's) canary string. This confirms the contamination, it was trained on benchmark data. Other [studies](https://arxiv.org/pdf/2409.01790) also suggest the contamination issues of the Qwen family.
 """
 LEADERBOARD_TAB_TITLE_MARKDOWN = """
@@ -131,12 +132,14 @@ The models submitted to leaderboard by the authors were evaluated in following s
 ## Citation
 You can use the following citation for this leaderboard and our upcoming work.
 ```bibtex
-@article{2024benczechmark,
-  title = {{B}en{C}zech{M}ark: A Czech-centric Multitask and Multimetric Benchmark for Language Models with Duel Scoring Mechanism},
-  author = {Martin Fajcik and Martin Docekal and Jan Dolezal and Karel Ondrej and Karel Benes and Jan Kapsa and Michal Hradis and Zuzana Neverilova and Ales Horak and Michal Stefanik and Adam Jirkovsky and David Adamczyk and Jan Hula and Jan Sedivy and Hynek Kydlicek},
-  year = {2024},
-  url = {https://huggingface.co/spaces/CZLC/BenCzechMark}
-  institution = {Brno University of Technology, Masaryk University, Czech Technical University in Prague, Hugging Face},
 }
 ```
@@ -159,7 +162,6 @@ You can use the following citation for this leaderboard and our upcoming work.
     - Adam Jirkovský
     - David Adamczyk
     - Jan Hůla
-    - Jan Šedivý
   - **Hugging Face**
     - Hynek Kydlíček

 - On the submission page, __you can view your model's results on the leaderboard without publishing them__.
     - The first step is "pre-submission." After this is complete (significance tests may take up to 2 hours), you can choose to submit the results if you wish.
 - NEWS:
+  - 23.12.2024: We released [a preprint](http://arxiv.org/abs/2412.17933) detailing our work.
   - 7.11.2024: We acknowledge that one of the Qwen2.5 models correctly predicted our (& Bigbench's) canary string. This confirms the contamination, it was trained on benchmark data. Other [studies](https://arxiv.org/pdf/2409.01790) also suggest the contamination issues of the Qwen family.
+  - 1.10.2024: Find out more about 🇨🇿 BenCzechMark in our [Huggingface blogpost](https://huggingface.co/blog/benczechmark)!
 """
 LEADERBOARD_TAB_TITLE_MARKDOWN = """
 ## Citation
 You can use the following citation for this leaderboard and our upcoming work.
 ```bibtex
+@misc{benczechmark,
+      title={BenCzechMark : A Czech-centric Multitask and Multimetric Benchmark for Large Language Models with Duel Scoring Mechanism},
+      author={Martin Fajcik and Martin Docekal and Jan Dolezal and Karel Ondrej and Karel Beneš and Jan Kapsa and Pavel Smrz and Alexander Polok and Michal Hradis and Zuzana Neverilova and Ales Horak and Radoslav Sabol and Michal Stefanik and Adam Jirkovsky and David Adamczyk and Petr Hyner and Jan Hula and Hynek Kydlicek},
+      year={2024},
+      eprint={2412.17933},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL},
+      url={https://arxiv.org/abs/2412.17933},
 }
 ```
     - Adam Jirkovský
     - David Adamczyk
     - Jan Hůla
   - **Hugging Face**
     - Hynek Kydlíček