Spaces:

LLM360
/

de-arena

Running

yzabc007 commited on Oct 11

Commit

a40f62f

•

1 Parent(s): b84a05a

Update space

Files changed (1) hide show

app.py CHANGED Viewed

@@ -141,7 +141,7 @@ with demo:
         'for large language model (LLM) evaluation across diverse, fine-grained dimensions, '
         'such as mathematics (algebra, geometry, probability), logical reasoning, social reasoning, science (chemistry, physics, biology), or any user-defined dimensions. '
         'The evaluation is decentralized and democratic, with all participating LLMs assessing each other to ensure unbiased and fair results. '
-        'With a 95\% correlation to Chatbot Arena\'s overall rankings, the system is fully transparent and reproducible.'
         '</p>'
         f'<p style="font-size:{INTRODUCTION_TEXT_FONT_SIZE}px;">'
         'We actively invite <b>model developers</b> to participate and expedite their benchmarking efforts '

         'for large language model (LLM) evaluation across diverse, fine-grained dimensions, '
         'such as mathematics (algebra, geometry, probability), logical reasoning, social reasoning, science (chemistry, physics, biology), or any user-defined dimensions. '
         'The evaluation is decentralized and democratic, with all participating LLMs assessing each other to ensure unbiased and fair results. '
+        'With a 95% correlation to Chatbot Arena\'s overall rankings, the system is fully transparent and reproducible.'
         '</p>'
         f'<p style="font-size:{INTRODUCTION_TEXT_FONT_SIZE}px;">'
         'We actively invite <b>model developers</b> to participate and expedite their benchmarking efforts '