kaikaidai committed
Commit f2d7524 · verified · Parent: d31079e

Renamed accordion to "Edit Judge Prompt" & added messaging around "Turbo" models

Files changed (1): app.py (+8 -1)
app.py CHANGED
@@ -386,7 +386,7 @@ with gr.Blocks(theme="default", css=CSS_STYLES) as demo:
     gr.Markdown("<br>")
 
     # Update Evaluator Prompt Accordion
-    with gr.Accordion("📝 Judge Prompt", open=False):
+    with gr.Accordion("📝 Edit Judge Prompt", open=False):
         eval_prompt_editable = gr.TextArea(
             value=DEFAULT_EVAL_PROMPT_EDITABLE,
             label="Evaluation Criteria",
@@ -413,6 +413,13 @@ with gr.Blocks(theme="default", css=CSS_STYLES) as demo:
         datatype=["str", "number", "str", "number", "str", "str", "str"],
     )
 
+    gr.Markdown("""<br>
+    <br>
+    Judge Arena uses Together AI for inference of open-source models. FP8 models are named as -- "Turbo" where the performance of the FP16 reference models is closely matched:
+
+    [*"Together Turbo achieves this performance while maintaining full accuracy compared to Meta's reference implementation across all models. Llama-3.1-405B-Instruct-Turbo matches the accuracy of Meta reference models."*](https://www.together.ai/blog/together-inference-engine-2)
+    """)
+
     # Add change handler for checkbox
     show_preliminary.change(
         fn=refresh_leaderboard,
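
For readers unfamiliar with the Gradio APIs touched here, below is a minimal, self-contained sketch of the same pattern: an accordion titled "Edit Judge Prompt" wrapping an editable criteria box, a Markdown note placed after the leaderboard table, and a checkbox `change` handler that refreshes the table. This is not the app's actual code: the placeholder prompt text, the leaderboard column headers, the checkbox label, and the `refresh_leaderboard` body are assumptions standing in for the parts of app.py not shown in this diff.

```python
import gradio as gr

# Hypothetical stand-in for the real default prompt in app.py (not shown in the diff).
DEFAULT_EVAL_PROMPT_EDITABLE = "Rate the assistant's response for helpfulness and accuracy."


def refresh_leaderboard(show_preliminary):
    # Placeholder: the real app recomputes the leaderboard rows here,
    # optionally including preliminary (low-vote) models.
    return [["example-model", 1000, "+/-50", 120, "Example Org", "MIT", ""]]


with gr.Blocks(theme="default") as demo:
    # Collapsible editor for the judge prompt (renamed by this commit).
    with gr.Accordion("📝 Edit Judge Prompt", open=False):
        eval_prompt_editable = gr.TextArea(
            value=DEFAULT_EVAL_PROMPT_EDITABLE,
            label="Evaluation Criteria",
        )

    # Leaderboard controls and table; headers are illustrative,
    # only the datatypes match the diff.
    show_preliminary = gr.Checkbox(label="Show preliminary results", value=True)
    leaderboard_table = gr.Dataframe(
        headers=["Model", "ELO", "95% CI", "Votes", "Organization", "License", "Notes"],
        datatype=["str", "number", "str", "number", "str", "str", "str"],
    )

    # Note added by this commit about Together AI's FP8 "Turbo" models.
    gr.Markdown(
        "Judge Arena uses Together AI for inference of open-source models; "
        'FP8 models are named "Turbo" and closely match their FP16 reference models.'
    )

    # Re-render the leaderboard whenever the checkbox is toggled.
    show_preliminary.change(
        fn=refresh_leaderboard,
        inputs=[show_preliminary],
        outputs=[leaderboard_table],
    )

if __name__ == "__main__":
    demo.launch()
```

Keeping the prompt editor inside a closed `gr.Accordion` leaves the default view compact while still letting users expand and edit the judge criteria.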