Update app.py
app.py
CHANGED
@@ -64,7 +64,7 @@ with gr.Blocks() as demo:
     In order to reduce the response time on this hardware, `max_new_tokens` has been set to `21` in the text generation pipeline. With this default configuration, it takes approximately `60 seconds` for the response to start being generated, and streamed one word at a time. Use the slider below to increase or decrease the length of the generated text.
     """)
 
-    tokens_slider = gr.Slider(8, 128, value=21, render=
+    tokens_slider = gr.Slider(8, 128, value=21, render=True, label="Maximum new tokens", info="A larger `max_new_tokens` parameter value gives you longer text responses but at the cost of a slower response time.")
 
     chatbot = gr.ChatInterface(
         fn=generate,
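For context, here is a minimal sketch of how a slider like this is typically wired into the chat function. Only the names visible in the diff (`generate`, `tokens_slider`, `chatbot`, `demo`, `gr.Slider`, `gr.ChatInterface`, `gr.Blocks`) come from the source; the model checkpoint, the `additional_inputs` wiring, and the use of `model.generate` with a `TextIteratorStreamer` (standing in for the Space's actual text-generation pipeline) are assumptions for illustration, not the Space's real code.

```python
# Hedged sketch: only the names shown in the diff are taken from the source.
from threading import Thread

import gradio as gr
from transformers import AutoModelForCausalLM, AutoTokenizer, TextIteratorStreamer

model_id = "gpt2"  # placeholder checkpoint; the Space's actual model is not shown in the diff
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)


def generate(message, history, max_new_tokens):
    """Stream a response token by token; max_new_tokens comes from the slider."""
    inputs = tokenizer(message, return_tensors="pt")
    streamer = TextIteratorStreamer(tokenizer, skip_prompt=True, skip_special_tokens=True)
    # Run generation in a background thread so tokens can be yielded as they arrive.
    thread = Thread(
        target=model.generate,
        kwargs=dict(**inputs, streamer=streamer, max_new_tokens=int(max_new_tokens)),
    )
    thread.start()
    partial = ""
    for token_text in streamer:
        partial += token_text
        yield partial  # ChatInterface streams each partial string to the UI


with gr.Blocks() as demo:
    gr.Markdown("""...""")  # the explanatory text shown in the diff above

    # render=True places the slider where it is defined, above the chat widget.
    tokens_slider = gr.Slider(
        8, 128, value=21, render=True,
        label="Maximum new tokens",
        info="A larger `max_new_tokens` parameter value gives you longer text responses but at the cost of a slower response time.",
    )

    chatbot = gr.ChatInterface(
        fn=generate,
        additional_inputs=[tokens_slider],  # assumption: the slider value reaches generate() this way
    )

demo.launch()
```

With `additional_inputs`, the current slider value is passed as the third argument to `generate` on every message, so users can trade response length for latency without restarting the Space.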