Spaces:

sotirios-slv
/

whispering-angle

Sleeping

App Files Files Community

sotirios-slv commited on Jul 11, 2024

Commit

7e182ab

1 Parent(s): 04716fa

Added textbox for unprocessed transcription string

Browse files

Files changed (1) hide show

app.py +7 -9

app.py CHANGED Viewed

@@ -10,10 +10,6 @@ from transformers import (
 )
-# diction_text = """
-# How is this leisure to be disposed of? In the public-house? the singing hall? the dancing-saloon? which hold out seductions somewhat more dangerous, methinks, to honest labor than those presented by a library; or in listless inaction, in weary unoccupied solitude? That cannot be. While man is a social animal society he must have, and better a thousand times that he should seek relief from the tedium of unemployed hours in the improving conversation of worthy authors, dead or living, than in the debasing, brutalising communications from which it is so difficult otherwise to escape.
-# """
 diction_text = """
 How is this leisure to be disposed of? In the public-house? the singing hall? the dancing-saloon?
 which hold out seductions somewhat more dangerous, methinks, to honest labour than those presented by a library...
@@ -26,7 +22,6 @@ device = "cpu"
 torch_dtype = torch.float16 if torch.cuda.is_available() else torch.float32
-# model_id = "openai/whisper-base "
 model_id = "openai/whisper-large-v3"
 description = f"""
@@ -87,17 +82,20 @@ def transcribe_audio(diction_text, audio):
     formatted_score = f"{str(round(score,3))}%"
-    return diff_text, formatted_score
 highlighted_results = gr.HighlightedText(
     label="Diff",
     combine_adjacent=True,
     show_legend=True,
-    color_map={"+": "red", "-": "green"},
 )
-score = gr.Textbox("0%", label="Score")
 input_audio = gr.Audio(
@@ -113,7 +111,7 @@ input_audio = gr.Audio(
 demo = gr.Interface(
     fn=transcribe_audio,
-    inputs=[diction_script, input_audio],
     outputs=[highlighted_results, score],
     title="Redmond Barry-oke",
     description=description,

 )
 diction_text = """
 How is this leisure to be disposed of? In the public-house? the singing hall? the dancing-saloon?
 which hold out seductions somewhat more dangerous, methinks, to honest labour than those presented by a library...
 torch_dtype = torch.float16 if torch.cuda.is_available() else torch.float32
 model_id = "openai/whisper-large-v3"
 description = f"""
     formatted_score = f"{str(round(score,3))}%"
+    return diction_text, diff_text, formatted_score
+transcribed_text = gr.Textbox(label="Transcribed text")
 highlighted_results = gr.HighlightedText(
     label="Diff",
     combine_adjacent=True,
     show_legend=True,
+    color_map={"+": "green", "-": "red"},
 )
+score = gr.Textbox("0%", label="Barry-oke score")
 input_audio = gr.Audio(
 demo = gr.Interface(
     fn=transcribe_audio,
+    inputs=[transcribed_text, diction_script, input_audio],
     outputs=[highlighted_results, score],
     title="Redmond Barry-oke",
     description=description,