hamaadayubkhan committed
Commit ca2de89 • 1 Parent(s): c5d07bf
Update app.py
app.py CHANGED
@@ -78,21 +78,47 @@ def chatbot(audio=None, user_text=None, voice="en"):
 # Define the About app section
 def about_app():
     about_text = """
-
-
-
-
-
+# About Voicesy AI
+
+**Voicesy AI** is a cutting-edge real-time chatbot and voice conversion application developed by **Hamaad Ayub Khan**. This innovative app combines advanced artificial intelligence technologies to provide users with a seamless interaction experience through both voice and text.
+
+## Purpose
+Voicesy AI is designed to facilitate easy communication by allowing users to convert spoken language into text and vice versa. The app is particularly beneficial for individuals who prefer voice interaction or have difficulties typing. Whether you're on the go or need assistance with accessibility, Voicesy AI makes communication effortless and engaging.
+
+## Features
+- **Voice-to-Voice Interaction**: Users can engage in conversations by speaking, and the app will transcribe audio input into text, process it, and provide a spoken response.
+- **Text-to-Speech Conversion**: Users can type their messages, which the app will convert into speech, allowing for easy listening and understanding.
+- **Language Support**: The app supports multiple voice options and languages, catering to a diverse user base.
+- **Intelligent Responses**: Powered by the latest language models, Voicesy AI offers relevant and context-aware responses, enhancing the interaction experience.
+
+## Technologies Used
+- **Whisper**: An automatic speech recognition (ASR) model developed by OpenAI, Whisper enables accurate transcription of spoken language into text, ensuring that voice input is understood correctly.
+- **gTTS (Google Text-to-Speech)**: This library is utilized for converting text responses generated by the AI into spoken audio, making it possible for users to listen to responses in real time.
+- **Groq**: This powerful AI model integration allows Voicesy AI to generate intelligent conversational responses based on user input, creating a more engaging user experience.
+- **Gradio**: The app is built using Gradio, a user-friendly framework that simplifies the creation of web-based interfaces for machine learning applications. Gradio enables rapid prototyping and easy deployment of the app, ensuring a smooth user experience.
+
+## Development
+Voicesy AI was developed with a focus on accessibility and user experience. Hamaad Ayub Khan utilized a variety of programming languages and frameworks to bring this project to life, including Python for backend development and Gradio for frontend interface design. Continuous testing and refinement were performed to ensure the app operates efficiently and effectively.
+
+## Disclaimer
+While Voicesy AI leverages advanced AI technologies, it is important to note that the AI may make mistakes. Users are encouraged to verify critical information and use the app as a supportive tool rather than a definitive source.
+
+## Contact
+For any inquiries or feedback regarding Voicesy AI, please reach out via the following social media links:
+
+- [Instagram](https://instagram.com/hamaadayubkhan)
+- [GitHub](https://github.com/hakgs1234)
+- [LinkedIn](https://www.linkedin.com/in/hamaadayubkhan)
+
+**Thank you for using Voicesy AI!**
     """
     return about_text
 
+
+
 # Gradio interface for real-time interaction with voice selection
 with gr.Blocks(css="style.css") as iface: # Include the CSS file here
     gr.Markdown("# Voicesy AI")
-
-    # Add logo at the top
-    gr.Image("IMG-20241024-WA0010.jpg", label="Voicesy AI") # Update the path to your logo image
-
     # Tab for Voice to Voice
     with gr.Tab("Voice to Voice"):
         audio_input = gr.Audio(type="filepath", label="Input Audio (optional)") # Input from mic or file
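
Review note: the About text describes a transcribe, respond, speak flow, but this hunk does not show the body of `chatbot(audio=None, user_text=None, voice="en")`. The following is only a minimal sketch of how such a turn could be structured, not the code in app.py. The name `run_turn`, the three injected callables, the stand-in functions, and the choice to prefer typed text over audio are all assumptions made for illustration.

```python
from typing import Callable, Optional, Tuple


def run_turn(
    transcribe: Callable[[str], str],
    generate: Callable[[str], str],
    synthesize: Callable[[str, str], str],
    audio: Optional[str] = None,
    user_text: Optional[str] = None,
    voice: str = "en",
) -> Tuple[str, str]:
    """Run one chat turn and return (reply_text, reply_audio_path).

    Hypothetical helper: the three callables stand for the speech-to-text,
    language-model, and text-to-speech steps named in the About text.
    """
    # Assumption: typed text wins when both inputs are supplied.
    if user_text and user_text.strip():
        prompt = user_text.strip()
    elif audio:
        prompt = transcribe(audio).strip()
    else:
        raise ValueError("provide either audio or user_text")

    reply = generate(prompt)
    return reply, synthesize(reply, voice)


# Stand-ins so the sketch runs without models or network access.
def fake_transcribe(path: str) -> str:
    return f"transcript of {path}"


def fake_generate(prompt: str) -> str:
    return f"You said: {prompt}"


def fake_synthesize(text: str, voice: str) -> str:
    return f"reply-{voice}.mp3"


if __name__ == "__main__":
    print(run_turn(fake_transcribe, fake_generate, fake_synthesize,
                   audio="clip.wav", voice="fr"))
```

In a real deployment the stand-ins might wrap Whisper's `model.transcribe(path)["text"]`, a Groq chat completion call, and `gTTS(text=reply, lang=voice).save(path)`. Passing them in as arguments keeps the turn logic testable without loading a model or calling an API.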