Commit 3fce9d1 (1 parent: 3482ad8), committed by Abigail Annie Hélène Berthe--Pardo
Update README.md

README.md CHANGED
@@ -1,10 +1,13 @@
# Project Title: Talking car

A speaking assistant designed for in-car use, leveraging the LLaMA 2 model to support vocal interactions between the car and its users. This notebook provides the foundation for a speech-enabled interface that understands spoken questions and responds verbally.

## Description

This project integrates speech-to-text and text-to-speech functionality into a car's infotainment system, using the LLaMA 2 model to process and respond to vocal queries from users. It uses Gradio for the user interface and NexusRaven for function calling, and it integrates various APIs to fetch real-time information for a responsive, interactive car assistant.
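
The flow described above (speech in, language-model reasoning, speech out) can be sketched as three pluggable stages. This is a minimal illustration and not the notebook's code: `VoicePipeline` and the three stub functions are hypothetical names, and the stubs stand in for real speech-to-text, LLaMA 2, and text-to-speech calls.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class VoicePipeline:
    """Hypothetical container chaining the three stages of the assistant."""

    transcribe: Callable[[bytes], str]   # speech-to-text stage
    generate: Callable[[str], str]       # language-model stage (LLaMA 2 in the project)
    synthesize: Callable[[str], bytes]   # text-to-speech stage

    def respond(self, audio: bytes) -> bytes:
        """Turn spoken input into a spoken reply by running the stages in order."""
        question = self.transcribe(audio)
        answer = self.generate(question)
        return self.synthesize(answer)


# Stub stages so the sketch runs without any model; all three are assumptions.
def fake_transcribe(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_generate(question: str) -> str:
    return f"Answer to: {question}"


def fake_synthesize(text: str) -> bytes:
    return text.encode("utf-8")


pipeline = VoicePipeline(fake_transcribe, fake_generate, fake_synthesize)
reply = pipeline.respond(b"Where is the nearest charger?")
```

Keeping each stage behind a plain callable means a model can be swapped without touching the other two stages.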

## Features

• Speech-to-Text and Text-to-Speech: Lets the car assistant listen to spoken questions and respond audibly, providing a hands-free experience for drivers and passengers.
• Intelligent Function Calling with NexusRaven: Executes commands and retrieves information based on user queries, using the LLaMA 2 model's capabilities.
• Dynamic Model Integration: Incorporates multiple models for language recognition, speech processing, and text generation.
@@ -12,12 +15,14 @@ This project integrates speech-to-text and text-to-speech functionalities into a
• Real-Time Information Retrieval: Integrates with various APIs to provide up-to-date information on weather, routes, points of interest, and more.
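
To illustrate the function-calling feature, here is a minimal sketch of how a call proposed by a model could be dispatched to a registered tool. It assumes the call has already been extracted as a Python-style string such as `get_weather(city='Luxembourg')`; the README does not specify NexusRaven's output format, and `get_weather`, `TOOLS`, and `dispatch` are hypothetical names. Arguments are parsed with `ast.literal_eval`, so only literal values are accepted and no arbitrary code is evaluated.

```python
import ast
from typing import Any, Callable


def get_weather(city: str) -> str:
    """Hypothetical tool; a real one would query a weather API."""
    return f"Weather lookup requested for {city}"


# Registry of callable tools, keyed by the name the model is allowed to use.
TOOLS: dict[str, Callable[..., Any]] = {"get_weather": get_weather}


def dispatch(call_text: str) -> Any:
    """Parse a call such as "get_weather(city='Paris')" and run the matching tool."""
    node = ast.parse(call_text.strip(), mode="eval").body
    if not isinstance(node, ast.Call) or not isinstance(node.func, ast.Name):
        raise ValueError(f"not a simple function call: {call_text!r}")
    if node.func.id not in TOOLS:
        raise ValueError(f"unknown tool: {node.func.id}")
    # literal_eval accepts only literals (strings, numbers, lists, ...).
    args = [ast.literal_eval(arg) for arg in node.args]
    kwargs = {kw.arg: ast.literal_eval(kw.value) for kw in node.keywords}
    return TOOLS[node.func.id](*args, **kwargs)


result = dispatch("get_weather(city='Luxembourg')")
```

Restricting calls to names in the registry keeps the assistant from executing anything the notebook did not explicitly expose.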

## Requirements

• Gradio for creating interactive interfaces
• Hugging Face Transformers and additional ML models for speech and language processing
• NexusRaven for complex function execution

All required libraries and packages are loaded directly inside the notebook.
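
Before running the notebook, it can help to check which packages are already importable in the current environment. The sketch below uses only the standard library; `missing_packages` is a hypothetical helper, and the list of names (`gradio`, `transformers`) is an assumption drawn from the requirements above, not the notebook's full dependency list.

```python
from importlib.util import find_spec


def missing_packages(names: list[str]) -> list[str]:
    """Return the top-level package names that cannot be imported here."""
    return [name for name in names if find_spec(name) is None]


# Assumed import names for the libraries listed in the requirements.
to_install = missing_packages(["gradio", "transformers"])
if to_install:
    print("Not yet installed:", ", ".join(to_install))
```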

## Installation

To set up the speaking assistant in your car's system, follow these steps:

1. Run all the cells up to the “Interfaces (text and audio)” section.
2. Choose which interface to run: audio-to-audio or text-to-text.
@@ -28,6 +33,7 @@ Usage
4. Activation: Execute one of the interfaces to start the speaking assistant, enabling vocal interactions within the car.
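
The choice between the two interfaces can be sketched with Gradio as follows. This is an assumption-laden outline, not the notebook's cells: `answer_text`, `answer_audio`, and `build_interface` are hypothetical names, and both handlers are placeholders for the real model calls. Gradio is imported lazily so the handlers can be exercised without it installed.

```python
def answer_text(question: str) -> str:
    """Placeholder for the notebook's language-model call (hypothetical)."""
    return f"You asked: {question}"


def answer_audio(audio_path: str) -> str:
    """Placeholder that echoes the recording; a real handler would run
    speech-to-text, the language model, and text-to-speech."""
    return audio_path


def build_interface(mode: str = "text"):
    """Build the text-to-text or audio-to-audio interface."""
    if mode not in ("text", "audio"):
        raise ValueError(f"unknown mode: {mode!r}")
    import gradio as gr  # lazy import: only needed when an interface is built

    if mode == "text":
        return gr.Interface(fn=answer_text, inputs="text", outputs="text",
                            title="Talking car")
    return gr.Interface(fn=answer_audio, inputs=gr.Audio(type="filepath"),
                        outputs="audio", title="Talking car")


# To start an interface (requires Gradio):
#     build_interface("text").launch()
```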

## Authors and acknowledgment

Sasan Jafarnejad
Abigail Berthe--Pardo