Exciting news! Introducing super-fast AI video assistant, currently in beta. With a minimum latency of under 500ms and an average latency of just 600ms.
I am experimenting with Flux and trying to push it to its limits without training (as I am GPU-poor ๐ ). I found some flaws in the pipelines, which I resolved, and now I am able to generate an approx similar quality image as Flux Schnell 4 steps in just 1 step. Demo Link: KingNish/Realtime-FLUX
Introducing Voicee, A superfast voice fast assistant. KingNish/Voicee It achieved latency <500 ms. While its average latency is 700ms. It works best in Google Chrome. Please try and give your feedbacks. Thank you. ๐ค
This feature enhances the capabilities of OpenGPT 4o, allowing it to fetch and integrate the latest information from the web directly into its responses. Try Now: KingNish/OpenGPT-4o
With WEB SEARCH, OpenGPT 4o becomes an even more versatile and dynamic AI, ready to assist with up-to-date data retrieval and analysis.
1. Chat with Google Agent - This includes three AI models that allow you to converse with an AI, which provides answers by searching Google. Demo Link: poscye/google-go
Yes, you can use them but... with limitations like You can't use DallE ๐ฅ, You can't make Custom GPTs And chat limit also๐ฅ. But... We already have an open-source alternative like Hugging Chat, where you can create your custom assistant, generate, edit images, without any chat limit.
Future Updates: 1. Web Search (Suggested by @GPT007 and @Saionton ) 2. Live Chat with Voice Chat 3. Model Choices (Suggested by @NotAiLOL ) 4. Multilingual Chats.
Suggest more features that should be added. ๐ค Thanks!
2. Phi 3 Mini Vision 128k: A 4.5 billion-parameter, instruction-tuned vision model that has outperformed models such as Llava3 and Claude 3, and is providing stiff competition to Gemini 1Pro Vision. microsoft/Phi-3-vision-128k-instruct
๐๐ฎ๐ฆ๐ฆ๐๐ซ๐ฒ ๐จ๐ ๐๐ซ๐ญ๐ข๐๐ฅ๐- ๐ # ๐๐๐๐ก๐๐ง๐ข๐๐ฌ ๐จ๐ ๐๐๐-๐โ๐จโ: GPT-4โoโ operates through three main components ๐ ๏ธ
๐. ๐๐ฎ๐ฉ๐๐ซ๐๐ก๐๐ญ: Integrates image generation, QnA (image, document and video) for diverse interactions. ๐. ๐๐จ๐ข๐๐ ๐๐ก๐๐ญ: Merges TTS and STT for real-time, human-like audio responses, focusing on human interaction. ๐. ๐๐ข๐๐๐จ ๐๐ก๐๐ญ: Utilizes Zero Shot Image Classification to enhance user interaction with visual information.
๐. ๐๐ฎ๐ฅ๐ญ๐ข๐๐จ๐๐๐ฅ๐ข๐๐ข๐๐๐ญ๐ข๐จ๐ง: Combines multiple models for a powerful, multifunctional AI. ๐. ๐๐ฎ๐๐ญ ๐๐๐ฉ๐ ๐๐๐ญ๐ก๐จ๐: Uses different models or APIs for specific tasks without additional training.
The article provides an in-depth exploration of GPT-4โoโ, its functionalities, and methods to create similar AI models. It emphasizes the modelโs language support and its innovative approach to human-AI interaction. ๐ก๐
New Updates OpenGPT 4o 1. Live Chat (also known as video chat) (very powerful and fast, it can even identify famous places and persons) 2. Powerful Image Generation
Today, I gained access to GPT-4o, so I thought to test it. However, I encountered several problems, such as When I requested image generation, it did not create any images but only provided links, which are also incorrect. ๐ฅ [Image 1]
Subsequently, I considered that my prompt might be incorrect, I attempted once more with a prompt from OpenAI's examples, but it also did not work. ๐ฅ [Image 2]
Then, I tested its logical reasoning skills, which it failed. I presented a question that an 8b model solved with ease, but GPT-4o could not. ๐ฅ [Image 3]
I also attempted to generate an image from another image, but this too was unsuccessful. [image 4]
Nonetheless, it excels in tasks such as image classification and voice chat.
If you've experienced similar issues, please share them here.
Features: 1๏ธโฃ Inputs possible are Text โ๏ธ, Text + Image ๐๐ผ๏ธ, Audio ๐ง, WebCam๐ธ and outputs possible are Image ๐ผ๏ธ, Image + Text ๐ผ๏ธ๐, Text ๐, Audio ๐ง 2๏ธโฃ Flat 100% FREE ๐ธ and Super-fast โก. 3๏ธโฃ Publicly Available before GPT 4o.
Future Features: 1๏ธโฃ Chat with PDF (Both voice and text) 2๏ธโฃ Video generation. 3๏ธโฃ Sequential Image Generation. 4๏ธโฃ Better UI and customization.
Note: It's not possible to reach level of complexity of GPT 4o because OpenAI has been developing GPT-4o from six months with a team of over 450+ experienced members, Whereas I am only One. Moreover, they haven't released it fully publicly, So, it remains a test model.