@KingNish on Hugging Face: "Introducing OpenGPT-4o https://huggingface.co/spaces/KingNish/OpenGPT-4o…"

Join the conversation

Join the community of Machine Learners and AI enthusiasts.

KingNish

posted an update May 14

Post

5047

Introducing OpenGPT-4o
KingNish/OpenGPT-4o

Features:
1️⃣ Inputs possible are Text ✏️, Text + Image 📝🖼️, Audio 🎧, WebCam📸
and outputs possible are Image 🖼️, Image + Text 🖼️📝, Text 📝, Audio 🎧
2️⃣ Flat 100% FREE 💸 and Super-fast ⚡.
3️⃣ Publicly Available before GPT 4o.

Future Features:
1️⃣ Chat with PDF (Both voice and text)
2️⃣ Video generation.
3️⃣ Sequential Image Generation.
4️⃣ Better UI and customization.

Note: It's not possible to reach level of complexity of GPT 4o because OpenAI has been developing GPT-4o from six months with a team of over 450+ experienced members, Whereas I am only One. Moreover, they haven't released it fully publicly, So, it remains a test model.

julien-c

May 14

this is working quite well!

osanseviero

May 14

I tried with the OAI example and it worked nicely! https://huggingface.co/spaces/KingNish/GPT-4o/discussions/1

Neilblaze

May 14

This is amazing!

victor

May 14

Out of curiosity did you use dev mode while building it?

KingNish

May 14

Yes, but how you know

PeepDaSlan9

May 14

I tried it

KingNish

May 15

any suggestions

AshScholar

May 17

This comment has been hidden

alybadara1803

May 17

what model did you use to build it ?
And is it possible to make a blog on how did you make it ?

KingNish

May 18

•

edited May 20

Super Chat Model - Idefics 2
Image Generation Model - Pollination Ai Api
Speech to Text - Nemo (API)
Voice Chat (Base Model) - Mixtral 8x7b (Inference API)
Text to Speech - Edge tts (API)
Live Chat (base model) - uform gen2 dpo