Sleeping
👀
as u said in the earlier post the biggest step up in open ai gpt 4o is the super model with very good computer vision capabilities,we need an open source alternative to that,,,,plzz try making the image to text model bigger
Why not use bigger computer vision model?i think we already reached enough improvement in language models.we need to focus on text to image and image to text models
Can u make it so that the text in the images generated be readable and accurate