Dana Aubakirova

danaaubakirova

AI & ML interests

DocumentAI, Deep Learning, Multimodal Learning, Computer Vision, Image Processing, NLP

Organizations

Hugging Face, Hugging Face Internal Testing Organization, Social Post Explorers, Dev Mode Explorers

danaaubakirova's activity

upvoted 2 articles 4 months ago

Introducing TextImage Augmentation for Document Images

32
posted an update 5 months ago
872
🚀 We are thrilled to introduce TextImage Data Augmentation, developed in collaboration with Albumentations AI! ✨ This multimodal technique modifies document images and their text simultaneously, helping Vision Language Models (VLMs) perform better on text-heavy datasets.

👩‍💻 Learn how this approach can improve your document AI projects in our full blog post: https://huggingface.co/blog/doc_aug_hf_alb
upvoted an article 5 months ago

LAVE: Zero-shot VQA Evaluation on Docmatix with LLMs - Do We Still Need Fine-Tuning?

18
New activity in mPLUG/DocOwl 6 months ago

Update model_worker.py

#4 opened 6 months ago by
danaaubakirova
upvoted 2 articles 7 months ago
posted an update 7 months ago
upvoted an article 7 months ago

PaliGemma – Google's Cutting-Edge Open Vision Language Model

226
published an article 8 months ago

Multimodal Augmentation for Documents: Recovering “Comprehension” in “Reading and Comprehension” task

17