11 38 159

dinhanhx

dinhanhx

AI & ML interests

Vision Language

Recent Activity

liked a model 1 day ago

CohereLabs/c4ai-command-r7b-12-2024

liked a model 20 days ago

Qwen/Qwen2.5-14B-Instruct-GPTQ-Int8

liked a model 28 days ago

mistralai/Mistral-Small-3.1-24B-Instruct-2503

View all activity

Organizations

dinhanhx's activity

upvoted a paper about 2 months ago

π_0: A Vision-Language-Action Flow Model for General Robot Control

Paper • 2410.24164 • Published Oct 31, 2024 • 9

upvoted an article about 2 months ago

Article

Vision Language Models Explained

Apr 11, 2024

• 308

upvoted 3 articles 2 months ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 171

Article

Visual Document Retrieval Goes Multilingual

Jan 10

• 71

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 841

upvoted a paper 4 months ago

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

Paper • 2412.07626 • Published Dec 10, 2024 • 22

upvoted 3 collections 5 months ago

upvoted a collection 6 months ago

C4AI Aya Expanse

Collection

Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 4 items • Updated Mar 2 • 38

upvoted an article 6 months ago

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28, 2024

• 208

upvoted 2 collections 6 months ago

VisionLM

Collection

929 items • Updated about 13 hours ago • 56

Awesome Document AI

Collection

A collection of open-source document AI 📄 📝 📈 • 27 items • Updated Mar 11, 2024 • 80

upvoted a paper 7 months ago

VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding

Paper • 2407.12594 • Published Jul 17, 2024 • 19

upvoted an article 7 months ago

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25, 2024

• 188