jebadiah greenwood's picture

3

jebadiah greenwood

Jebadiah

·

AI & ML interests

None yet

Recent Activity

updated a model 3 days ago

Jebadiah/Aria-coder-plus-7b

published a model 3 days ago

Jebadiah/Aria-coder-plus-7b

updated a model 3 days ago

Jebadiah/Aria-rp-coder-7b

View all activity

Organizations

Jebadiah's activity

updated a model 3 days ago

Jebadiah/Aria-coder-plus-7b

Updated 3 days ago • 33 • 1

published a model 3 days ago

Jebadiah/Aria-coder-plus-7b

Updated 3 days ago • 33 • 1

updated a model 3 days ago

Jebadiah/Aria-rp-coder-7b

Updated 3 days ago • 22 • 1

published a model 3 days ago

Jebadiah/Aria-rp-coder-7b

Updated 3 days ago • 22 • 1

updated a model 4 days ago

Jebadiah/Aria-tree-35-coder-7b

Updated 4 days ago • 19

published a model 4 days ago

Jebadiah/Aria-tree-35-coder-7b

Updated 4 days ago • 19

updated a model 4 days ago

Jebadiah/Aria-tree-coder-7b

Updated 4 days ago • 5

published a model 4 days ago

Jebadiah/Aria-tree-coder-7b

Updated 4 days ago • 5

updated a model 4 days ago

Jebadiah/Aria-coder-7b

Updated 4 days ago • 189

published a model 4 days ago

Jebadiah/Aria-coder-7b

Updated 4 days ago • 189

updated 2 models 4 days ago

Jebadiah/Aria-ruby-v3

Text Generation • Updated 4 days ago • 112

Jebadiah/Qwen-2.5-base-tron-7b

Updated 4 days ago • 23

published a model 4 days ago

Jebadiah/Qwen-2.5-base-tron-7b

Updated 4 days ago • 23

updated a model 5 days ago

Jebadiah/Qwen-2.5-base-agentic-2-7b

Updated 5 days ago • 51

published a model 5 days ago

Jebadiah/Qwen-2.5-base-agentic-2-7b

Updated 5 days ago • 51

updated a model 7 days ago

Jebadiah/Qwen-2.5-base-7b

Updated 7 days ago • 109 • 1

published a model 7 days ago

Jebadiah/Qwen-2.5-base-7b

Updated 7 days ago • 109 • 1

updated a model 2 months ago

Jebadiah/Luna-dream-02

Updated Jan 6 • 24

New activity in featherless-ai/try-this-model 7 months ago

jeiku/Aura-NeMo-12B

#2 opened 7 months ago by

reacted to merve's post with 🚀 8 months ago

Post

3279

Forget any document retrievers, use ColPali 💥💥

Document retrieval is done through OCR + layout detection, but you are losing a lot of information in between, stop doing that! 🤓

ColPali uses a vision language model, which is better in doc understanding 📑
ColPali: vidore/colpali (mit license!)
Blog post: https://huggingface.co/blog/manu/colpali
The authors also released a new benchmark for document retrieval:
ViDoRe Benchmark: vidore/vidore-benchmark-667173f98e70a1c0fa4db00d
ViDoRe Leaderboard: vidore/vidore-leaderboard

ColPali marries the idea of modern vision language models with retrieval 🤝

The authors apply contrastive fine-tuning to SigLIP on documents, and pool the outputs (they call it BiSigLip). Then they feed the patch embedding outputs to PaliGemma and create BiPali 🖇️
BiPali natively supports image patch embeddings to an LLM, which enables leveraging the ColBERT-like late interaction computations between text tokens and image patches (hence the name ColPali!) 🤩

The authors created the ViDoRe benchmark by collecting PDF documents and generate queries from Claude-3 Sonnet.
ColPali seems to be the most performant model on ViDoRe. Not only this, but is way faster than traditional PDF parsers too!