Filippo B's picture

7 4 1

Filippo B

Filippo

·

https://www.filippobroggini.com/

AI & ML interests

GenAI, LLMs, VLMs, accelerated computing, information retrieval, workflows orchestration

Recent Activity

updated a Space 13 days ago

Filippo/First_agent_template

updated a collection 22 days ago

liked a Space about 1 month ago

nanotron/ultrascale-playbook

View all activity

Organizations

Filippo's activity

updated a Space 13 days ago

First Agent Template

Get current local time in any timezone

updated a collection 22 days ago

LLMs and such

3 items • Updated 22 days ago

liked a Space about 1 month ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

updated a collection about 2 months ago

LLMs and such

3 items • Updated 22 days ago

upvoted a paper about 2 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 218

updated 2 collections 6 months ago

Search and retrieval

1 item • Updated Oct 10, 2024

Vision Language Models (VLMs)

1 item • Updated Oct 2, 2024

upvoted a paper 6 months ago

ColPali: Efficient Document Retrieval with Vision Language Models

Paper • 2407.01449 • Published Jun 27, 2024 • 46

reacted to merve's post with 🔥 6 months ago

Post

5636

I have put together a notebook on Multimodal RAG, where we do not process the documents with hefty pipelines but natively use:
- vidore/colpali for retrieval 📖 it doesn't need indexing with image-text pairs but just images!
- Qwen/Qwen2-VL-2B-Instruct for generation 💬 directly feed images as is to a vision language model with no processing to text!
I used ColPali implementation of the new 🐭 Byaldi library by @bclavie 🤗
https://github.com/answerdotai/byaldi
Link to notebook: https://github.com/merveenoyan/smol-vision/blob/main/ColPali_%2B_Qwen2_VL.ipynb

New activity in Filippo/distilabel-intel-orca-dpo-pairs-filtered about 1 year ago

add distilabel and synthethic tag

#2 opened about 1 year ago by

davidberenstein1957

updated a dataset about 1 year ago

Filippo/distilabel-intel-orca-dpo-pairs-filtered

Viewer • Updated Feb 6, 2024 • 5.92k • 56

updated a collection about 1 year ago

Datasets for LLMs

4 items • Updated Feb 2, 2024