Team-PIXEL

university

https://github.com/xplip/pixel


AI & ML interests

Language modelling with pixels

Team-PIXEL's activity

e-bug

authored a paper 2 months ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 126

lyan62

authored 3 papers 7 months ago

FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture

Paper • 2406.11030 • Published Jun 16, 2024

Understanding Retrieval Robustness for Retrieval-Augmented Image Captioning

Paper • 2406.02265 • Published Jun 4, 2024 • 6

The Role of Data Curation in Image Captioning

Paper • 2305.03610 • Published May 5, 2023

e-bug

authored a paper 7 months ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 68

e-bug

authored a paper 10 months ago

Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings

Paper • 2404.16820 • Published Apr 25, 2024 • 16

plip

updated a Space 10 months ago

PIXEL

🐱

ilkerkesen

authored a paper about 1 year ago

ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models

Paper • 2311.07022 • Published Nov 13, 2023 • 1

jflotz

updated 4 datasets about 1 year ago

elliottd, plip, esalesky, jflotz

authored a paper over 1 year ago

Text Rendering Strategies for Pixel Language Models

Paper • 2311.00522 • Published Nov 1, 2023 • 11

jflotz

updated 2 datasets over 1 year ago

Team-PIXEL/bigrams_wiki-en_529

Viewer • Updated Oct 2, 2023 • 18.4M • 245

Team-PIXEL/bigrams_bookcorpus_529

Viewer • Updated Oct 2, 2023 • 9.81M • 114

e-bug

authored a paper over 1 year ago

Measuring Progress in Fine-grained Vision-and-Language Understanding

Paper • 2305.07558 • Published May 12, 2023 • 1

jflotz

updated a model over 1 year ago

Team-PIXEL/pixel-base-bigrams

Updated May 11, 2023 • 635

Team members: 13
