Federico Cocchi

fede97

https://federico1-creator.github.io/Federico_Cocchi/

federico1-creator

AI & ML interests

Multimodal LLM - Computer Vision

fede97's activity

upvoted a paper 2 days ago

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Paper • 2504.08685 • Published 5 days ago • 105

liked a Space 4 days ago

The Ultra-Scale Playbook (nanotron/ultrascale-playbook)

The ultimate guide to training LLM on large GPU Clusters

updated a collection 4 days ago

ReflectiVA

Model and data for ReflectiVA: Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering [CVPR 2025] • 2 items • Updated 4 days ago

updated a dataset 10 days ago

aimagelab/ReflectiVA-Data

Preview • Updated 10 days ago • 109

New activity in aimagelab/ReflectiVA-Data 11 days ago

Add task category, link to paper and Github repository

#1 opened 13 days ago by

updated a collection 11 days ago

ReflectiVA

Models and data for ReflectiVA: Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering [CVPR 2025] • 3 items • Updated 11 days ago

updated a model 11 days ago

aimagelab/ReflectiVA

Image-Text-to-Text • Updated 11 days ago • 51 • 2

New activity in aimagelab/ReflectiVA 11 days ago

Add links to Github repository, project page and dataset

#1 opened 13 days ago by

New activity in itserr/LatinGPT_alpha-01 16 days ago

Update app.py

#1 opened 16 days ago by

updated a Space 22 days ago

LatinGPT

updated a collection 22 days ago

ReflectiVA

Models and data for ReflectiVA: Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering [CVPR 2025] • 3 items • Updated 11 days ago

published a dataset 22 days ago

aimagelab/ReflectiVA-Data

Preview • Updated 10 days ago • 109

liked a Space 22 days ago

AI Deadlines

Schedule tasks efficiently using AI-generated deadlines

authored a paper 23 days ago

LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning

Paper • 2503.15621 • Published 28 days ago

updated a model 4 months ago

aimagelab/CoDE

Image Feature Extraction • Updated Dec 12, 2024 • 1.04k • 2

authored a paper 5 months ago

Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering

Paper • 2411.16863 • Published Nov 25, 2024

updated a collection 5 months ago

ELSA EU Project

Dataset and models created inside the ELSA – European Lighthouse on Secure and Safe AI project on Multimedia use case. • 4 items • Updated Nov 25, 2024

updated a collection 8 months ago

LLaVA-MORE

LLaVA-MORE: Enhancing Visual Instruction Tuning with LLaMA 3.1 • 2 items • Updated Aug 31, 2024