1 18 4

Cyril

cyrilzakka

https://cyrilzakka.github.io

AI & ML interests

Multimodal models for clinical medicine and surgery

Recent Activity

upvoted a paper 8 days ago

Gemma 3 Technical Report

liked a model 22 days ago

arcinstitute/evo2_40b

upvoted an article 29 days ago

You could have designed state of the art positional encoding

View all activity

Organizations

cyrilzakka's activity

upvoted a paper 8 days ago

Gemma 3 Technical Report

Paper • 2503.19786 • Published 10 days ago • 41

upvoted an article 29 days ago

Article

You could have designed state of the art positional encoding

Nov 25, 2024

• 203

upvoted 2 papers about 1 month ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 178

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 139

upvoted an article about 1 month ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

Feb 20

• 223

upvoted a paper about 2 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 217

upvoted an article about 2 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.21k

upvoted a paper about 2 months ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 59

upvoted a paper 3 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 273

upvoted 3 papers 4 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 363

Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

Paper • 2412.04424 • Published Dec 5, 2024 • 62

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 134

upvoted 2 collections 4 months ago

PaliGemma 2 Release

Collection

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 32 items • Updated 1 day ago • 146

Deepseek Papers

Collection

Deepseek papers collection • 19 items • Updated about 13 hours ago • 181

upvoted a paper 6 months ago

EchoPrime: A Multi-Video View-Informed Vision-Language Model for Comprehensive Echocardiography Interpretation

Paper • 2410.09704 • Published Oct 13, 2024 • 13