1 16 36

Gautier Evennou

Gevennou

AI & ML interests

PhD in ML on Multimodal

Recent Activity

liked a model 17 days ago

google/siglip2-so400m-patch16-512-jax

liked a Space 18 days ago

nanotron/ultrascale-playbook

liked a model 27 days ago

Zyphra/Zonos-v0.1-hybrid

View all activity

Organizations

Gevennou's activity

liked a model 17 days ago

google/siglip2-so400m-patch16-512-jax

Zero-Shot Image Classification • Updated 17 days ago • 3

liked a Space 18 days ago

2.15k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a model 27 days ago

Zyphra/Zonos-v0.1-hybrid

Text-to-Speech • Updated 23 days ago • 58.1k • 1.04k

liked a Space about 1 month ago

Vilt Nlvr

🚀

Compare two images with a sentence

liked a model about 1 month ago

microsoft/Florence-2-large

Image-Text-to-Text • Updated Dec 8, 2024 • 3.68M • 1.45k

New activity in facebook/emu_edit_test_set_generations about 2 months ago

[ISSUE] What's up with "a train station in city" captions ?

#3 opened over 1 year ago by

Gevennou

liked a model 2 months ago

microsoft/phi-4

Text Generation • Updated 14 days ago • 563k • • 1.88k

upvoted a paper 3 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 134

liked a model 5 months ago

stabilityai/stable-diffusion-3.5-large

Text-to-Image • Updated Oct 22, 2024 • 158k • • 2.43k

upvoted 2 papers 5 months ago

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published Sep 27, 2024 • 94

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 108

liked 2 Spaces 6 months ago

Gradio Lipsync Wav2lip

👄

Generate lip-synced video from video/image and audio

806

Face to All

👨

AI filter for your portraits

liked 2 Spaces 7 months ago

811

Parler-TTS

🥖

High-fidelity Text-To-Speech

469

Florence2 + SAM2

🔥

Segment objects in images and videos using text prompts

liked a model 8 months ago

BleachNick/SD3_UltraEdit_w_mask

Text-to-Image • Updated Jun 30, 2024 • 1.54k • 12

upvoted a paper 9 months ago

StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images

Paper • 2406.13735 • Published Jun 19, 2024 • 5

liked a model 9 months ago

AIRI-Institute/StyleFeatureEditor

Image-to-Image • Updated Jul 19, 2024 • 10

upvoted a paper 9 months ago

The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing

Paper • 2406.10601 • Published Jun 15, 2024 • 66

liked a Space 9 months ago

743

Florence 2

📉

Analyze images to generate captions, detect objects, or perform OCR