11 52 170

PZ PRO

philipp-zettl

philipp-zettl

AI & ML interests

NLP/CV/Multimodal learning

Recent Activity

updated a model about 2 hours ago

philipp-zettl/bge-small-en-v1.5-mtg

updated a model about 2 hours ago

philipp-zettl/bge-small-en-v1.5-mtg

updated a model about 2 hours ago

philipp-zettl/bge-small-en-v1.5-mtg

View all activity

Organizations

philipp-zettl's activity

upvoted an article 1 day ago

Article

Welcome Llama 4 Maverick & Scout on Hugging Face!

2 days ago

• 88

upvoted 2 papers 3 days ago

Less-to-More Generalization: Unlocking More Controllability by In-Context Generation

Paper • 2504.02160 • Published 4 days ago • 1

MegaTTS 3: Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis

Paper • 2502.18924 • Published Feb 26 • 9

upvoted a paper 5 days ago

Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Paper • 2504.00595 • Published 6 days ago • 33

upvoted a paper 7 days ago

Your ViT is Secretly an Image Segmentation Model

Paper • 2503.19108 • Published 13 days ago • 17

upvoted a paper 10 days ago

Wan: Open and Advanced Large-Scale Video Generative Models

Paper • 2503.20314 • Published 12 days ago • 47

upvoted an article 14 days ago

Article

LeRobot goes to driving school: World’s largest open-source self-driving dataset

27 days ago

• 73

upvoted a paper 18 days ago

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published 19 days ago • 135

upvoted a paper 20 days ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published 23 days ago • 85

upvoted a paper 21 days ago

SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation

Paper • 2503.09641 • Published 26 days ago • 31

upvoted an article 27 days ago

Article

Welcome to Inference Providers on the Hub 🔥

Jan 28

• 458

upvoted a paper about 1 month ago

Auditing Prompt Caching in Language Model APIs

Paper • 2502.07776 • Published Feb 11 • 5

upvoted an article about 1 month ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

Feb 20

• 224

upvoted a collection about 1 month ago

Phi-4

Collection

Phi-4 family of small language and multi-modal models. • 7 items • Updated Mar 3 • 113

upvoted a paper about 2 months ago

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published Feb 13 • 148

upvoted a collection about 2 months ago

Express 🚅

Collection

Express Tiny LLM's • 7 items • Updated 13 days ago • 3

upvoted a paper about 2 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 220

upvoted 2 papers 3 months ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published Jan 9 • 92

1.58-bit FLUX

Paper • 2412.18653 • Published Dec 24, 2024 • 84