2 82 241

kelechic

tensorkelechi

https://kelechi-c.github.io/

AI & ML interests

vision

Recent Activity

liked a model 10 days ago

nateraw/musicgen-songstarter-v0.2

liked a dataset 12 days ago

Prarabdha/spotify_music

upvoted a paper 14 days ago

SmolVLM: Redefining small and efficient multimodal models

View all activity

Organizations

tensorkelechi's activity

upvoted a paper 14 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 16 days ago • 168

upvoted 2 papers about 1 month ago

Neural Vocoder is All You Need for Speech Super-resolution

Paper • 2203.14941 • Published Mar 28, 2022 • 1

MusicInfuser: Making Video Diffusion Listen and Dance

Paper • 2503.14505 • Published Mar 18 • 11

upvoted an article about 1 month ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Mar 12

• 398

upvoted a paper about 1 month ago

Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities

Paper • 2503.03983 • Published Mar 6 • 23

upvoted an article about 2 months ago

Article

Using LoRA for Efficient Stable Diffusion Fine-Tuning

Jan 26, 2023

• 57

upvoted a collection 2 months ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Feb 26 • 598

upvoted a paper 2 months ago

SoundStorm: Efficient Parallel Audio Generation

Paper • 2305.09636 • Published May 16, 2023 • 7

upvoted a collection 2 months ago

CLAP: Contrastive Language-Audio Pretraining

Collection

CLAP is to audio what CLIP is to image. • 5 items • Updated Oct 31, 2023 • 10

upvoted a paper 2 months ago

Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities

Paper • 2402.01831 • Published Feb 2, 2024 • 15

upvoted 2 articles 2 months ago

Article

SmolVLM - small yet mighty Vision Language Model

Nov 26, 2024

• 241

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 172

upvoted a paper 3 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 226

upvoted an article 3 months ago

Article

State of open video generation models in Diffusers

Jan 27

• 52

upvoted a paper 4 months ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 114

upvoted a collection 4 months ago

Cosmos Tokenizer

Collection

A suite of image and video tokenizers • 13 items • Updated 9 days ago • 40

upvoted a paper 4 months ago

MobileVLM V2: Faster and Stronger Baseline for Vision Language Model

Paper • 2402.03766 • Published Feb 6, 2024 • 14