SmolVLM: Redefining small and efficient multimodal models • Paper • 2504.05299 • Published Apr 7, 2025 • 164 upvotes
LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning • Paper • 2503.04812 • Published Mar 4, 2025 • 14 upvotes
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models • Paper • 2402.03300 • Published Feb 5, 2024 • 116 upvotes
Cohere Labs Aya Vision • Collection • Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 6 days ago • 68 upvotes
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model • Paper • 2502.02737 • Published Feb 4, 2025 • 226 upvotes
Sana • Collection • ⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 21 items • Updated 5 days ago • 90 upvotes
VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents • Paper • 2410.10594 • Published Oct 14, 2024 • 27 upvotes
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning • Paper • 2409.20566 • Published Sep 30, 2024 • 57 upvotes
CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling • Paper • 2409.19291 • Published Sep 28, 2024 • 19 upvotes
MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines • Paper • 2409.12959 • Published Sep 19, 2024 • 38 upvotes