oh sehun

sehun

AI & ML interests

None yet

Recent Activity

upvoted an article 3 days ago

Use Models from the Hugging Face Hub in LM Studio

upvoted an article 3 days ago

Introducing smolagents: simple agents that write actions in code.

upvoted an article 3 days ago

Welcome to Inference Providers on the Hub 🔥

Organizations

None yet

sehun's activity

upvoted 3 articles 3 days ago

Article

Use Models from the Hugging Face Hub in LM Studio

Nov 28, 2024

• 136

Article

Introducing smolagents: simple agents that write actions in code.

Dec 31, 2024

• 550

Article

Welcome to Inference Providers on the Hub 🔥

8 days ago

• 232

upvoted an article 12 days ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

13 days ago

• 109

upvoted a paper 15 days ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 19 days ago • 104

upvoted an article 16 days ago

Article

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

20 days ago

• 63

upvoted an article 18 days ago

Article

The Large Language Model Course

19 days ago

• 86

upvoted a paper 19 days ago

Multi-task retriever fine-tuning for domain-specific and efficient RAG

Paper • 2501.04652 • Published 27 days ago • 10

upvoted 2 papers 21 days ago

MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

Paper • 2501.06282 • Published 25 days ago • 43

LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs

Paper • 2501.06186 • Published 25 days ago • 60

upvoted 2 papers about 1 month ago

YuLan-Mini: An Open Data-efficient Language Model

Paper • 2412.17743 • Published Dec 23, 2024 • 64

Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning

Paper • 2412.15797 • Published Dec 20, 2024 • 17

upvoted 3 papers about 2 months ago

FastVLM: Efficient Vision Encoding for Vision Language Models

Paper • 2412.13303 • Published Dec 17, 2024 • 13

VisDoM: Multi-Document QA with Visually Rich Elements Using Multimodal Retrieval-Augmented Generation

Paper • 2412.10704 • Published Dec 14, 2024 • 15

Multimodal Latent Language Modeling with Next-Token Diffusion

Paper • 2412.08635 • Published Dec 11, 2024 • 44

upvoted an article 2 months ago

Article

EuroLLM-9B

Dec 2, 2024

• 108

upvoted a paper 4 months ago

ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification and KV Cache Compression

Paper • 2410.08584 • Published Oct 11, 2024 • 12

upvoted an article 6 months ago

Article

🤗 PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware

Feb 10, 2023

• 49