Maziyar Panahi's picture

Maziyar Panahi PRO

MaziyarPanahi

·

AI & ML interests

Fine-Tuning, RLHF, Merging, Quantizations, Leaderboards

Recent Activity

liked a model about 20 hours ago

microsoft/Phi-4-multimodal-instruct

liked a dataset 1 day ago

lmarena-ai/arena-human-preference-100k

updated a collection 2 days ago

View all activity

Organizations

MaziyarPanahi's activity

upvoted a collection 3 days ago

olmOCR

olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 3 items • Updated 1 day ago • 44

upvoted a collection 6 days ago

SYNTHETIC-1

A collection of tasks & verifiers for reasoning datasets • 9 items • Updated 8 days ago • 49

upvoted an article 14 days ago

Article

Fixing Open LLM Leaderboard with Math-Verify

15 days ago

• 26

upvoted a paper 14 days ago

An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging

Paper • 2502.09056 • Published 15 days ago • 30

upvoted an article 15 days ago

Article

From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub

17 days ago

• 49

upvoted a collection 21 days ago

Hibiki fr-en

Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated 22 days ago • 50

upvoted a collection 24 days ago

Reasoning

4 items • Updated 2 days ago • 1

upvoted a collection 26 days ago

Reasoning Datasets

Distilled synthetic Reasoning datasets • 7 items • Updated 26 days ago • 55

upvoted an article 26 days ago

Article

Open-R1: Update #1

By

and 7 others •

27 days ago

• 288

upvoted a collection about 1 month ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 8 items • Updated 4 days ago • 379

upvoted an article about 1 month ago

Article

We now support VLMs in smolagents!

Jan 24

• 86

upvoted a paper about 1 month ago

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3, 2024 • 81

upvoted an article about 1 month ago

Article

Introducing the Synthetic Data Generator - Build Datasets with Natural Language

Dec 16, 2024

• 109

upvoted a paper about 1 month ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published Jan 9 • 95

upvoted an article about 1 month ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

Jan 15

• 150

upvoted a collection about 1 month ago

InternLM3

6 items • Updated 17 days ago • 23

upvoted an article about 2 months ago

Article

Mastering Tensor Dimensions in Transformers

By

•

Jan 12

• 44

upvoted 2 collections about 2 months ago

Phi-4

Phi-4 family of small language and multi-modal models. • 7 items • Updated about 2 hours ago • 81

GIANTS

Frankenstein and giant models merged! • 11 items • Updated 24 days ago • 4

upvoted a collection 2 months ago

InternVL2.5-MPO

Enhancing the Reasoning Ability of MLLMs via Mixed Preference Optimization • 16 items • Updated 30 days ago • 26