Eyel (Eyel)

upvoted a collection 5 months ago

Apertus LLM

Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages • 4 items • Updated Oct 1, 2025 • 326

upvoted 2 papers 6 months ago

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

Paper • 2508.14460 • Published Aug 20, 2025 • 85

Seed-X: Building Strong Multilingual Translation LLM with 7B Parameters

Paper • 2507.13618 • Published Jul 18, 2025 • 16

upvoted a paper 7 months ago

How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks

Paper • 2507.01955 • Published Jul 2, 2025 • 36

upvoted 2 papers 10 months ago

BitNet b1.58 2B4T Technical Report

Paper • 2504.12285 • Published Apr 16, 2025 • 82

Scaling Laws of Decoder-Only Models on the Multilingual Machine Translation Task

Paper • 2409.15051 • Published Sep 23, 2024 • 2

upvoted a paper 11 months ago

Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers

Paper • 2503.00865 • Published Mar 2, 2025 • 64

upvoted 3 papers about 1 year ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31, 2025 • 124

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published Jan 28, 2025 • 36

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

Paper • 2412.07626 • Published Dec 10, 2024 • 28

upvoted an article about 1 year ago

Article

EuroLLM-9B

Dec 2, 2024

•

139

upvoted a paper about 1 year ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 133

upvoted an article about 1 year ago

Article

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

Dec 4, 2024

•

80

upvoted 7 papers over 1 year ago

X-ALMA: Plug & Play Modules and Adaptive Rejection for Quality Translation at Scale

Paper • 2410.03115 • Published Oct 4, 2024 • 1

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17, 2024 • 100

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 180

Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis

Paper • 2409.20059 • Published Sep 30, 2024 • 16

InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning

Paper • 2409.12568 • Published Sep 19, 2024 • 50

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 140

SAM 2: Segment Anything in Images and Videos

Paper • 2408.00714 • Published Aug 1, 2024 • 120

Eyel

AI & ML interests

Organizations

Apertus LLM

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

Seed-X: Building Strong Multilingual Translation LLM with 7B Parameters

How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks

BitNet b1.58 2B4T Technical Report

Scaling Laws of Decoder-Only Models on the Multilingual Machine Translation Task

Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers

s1: Simple test-time scaling

Optimizing Large Language Model Training Using FP4 Quantization

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

EuroLLM-9B

PaliGemma 2: A Family of Versatile VLMs for Transfer

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

X-ALMA: Plug & Play Modules and Adaptive Rejection for Quality Translation at Scale

Movie Gen: A Cast of Media Foundation Models

Differential Transformer

Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis

InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning

Training Language Models to Self-Correct via Reinforcement Learning

SAM 2: Segment Anything in Images and Videos

Eyel

AI & ML interests

Organizations

Eyel's activity

EuroLLM-9B

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs