radm (r4dm)

upvoted an article 4 months ago

Article

SOTA OCR with Core ML and dots.ocr

Oct 2, 2025

•

62

upvoted a paper 5 months ago

Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic

Paper • 2509.01363 • Published Sep 1, 2025 • 59

upvoted a paper 8 months ago

SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

Paper • 2505.20411 • Published May 26, 2025 • 92

upvoted a collection 9 months ago

late interaction retrievers

Collection

This collection list our ColBERT like late interaction retriever models • 4 items • Updated Jul 20, 2025 • 2

upvoted 2 articles over 1 year ago

Article

ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models

Jul 27, 2024

•

34

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

+4

Sep 18, 2024

•

274

upvoted a paper over 1 year ago

PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model Evaluation

Paper • 2409.06820 • Published Sep 10, 2024 • 68

upvoted a collection over 1 year ago

SimPO

Collection

This collections contains a list of SimPO and baseline models. • 49 items • Updated Mar 16, 2025 • 23

upvoted an article over 1 year ago

Article

Google Search with LLM

May 1, 2024

•

11

upvoted a collection over 1 year ago

abliterated-v3

Collection

Latest gen of the abliterated models I've produced • 17 items • Updated Jun 3, 2024 • 137

upvoted an article over 1 year ago

Article

Uncensor any LLM with abliteration

Jun 13, 2024

•

768

upvoted 2 papers over 1 year ago

Weak-to-Strong Extrapolation Expedites Alignment

Paper • 2404.16792 • Published Apr 25, 2024 • 11

Vikhr: The Family of Open-Source Instruction-Tuned Large Language Models for Russian

Paper • 2405.13929 • Published May 22, 2024 • 55

upvoted a paper over 2 years ago

Microscaling Data Formats for Deep Learning

Paper • 2310.10537 • Published Oct 16, 2023 • 8

r4dm

AI & ML interests

Organizations

SOTA OCR with Core ML and dots.ocr

Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic

SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

late interaction retrievers

ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model Evaluation

SimPO

Google Search with LLM

abliterated-v3

Uncensor any LLM with abliteration

Weak-to-Strong Extrapolation Expedites Alignment

Vikhr: The Family of Open-Source Instruction-Tuned Large Language Models for Russian

Microscaling Data Formats for Deep Learning

r4dm

AI & ML interests

Organizations

radm's activity

SOTA OCR with Core ML and dots.ocr

ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Google Search with LLM

Uncensor any LLM with abliteration