忍者's picture

134 333

忍者

byteprobe

·

AI & ML interests

RL | NLP | LLM | LMM | agent

Recent Activity

liked a dataset about 9 hours ago

ServiceNow-AI/R1-Distill-SFT

liked a model about 9 hours ago

Qwen/Qwen2.5-VL-72B-Instruct

liked a model about 9 hours ago

m-a-p/YuE-s1-7B-anneal-en-cot

View all activity

Organizations

byteprobe's activity

upvoted a collection 4 days ago

DeepSeek R1 (All Versions)

DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated 1 day ago • 138

upvoted 5 papers 5 days ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 86

Reasoning Language Models: A Blueprint

Paper • 2501.11223 • Published 15 days ago • 31

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Paper • 2501.12326 • Published 13 days ago • 48

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published 15 days ago • 89

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published 13 days ago • 83

upvoted a collection 5 days ago

Llama 3.3

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 126

upvoted 3 papers 6 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 12 days ago • 284

Humanity's Last Exam

Paper • 2501.14249 • Published 11 days ago • 50

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published 9 days ago • 49

upvoted 4 collections 6 days ago

DeepSeek-V3

3 items • Updated 29 days ago • 171

DeepSeek-R1

8 items • Updated 14 days ago • 361

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 8 days ago • 96

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 3 items • Updated 8 days ago • 311

upvoted a collection 10 days ago

llama.vim

Recommended models for the llama.vim plugin • 5 items • Updated 4 days ago • 21

upvoted an article 10 days ago

Article

Introduction to ggml

Aug 13, 2024

• 139

upvoted an article 14 days ago

Article

🐺🐦‍⬛ LLM Comparison/Test: Phi-4, Qwen2 VL 72B Instruct, Aya Expanse 32B in my updated MMLU-Pro CS benchmark

By

•

24 days ago

• 4

upvoted 2 papers 14 days ago

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published 18 days ago • 36

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 18 days ago • 104

upvoted a paper 15 days ago

Learnings from Scaling Visual Tokenizers for Reconstruction and Generation

Paper • 2501.09755 • Published 18 days ago • 33