4 15 25

Longhui Yu

Longhui98

https://yulonghui.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 10 days ago

Kimi-VL Technical Report

liked a model 11 days ago

moonshotai/Kimi-VL-A3B-Thinking

authored a paper 3 months ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

View all activity

Organizations

Longhui98's activity

upvoted a paper 10 days ago

Kimi-VL Technical Report

Paper • 2504.07491 • Published 11 days ago • 114

liked a model 11 days ago

moonshotai/Kimi-VL-A3B-Thinking

Image-Text-to-Text • Updated about 15 hours ago • 28.4k • 369

authored 4 papers 3 months ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 113

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

Paper • 2403.09472 • Published Mar 14, 2024 • 1

Forward-Backward Reasoning in Large Language Models for Mathematical Verification

Paper • 2308.07758 • Published Aug 15, 2023 • 4

DeepVecFont-v2: Exploiting Transformers to Synthesize Vector Fonts with Higher Quality

Paper • 2303.14585 • Published Mar 25, 2023

upvoted a paper 3 months ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 113

liked 2 models 3 months ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 25 days ago • 1.73M • • 12k

deepseek-ai/DeepSeek-R1-Zero

Text Generation • Updated 25 days ago • 5.7k • 901

upvoted 4 papers 4 months ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 49

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 83

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 365

SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator

Paper • 2412.12094 • Published Dec 16, 2024 • 11

upvoted a collection 9 months ago

Qwen2-Math

Collection

Math-specific model series based on Qwen2 • 8 items • Updated Nov 28, 2024 • 52

liked 2 datasets 9 months ago

AI-MO/NuminaMath-TIR

Viewer • Updated Nov 25, 2024 • 72.5k • 7k • 126

AI-MO/NuminaMath-CoT

Viewer • Updated Nov 25, 2024 • 860k • 3.77k • 443

updated a Space 9 months ago

README

🦀

upvoted a collection 9 months ago

NuminaMath

Collection

Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 7 items • Updated Feb 10 • 77

upvoted a paper 9 months ago

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Paper • 2407.13623 • Published Jul 18, 2024 • 57

liked a model 9 months ago

mistralai/Mathstral-7B-v0.1

Text Generation • Updated Jul 31, 2024 • 16.3k • 222