9 6 28

Jon Wang

Cornmonster

UranusSeven

AI & ML interests

None yet

Recent Activity

liked a model 21 days ago

Qwen/Qwen2.5-Omni-7B

liked a dataset 2 months ago

open-r1/OpenThoughts-114k-math

upvoted a paper 3 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

View all activity

Organizations

Cornmonster's activity

liked a model 21 days ago

Qwen/Qwen2.5-Omni-7B

Any-to-Any • Updated 3 days ago • 145k • 1.41k

liked a dataset 2 months ago

open-r1/OpenThoughts-114k-math

Viewer • Updated Jan 30 • 89.1k • 1.15k • 79

upvoted a paper 3 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 381

liked 2 models 3 months ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 22 days ago • 1.68M • • 11.9k

facebook/multi-token-prediction

Updated Jun 18, 2024 • 369

upvoted a paper 5 months ago

SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration

Paper • 2411.10958 • Published Nov 17, 2024 • 56

liked a model 6 months ago

genmo/mochi-1-preview

Text-to-Video • Updated Dec 18, 2024 • 24.7k • • 1.21k

upvoted a paper 8 months ago

Efficient LLM Scheduling by Learning to Rank

Paper • 2408.15792 • Published Aug 28, 2024 • 21

liked a dataset about 1 year ago

harshitasaxena/Humour_check

Viewer • Updated Jan 22, 2024 • 200k • 13 • 3

upvoted a paper about 1 year ago

EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty

Paper • 2401.15077 • Published Jan 26, 2024 • 21

liked 2 models over 1 year ago

openchat/openchat_3.5

Text Generation • Updated May 18, 2024 • 12.8k • 1.13k

01-ai/Yi-34B

Text Generation • Updated Nov 11, 2024 • 5.21k • 1.3k

upvoted a paper over 1 year ago

FlashDecoding++: Faster Large Language Model Inference on GPUs

Paper • 2311.01282 • Published Nov 2, 2023 • 37

liked a model over 1 year ago

NousResearch/Yarn-Mistral-7b-128k

Text Generation • Updated Nov 2, 2023 • 10.1k • 572

liked 2 datasets over 1 year ago

stingning/ultrachat

Viewer • Updated Feb 22, 2024 • 774k • 2.43k • 436

openbmb/UltraFeedback

Viewer • Updated Dec 29, 2023 • 64k • 1.84k • 356

New activity in Qwen/Qwen-14B-Chat over 1 year ago

int4量化Qwen/Qwen-14B-Chat运行出错

#2 opened over 1 year ago by

Trenx

liked a model over 1 year ago

cerebras/btlm-3b-8k-base

Text Generation • Updated Oct 23, 2023 • 2.19k • 262

upvoted a paper over 1 year ago

Efficient Memory Management for Large Language Model Serving with PagedAttention

Paper • 2309.06180 • Published Sep 12, 2023 • 25

New activity in Xorbits/chatglm2-6B-GGML over 1 year ago

How to deploy it as a service and use APIs to call it.

#5 opened over 1 year ago by

gnine