Yusheng Su

yushengsu
https://yushengsu-thu.github.io/
  • yushengsu-thu

AI & ML interests

None yet

Recent Activity

updated a dataset 4 days ago
yushengsu/profiling-traces
published a dataset 4 days ago
yushengsu/profiling-traces
updated a dataset 25 days ago
yushengsu/lora-diff-Qwen3.5-35B-A3B

Organizations

Thinking Machines Lab

Papers 3

arxiv:2501.04227
arxiv:2308.10848
arxiv:2306.02320

Models 6

yushengsu/sgl-lora-data

Updated Mar 17

yushengsu/sglang_lora_logprob_diff_without_tuning

Updated Dec 9, 2025 • 1.59k

yushengsu/Qwen3-4B-torch-dist

Updated Oct 10, 2025

yushengsu/coke-bert-base-uncased-2hop

Updated Jul 1, 2023 • 1

yushengsu/coke-roberta-base-2hop

Updated Jul 1, 2023 • 1

yushengsu/CokeBERT

Updated Dec 20, 2022

Datasets 11

yushengsu/profiling-traces

Updated 4 days ago • 46

yushengsu/lora-diff-Qwen3.5-35B-A3B

Updated 25 days ago • 31

yushengsu/lora-diff-NVIDIA-Nemotron-3-Super-120B-A12B-BF16

Updated Apr 2 • 29

yushengsu/lora-diff-NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

Updated Apr 2 • 26

yushengsu/rope-dump-DeepSeek-V3.1-Base

Updated Mar 20 • 12

yushengsu/lora-diff-gpt-oss-20b

Updated Mar 17 • 47

yushengsu/lora-diff-Qwen3-VL-30B-A3B-Instruct

Updated Mar 17 • 53

yushengsu/lora-diff-Qwen3-8B

Updated Mar 17 • 55

yushengsu/lora-diff-Qwen3-30B-A3B-Instruct-2507

Updated Mar 17 • 64

yushengsu/lora-diff-Kimi-K2.5

Updated Mar 17 • 42