Knowledge Engineer Group @ Tsinghua University

university

https://keg.cs.tsinghua.edu.cn/

Recent Activity

amyxx2001 authored a paper about 2 hours ago

EurekAgent: Agent Environment Engineering is All You Need For Autonomous Scientific Discovery

amyxx2001 submitted a paper about 6 hours ago

EurekAgent: Agent Environment Engineering is All You Need For Autonomous Scientific Discovery

mozhu submitted a paper 9 days ago

Diagnosing Harmful Continuation in Answer-Correct Long-CoT Training Traces

Papers

EurekAgent: Agent Environment Engineering is All You Need For Autonomous Scientific Discovery

LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

THU-KEG's models 84

THU-KEG/LongTraceRL-30B

Reinforcement Learning • 31B • Updated 11 days ago • 48 • 1

THU-KEG/LongTraceRL-8B

Reinforcement Learning • Updated 11 days ago • 1

THU-KEG/LongTraceRL-4B

Reinforcement Learning • 4B • Updated 11 days ago • 55 • 1

THU-KEG/DeepDive-30B-A3B-C-GRPO

31B • Updated Mar 25 • 5

THU-KEG/DeepDive-4B-C-GRPO

4B • Updated Mar 25 • 6

THU-KEG/DeepDive-30B-A3B-SFT

31B • Updated Mar 25 • 3

THU-KEG/DeepDive-4B-SFT

4B • Updated Mar 25 • 10

THU-KEG/WildReward-8B

Text Classification • 8B • Updated Feb 26 • 11 • 3

THU-KEG/WildReward-4B

Text Classification • 4B • Updated Feb 26 • 17 • 4

THU-KEG/LLaDA-8B-BGPO-sudoku

Reinforcement Learning • 8B • Updated Oct 14, 2025 • 4 • 1

THU-KEG/LLaDA-8B-BGPO-countdown

Reinforcement Learning • 8B • Updated Oct 14, 2025 • 2 • 1

THU-KEG/LLaDA-8B-BGPO-code

Reinforcement Learning • 8B • Updated Oct 14, 2025 • 5 • 1

THU-KEG/LLaDA-8B-BGPO-math

Reinforcement Learning • 8B • Updated Oct 14, 2025 • 4 • 1

THU-KEG/DeepPrune-Judge-4B

Text Classification • Updated Oct 11, 2025 • 13 • 2

THU-KEG/SIRI-1.5B-low

Text Generation • 2B • Updated Sep 30, 2025 • 3 • 2

THU-KEG/SIRI-1.5B-high

Text Generation • 2B • Updated Sep 30, 2025 • 3 • 3

THU-KEG/SIRI-7B-low

Text Generation • 8B • Updated Sep 30, 2025 • 6 • 2

THU-KEG/SIRI-7B-high

Text Generation • 8B • Updated Sep 30, 2025 • 14 • 5

THU-KEG/LongWriter-Zero-32B

Text Generation • 33B • Updated Jul 3, 2025 • 115 • 113

THU-KEG/IF-Verifier-7B

Text Generation • 8B • Updated Jun 12, 2025 • 22 • 2

THU-KEG/R1-Distill-Qwen-7B-VerIF

Text Generation • 8B • Updated Jun 12, 2025 • 5

THU-KEG/TULU3-VerIF

Text Generation • 8B • Updated Jun 12, 2025 • 11 • 3

THU-KEG/AdaptThink-7B-delta0.05

8B • Updated May 20, 2025 • 3 • 1

THU-KEG/AdaptThink-1.5B-delta0.1

2B • Updated May 20, 2025 • 5 • 2

THU-KEG/AdaptThink-1.5B-delta0.075

2B • Updated May 20, 2025 • 1

THU-KEG/AdaptThink-1.5B-delta0.02

2B • Updated May 20, 2025 • 1

THU-KEG/AdaptThink-1.5B-delta0.01

2B • Updated May 20, 2025 • 7 • 1

THU-KEG/AdaptThink-1.5B-delta0

2B • Updated May 20, 2025 • 5

THU-KEG/AdaptThink-1.5B-delta0.05

2B • Updated May 20, 2025 • 11

THU-KEG/ReaRAG-9B

Question Answering • Updated Apr 18, 2025 • 17 • 2