GoLongRL
-
GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment
Paper • 2605.19577 • Published • 58 -
Kwai-Klear/GoLongRL-30B-A3B
Text Generation • 31B • Updated • 471 • 11 -
Kwai-Klear/GoLongRL-4B
Text Generation • 4B • Updated • 178 • 4 -
Kwai-Klear/GoLongRL
Viewer • Updated • 23k • 1.26k • 23