Revisiting Hierarchical Text Classification: Inference and Metrics Paper • 2410.01305 • Published Oct 2, 2024
Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning Paper • 2502.06533 • Published about 1 month ago