Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy • Paper 2310.01334 • Published Oct 2, 2023
Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark • Paper 2402.11592 • Published Feb 18, 2024