SakanaAI/TinySwallow-1.5B
Text Generation
•
Updated
•
1.14k
•
7
Compact Japanese models trained with "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"