matlok 's Collections
LMM

Papers - Text - Training - Batch Scaling - Cut Cross Entropy