Gemma-2-2b a lightweight version has Sota performance at the same size finetuned on Vietnamese dataset. Model focused mainly on vietnamese
Achieved 42.35 on VMLU test