Pretrained GPT2 with expanded n_ctx up to 2048(also with expanded embedding dimension to 1536) in Korean.

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 24.27
ARC (25-shot) 21.16
HellaSwag (10-shot) 28.11
MMLU (5-shot) 26.56
TruthfulQA (0-shot) 42.06
Winogrande (5-shot) 49.09
GSM8K (5-shot) 0.0
DROP (3-shot) 2.89
Downloads last month
1,306
Safetensors
Model size
392M params
Tensor type
F32
Β·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Spaces using psyche/kogpt 25