Upload folder using huggingface_hub

Files changed (9)

README.md CHANGED

@@ -11,9 +11,9 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# gridone-ko-llm-12.8b-v1.1d
+# gridone-ko-llm-12.8b-v1.1d-supervised
-This model is a fine-tuned version of [EleutherAI/polyglot-ko-12.8b](https://huggingface.co/EleutherAI/polyglot-ko-12.8b) on an unknown dataset.
+This model is a fine-tuned version of [EleutherAI/polyglot-ko-12.8b](https://huggingface.co/EleutherAI/polyglot-ko-12.8b) on modified KoAlpaca v1.1 dataset (1.1d), using supervised-style finetuning.
 ## Model description
@@ -41,6 +41,7 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 8
+- early_stop_epoch: 4
 ### Framework versions
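
The hyperparameter list above pairs a linear scheduler over 8 epochs with `early_stop_epoch: 4`. As a rough sketch of what that could mean for the learning rate at the stopping point, the snippet below computes the linear-decay multiplier. It assumes that `early_stop_epoch` means training halted after epoch 4, that there are zero warmup steps, and that the rate decays to zero at the last scheduled step. None of this is stated in the diff, and `steps_per_epoch` is a hypothetical value chosen only for illustration.

```python
def linear_lr_factor(step: int, total_steps: int, warmup_steps: int = 0) -> float:
    """Multiplier on the base learning rate for linear warmup followed by linear decay."""
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    if step < warmup_steps:
        # Linear ramp from 0 up to 1 during warmup.
        return step / max(1, warmup_steps)
    # Linear decay from 1 down to 0 over the remaining steps.
    return max(0.0, (total_steps - step) / max(1, total_steps - warmup_steps))


steps_per_epoch = 1000   # hypothetical; the real value is not in the diff
num_epochs = 8           # from the model card
early_stop_epoch = 4     # from the model card (interpretation assumed)

factor_at_stop = linear_lr_factor(
    early_stop_epoch * steps_per_epoch,
    num_epochs * steps_per_epoch,
)
print(f"LR multiplier at early stop: {factor_at_stop}")
```

Under these assumptions the run would end with the learning rate at half its initial value instead of fully decayed; a nonzero warmup would shift that figure slightly.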

optimizer.pt CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6e0fb9bcae2d6d87c41d93ed7644fb4792c73655c348a1528d0edc63a6597bc3
+oid sha256:ba0d8dd19d0b33d24bfdc907922aa7db91f578c224ba34d45f6997d97e4fa906
 size 24400263
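
Each binary file in this commit is stored as a Git LFS pointer: a `version` line, an `oid` with the SHA-256 of the real file, and its `size` in bytes. The size is unchanged here while the hash differs, so only the contents changed. A minimal sketch of parsing such a pointer and checking a downloaded file against it follows; the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of any library.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and hash the pointer records."""
    file_path = Path(path)
    if file_path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with file_path.open("rb") as handle:
        # Read in 1 MiB chunks so multi-gigabyte shards do not need to fit in memory.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# New pointer for optimizer.pt, copied from the diff above.
pointer = parse_lfs_pointer(
    """version https://git-lfs.github.com/spec/v1
oid sha256:ba0d8dd19d0b33d24bfdc907922aa7db91f578c224ba34d45f6997d97e4fa906
size 24400263
"""
)
print(pointer["algo"], pointer["size"])
```

With a local copy of the file, `matches_pointer("optimizer.pt", pointer)` would confirm that the download corresponds to this revision.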

pytorch_model-00001-of-00003.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cef5ac08250f925f719e5145cfe42b00f7bd45adb70639259cf59699e122bf71
+oid sha256:fe048d227a62d668d7862081178500f593e8235f16e564d4e256c495d749df3c
 size 9957073034

pytorch_model-00002-of-00003.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:176324a1173af6a4589ad0e14d23dc782d865d797fae87d226bcfdd818f9ca74
+oid sha256:e276b17efcb9515ce7bfb8689dd1ecb3383c6f1568debf0c60842d8a542526de
 size 9858779099

pytorch_model-00003-of-00003.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:61cdbf74d8266e06d4e9dde0e4614146cef11e8ee2f72df7c69199970cc23957
+oid sha256:9ab65ec8a254633de94efc415d955256038996d957b1756f39d02980081324b0
 size 5971549140
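
The three weight shards keep the same byte sizes across the commit. As a quick sanity check, the sizes recorded in the pointers can be summed to get the total checkpoint size. The bytes-per-parameter figure below relies on a nominal count of 12.8e9 parameters, taken only from the model name, so treat it as an approximation and not as a statement about the stored dtype.

```python
# Shard sizes in bytes, copied from the LFS pointers in this diff.
shard_sizes = {
    "pytorch_model-00001-of-00003.bin": 9957073034,
    "pytorch_model-00002-of-00003.bin": 9858779099,
    "pytorch_model-00003-of-00003.bin": 5971549140,
}

total_bytes = sum(shard_sizes.values())
total_gib = total_bytes / 2**30

# Assumption: parameter count inferred from the "12.8b" in the model name.
nominal_params = 12.8e9
bytes_per_param = total_bytes / nominal_params

print(f"total: {total_bytes} bytes ({total_gib:.2f} GiB)")
print(f"approx. bytes per parameter: {bytes_per_param:.2f}")
```

A ratio near 2 would be consistent with 16-bit weights, though the diff itself does not state the precision.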

rng_state.pth CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7508d4b8dd267de5cc58e972da25236687927651336a28f292c92f7f23951475
+oid sha256:c6869750f95a25c4e970298a33adf90e2d7ab52680bf3317239bff1b10103235
 size 14575

scheduler.pt CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6b02be2c961050591e39e048540caca52c7186951e99ec479ac50c29b65770a6
+oid sha256:517ee32571fae656ca32d3755631be92d2c857caec199017bdff8f4c80c3053d
 size 627

trainer_state.json CHANGED

The diff for this file is too large to render. See raw diff

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e958a56f84f43fcb3a0dd6ceff97b22bbd87dcdac1c403dc3affa48704646e3e
+oid sha256:2d0e23387816c7b5407851a7f2c31269b79df23c3442b77ab25fa52cca82ff6d
 size 4091