Update README.md
README.md
CHANGED
@@ -5,11 +5,11 @@ language:
|
|
5 |
metrics:
|
6 |
- accuracy
|
7 |
---
|
8 |
-
# Model Card for
|
9 |
|
10 |
<!-- Provide a quick summary of what the model is/does. -->
|
11 |
|
12 |
-
|
13 |
|
14 |
## Model Details
|
15 |
|
@@ -19,20 +19,16 @@ This modelcard aims to be a base template for new models. It has been generated
|
|
19 |
|
20 |
|
21 |
|
22 |
-
- **Developed by:**
|
23 |
-
- **
|
24 |
-
- **
|
25 |
-
- **Language(s) (NLP):** [More Information Needed]
|
26 |
-
- **License:** [More Information Needed]
|
27 |
-
- **Finetuned from model [optional]:** [More Information Needed]
|
28 |
|
29 |
-
### Model Sources
|
30 |
|
31 |
<!-- Provide the basic links for the model. -->
|
32 |
|
33 |
-
- **Repository:**
|
34 |
-
- **
|
35 |
-
- **Demo [optional]:** [More Information Needed]
|
36 |
|
37 |
## Uses
|
38 |
|
@@ -53,14 +49,17 @@ This modelcard aims to be a base template for new models. It has been generated
|
|
53 |
### Out-of-Scope Use
|
54 |
|
55 |
<!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
|
56 |
-
|
57 |
-
|
58 |
|
59 |
## Bias, Risks, and Limitations
|
60 |
|
61 |
<!-- This section is meant to convey both technical and sociotechnical limitations. -->
|
62 |
-
|
63 |
-
|
64 |
|
65 |
### Recommendations
|
66 |
|
@@ -88,12 +87,21 @@ Use the code below to get started with the model.
|
|
88 |
|
89 |
#### Preprocessing [optional]
|
90 |
|
91 |
-
|
92 |
-
|
93 |
|
94 |
#### Training Hyperparameters
|
95 |
|
96 |
-
- **
|
97 |
|
98 |
#### Speeds, Sizes, Times [optional]
|
99 |
|
@@ -111,7 +119,8 @@ Use the code below to get started with the model.
|
|
111 |
|
112 |
<!-- This should link to a Data Card if possible. -->
|
113 |
|
114 |
-
|
115 |
|
116 |
#### Factors
|
117 |
|
@@ -133,25 +142,8 @@ Use the code below to get started with the model.
|
|
133 |
|
134 |
|
135 |
|
136 |
-
## Model Examination [optional]
|
137 |
-
|
138 |
-
<!-- Relevant interpretability work for the model goes here -->
|
139 |
-
|
140 |
-
[More Information Needed]
|
141 |
|
142 |
-
##
|
143 |
-
|
144 |
-
<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
|
145 |
-
|
146 |
-
Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
|
147 |
-
|
148 |
-
- **Hardware Type:** [More Information Needed]
|
149 |
-
- **Hours used:** [More Information Needed]
|
150 |
-
- **Cloud Provider:** [More Information Needed]
|
151 |
-
- **Compute Region:** [More Information Needed]
|
152 |
-
- **Carbon Emitted:** [More Information Needed]
|
153 |
-
|
154 |
-
## Technical Specifications [optional]
|
155 |
|
156 |
### Model Architecture and Objective
|
157 |
|
@@ -159,19 +151,20 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
|
|
159 |
|
160 |
### Compute Infrastructure
|
161 |
|
162 |
-
|
163 |
|
164 |
#### Hardware
|
165 |
|
166 |
-
|
167 |
|
168 |
#### Software
|
169 |
|
170 |
-
|
171 |
|
172 |
-
## Citation
|
173 |
|
174 |
<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
|
175 |
|
176 |
**BibTeX:**
|
177 |
|
@@ -191,12 +184,12 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
|
|
191 |
|
192 |
[More Information Needed]
|
193 |
|
194 |
-
## Model Card Authors
|
195 |
|
196 |
-
|
197 |
|
198 |
## Model Card Contact
|
199 |
|
200 |
-
|
201 |
|
202 |
|
|
|
5 |
metrics:
|
6 |
- accuracy
|
7 |
---
|
8 |
+
# Model Card for KorSciDeBERTa
|
9 |
|
10 |
<!-- Provide a quick summary of what the model is/does. -->
|
11 |
|
12 |
+
KorSciDeBERTa is a model based on the Microsoft DeBERTa architecture, pretrained on a total of 146 GB of corpora comprising papers, NTIS research projects, patents, news, and Korean Wikipedia. The pretrained model can be used for masked language modeling or next sentence prediction, and it can also be fine-tuned for downstream tasks such as sentence classification, token classification, or question answering.
|
13 |
|
14 |
## Model Details
|
15 |
|
|
|
19 |
|
20 |
|
21 |
|
22 |
+
- **Developed by:** KISTI
|
23 |
+
- **Model type:** deberta-v2
|
24 |
+
- **Language(s) (NLP):** Korean (ko)
|
25 |
|
26 |
+
### Model Sources
|
27 |
|
28 |
<!-- Provide the basic links for the model. -->
|
29 |
|
30 |
+
- **Repository 1:** https://huggingface.co/kisti/korscideberta
|
31 |
+
- **Repository 2:** https://aida.kisti.re.kr/
|
32 |
|
33 |
## Uses
|
34 |
|
|
|
49 |
### Out-of-Scope Use
|
50 |
|
51 |
<!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
|
52 |
+
This model must not be used to intentionally create hostile or alienating environments for people.
|
53 |
+
This model must not be used in "high-stakes settings". It was not designed to make important decisions about people or things, and its output may not be factual.
|
54 |
+
"High-stakes settings" include the following:
|
55 |
+
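Use in the medical, political, legal, or financial domains; evaluating individuals for employment, education, or credit; automatically making important decisions; generating (fake) facts; generating summaries that must be highly reliable; generating predictions that must always be correct; and similar uses.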
|
56 |
|
57 |
## Bias, Risks, and Limitations
|
58 |
|
59 |
<!-- This section is meant to convey both technical and sociotechnical limitations. -->
|
60 |
+
For research purposes, only corpus data free of copyright issues was used. Users of this model should be aware of the risk factors below.
|
61 |
+
Although the corpora used are mostly neutral in character, the nature of language models means the model may still contain some of the following ethics-related elements:
|
62 |
+
Over- or under-representation of particular viewpoints; stereotypes; personal information; hateful, abusive, or violent language; discriminatory or prejudiced language; irrelevant or repetitive output; and similar issues.
|
63 |
|
64 |
### Recommendations
|
65 |
|
|
|
87 |
|
88 |
#### Preprocessing [optional]
|
89 |
|
90 |
+
- Science and technology domain tokenizer (KorSci Tokenizer)
|
91 |
+
- The corpus was preprocessed with a tokenizer that merges the existing SentencePiece-BPE with the [Mecab-ko Tokenizer](https://bitbucket.org/eunjeon/mecab-ko/src/master/), extended with a user dictionary of about 6 million nouns and compound nouns built from the corpus used for this pretrained model.
|
92 |
+
- Total 128,100 words
|
93 |
+
- Included special tokens ( <unk>, <cls>, <s>, <mask> )
|
94 |
+
- File name : spm.model, vocab.txt
|
95 |
|
96 |
#### Training Hyperparameters
|
97 |
|
98 |
+
- **model_size:** base
|
99 |
+
- **num_train_steps:** 1,600,000
|
100 |
+
- **train_batch_size:** 4,096 × 4 gradient accumulation steps = 16,384
|
101 |
+
- **learning_rate:** 1e-4
|
102 |
+
- **max_seq_length:** 512
|
103 |
+
- **vocab_size:** 128,100
|
104 |
+
- **Training regime:** fp16 mixed precision <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
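The batch-size entry in the list above can be checked with a little arithmetic. This is a minimal sketch that assumes "4,096 * 4 accumulative update" means four gradient-accumulation passes of 4,096 sequences per optimizer update; the variable names are illustrative and are not taken from the training code.

```python
# Values copied from the hyperparameter list; the names are hypothetical.
micro_batch_size = 4096      # sequences per forward/backward pass (assumed meaning)
accumulation_steps = 4       # passes accumulated before each optimizer update
max_seq_length = 512         # maximum tokens per sequence

# Sequences contributing to a single optimizer update.
effective_batch_size = micro_batch_size * accumulation_steps

# Upper bound on tokens per update, reached only if every sequence is full length.
max_tokens_per_update = effective_batch_size * max_seq_length

print(effective_batch_size)
print(max_tokens_per_update)
```

The token count is an upper bound only, since the card does not say how sequences shorter than 512 tokens are handled.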
|
105 |
|
106 |
#### Speeds, Sizes, Times [optional]
|
107 |
|
|
|
119 |
|
120 |
<!-- This should link to a Data Card if possible. -->
|
121 |
|
122 |
+
This language model was evaluated by fine-tuning it on the task of classifying research project reports under the science and technology standard classification. The results are given below.
|
123 |
+
- Research project report science and technology standard classification evaluation dataset (doi.org/10.23057/50): 145 classes, 209,454 training examples, 89,767 test examples
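For orientation, the dataset sizes quoted above imply the split proportions sketched here. The snippet only restates the published counts; the variable names are illustrative, and the per-class average assumes nothing about how the classes are actually balanced.

```python
# Counts as reported for the evaluation dataset; names are hypothetical.
num_classes = 145
train_examples = 209_454
test_examples = 89_767

total_examples = train_examples + test_examples
test_fraction = test_examples / total_examples

# Mean number of training examples per class (real classes may be imbalanced).
mean_train_per_class = train_examples / num_classes

print(total_examples)
print(round(test_fraction, 3))
print(round(mean_train_per_class, 1))
```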
|
124 |
|
125 |
#### Factors
|
126 |
|
|
|
142 |
|
143 |
|
144 |
|
145 |
|
146 |
+
## Technical Specifications
|
147 |
|
148 |
### Model Architecture and Objective
|
149 |
|
|
|
151 |
|
152 |
### Compute Infrastructure
|
153 |
|
154 |
+
KISTI National Supercomputing Center NEURON system. HPE ClusterStor E1000, Lustre, Slurm
|
155 |
|
156 |
#### Hardware
|
157 |
|
158 |
+
24 × NVIDIA A100 80 GB GPUs
|
159 |
|
160 |
#### Software
|
161 |
|
162 |
+
Python 3.9, CUDA 11.8, PyTorch 1.10
|
163 |
|
164 |
+
## Citation
|
165 |
|
166 |
<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
|
167 |
+
Korea Institute of Science and Technology Information (2023). Korean science and technology domain DeBERTa pretrained model (KorSciDeBERTa). Version 1.0. Korea Institute of Science and Technology Information.
|
168 |
|
169 |
**BibTeX:**
|
170 |
|
|
|
184 |
|
185 |
[More Information Needed]
|
186 |
|
187 |
+
## Model Card Authors
|
188 |
|
189 |
+
김경민, 김은희, 김성찬. AI Data Research Division, Korea Institute of Science and Technology Information (KISTI)
|
190 |
|
191 |
## Model Card Contact
|
192 |
|
193 |
+
김경민, [email protected]
|
194 |
|
195 |
|