01Matrix commited on
Commit
a633cd1
·
verified ·
1 Parent(s): 2bb268e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +217 -3
README.md CHANGED
@@ -1,3 +1,217 @@
1
- ---
2
- license: llama3.1
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: llama3.1
3
+ datasets:
4
+ - BAAI/OPI
5
+ language:
6
+ - en
7
+ pipeline_tag: text-generation
8
+ tags:
9
+ - Life Science
10
+ - AI4Science
11
+ - Biology
12
+ - Protein
13
+ - LLM
14
+ - Instruction
15
+ base_model: meta-llama/Meta-Llama-3.1-8B-Instruct
16
+ ---
17
+ ![OPI_logo](./OPI_logo.png)
18
+
19
+ # Model Card of OPI-Llama-3.1-8B-Instruct
20
+ OPI-Llama-3.1-8B-Instruct was fine-tuned from the Meta-Llama-3.1-8B-Instruct model using the complete OPI dataset (i.e.,[OPI_full_1.61M.json](https://huggingface.co/datasets/BAAI/OPI/blob/main/OPI_DATA/OPI_full_1.61M_train.json)).
21
+ For more details of training and testing, please visit [https://github.com/baaihealth/opi](https://github.com/baaihealth/opi).
22
+
23
+ ![Overview](./OPI_experiment_outline.png)
24
+
25
+ # Evaluation of OPI-Llama-3.1-8B-Instruct on 9 tasks
26
+ Each testing result is derived from the Meta-Llama-3.1-8B-Instruct model that has been fine-tuned using [OPI_full_1.61M.json](https://huggingface.co/datasets/BAAI/OPI/blob/main/OPI_DATA/OPI_full_1.61M_train.json) and subsequently evaluated on the respective testing set for each specific task.
27
+
28
+ <table border="1" style="text-align:center; border-collapse:collapse; width: 100%;">
29
+ <thead>
30
+ <tr>
31
+ <th style="text-align:center;">Task Type</th>
32
+ <th style="text-align:center;">Task Name</th>
33
+ <th style="text-align:center;">Testing file</th>
34
+ <th style="text-align:center;">Accuracy</th>
35
+ <th style="text-align:center;">Precision</th>
36
+ <th style="text-align:center;">Recall</th>
37
+ <th style="text-align:center;">F1</th>
38
+ <th style="text-align:center;">Rouge-L</th>
39
+ </tr>
40
+ </thead>
41
+ <tbody>
42
+ <tr>
43
+ <td rowspan="6">Sequence Understanding</td>
44
+ <td rowspan="2">EC Number Prediction (split100)</td>
45
+ <td>CLEAN_EC_number_new_test</td>
46
+ <td>-</td>
47
+ <td>0.3724</td>
48
+ <td>0.3374</td>
49
+ <td>0.3468</td>
50
+ <td>-</td>
51
+ </tr>
52
+ <tr>
53
+ <td>CLEAN_EC_number_price_test</td>
54
+ <td>-</td>
55
+ <td>0.0738</td>
56
+ <td>0.0738</td>
57
+ <td>0.0738</td>
58
+ <td>-</td>
59
+ </tr>
60
+ <tr>
61
+ <td rowspan="3">Fold Type Prediction</td>
62
+ <td>fold_type_test_Fold_Holdout</td>
63
+ <td>0.1045</td>
64
+ <td>-</td>
65
+ <td>-</td>
66
+ <td>-</td>
67
+ <td>-</td>
68
+ </tr>
69
+ <tr>
70
+ <td>fold_type_test_Superfamily_Holdout</td>
71
+ <td>0.1507</td>
72
+ <td>-</td>
73
+ <td>-</td>
74
+ <td>-</td>
75
+ <td>-</td>
76
+ </tr>
77
+ <tr>
78
+ <td>fold_type_test_Family_Holdout</td>
79
+ <td>0.6145</td>
80
+ <td>-</td>
81
+ <td>-</td>
82
+ <td>-</td>
83
+ <td>-</td>
84
+ </tr>
85
+ <tr>
86
+ <td>Subcellular Localization Prediction</td>
87
+ <td>subcell_loc_test</td>
88
+ <td>0.4214</td>
89
+ <td>-</td>
90
+ <td>-</td>
91
+ <td>-</td>
92
+ <td>-</td>
93
+ </tr>
94
+ <tr>
95
+ <td rowspan="9">Annotation Prediction</td>
96
+ <td rowspan="3">Function Keywords Prediction</td>
97
+ <td>CASPSimilarSeq_keywords_test</td>
98
+ <td>-</td>
99
+ <td>0.4202</td>
100
+ <td>0.5057</td>
101
+ <td>0.4385</td>
102
+ <td>-</td>
103
+ </tr>
104
+ <tr>
105
+ <td>IDFilterSeq_keywords_test</td>
106
+ <td>-</td>
107
+ <td>0.6762</td>
108
+ <td>0.6905</td>
109
+ <td>0.6650</td>
110
+ <td>-</td>
111
+ </tr>
112
+ <tr>
113
+ <td>UniProtSeq_keywords_test</td>
114
+ <td>-</td>
115
+ <td>0.7606</td>
116
+ <td>0.7489</td>
117
+ <td>0.7374</td>
118
+ <td>-</td>
119
+ </tr>
120
+ <tr>
121
+ <td rowspan="3">Gene Ontology(GO) Terms Prediction</td>
122
+ <td>CASPSimilarSeq_go_terms_test</td>
123
+ <td>-</td>
124
+ <td>0.1113</td>
125
+ <td>0.0936</td>
126
+ <td>0.099</td>
127
+ <td>-</td>
128
+ </tr>
129
+ <tr>
130
+ <td>IDFilterSeq_go_terms_test</td>
131
+ <td>-</td>
132
+ <td>0.6686</td>
133
+ <td>0.6287</td>
134
+ <td>0.6304</td>
135
+ <td>-</td>
136
+ </tr>
137
+ <tr>
138
+ <td>UniProtSeq_go_terms_test</td>
139
+ <td>-</td>
140
+ <td>0.7150</td>
141
+ <td>0.6897</td>
142
+ <td>0.6849</td>
143
+ <td>-</td>
144
+ </tr>
145
+ <tr>
146
+ <td rowspan="3">Function Description Prediction</td>
147
+ <td>CASPSimilarSeq_function_test</td>
148
+ <td>-</td>
149
+ <td>-</td>
150
+ <td>-</td>
151
+ <td>-</td>
152
+ <td>0.7524</td>
153
+ </tr>
154
+ <tr>
155
+ <td>IDFilterSeq_function_test</td>
156
+ <td>-</td>
157
+ <td>-</td>
158
+ <td>-</td>
159
+ <td>-</td>
160
+ <td>0.4786</td>
161
+ </tr>
162
+ <tr>
163
+ <td>UniProtSeq_function_test</td>
164
+ <td>-</td>
165
+ <td>-</td>
166
+ <td>-</td>
167
+ <td>-</td>
168
+ <td>0.5144</td>
169
+ </tr>
170
+ <tr>
171
+ <td rowspan="3">Knowledge Mining</td>
172
+ <td>Tissue Location Prediction from Gene Symbol</td>
173
+ <td>gene_symbol_to_tissue_test</td>
174
+ <td>-</td>
175
+ <td>0.4002</td>
176
+ <td>0.9356</td>
177
+ <td>0.5466</td>
178
+ <td>-</td>
179
+ </tr>
180
+ <tr>
181
+ <td>Cancer Prediction from Gene Symbol</td>
182
+ <td>gene_symbol_to_cancer_test</td>
183
+ <td>-</td>
184
+ <td>0.2890</td>
185
+ <td>0.2701</td>
186
+ <td>0.2664</td>
187
+ <td>-</td>
188
+ </tr>
189
+ <tr>
190
+ <td>Cancer Prediction from Gene Name</td>
191
+ <td>gene_name_to_cancer_test</td>
192
+ <td>-</td>
193
+ <td>0.2786</td>
194
+ <td>0.2707</td>
195
+ <td>0.2659</td>
196
+ <td>-</td>
197
+ </tr>
198
+ </tbody>
199
+ </table>
200
+
201
+ # Prediction comparison with SOTA mdoels
202
+
203
+ ![model_compare](model_compare/task1_EC_number.png)
204
+ ![model_compare](model_compare/task2_fold_type.png)
205
+ ![model_compare](model_compare/task3_subcell_loc.png)
206
+ ![model_compare](model_compare/task4_keywords.png)
207
+ ![model_compare](model_compare/task5_GO.png)
208
+ ![model_compare](model_compare/task6_function.png)
209
+ ![model_compare](model_compare/task7_gsymbol2tissue.png)
210
+ ![model_compare](model_compare/task8_gsymbol2cancer.png)
211
+ ![model_compare](model_compare/task9_gname2cancer.png)
212
+
213
+
214
+ # Demo
215
+ We use the [FastChat](https://github.com/lm-sys/FastChat) platform to visually demonstrate the ability of OPI_full_Galactica-6.7B model on various evaluation tasks.
216
+
217
+ ![OPI Demo](./OPI_demo.gif)