loodvanniekerkginkgo commited on
Commit
0de3557
·
1 Parent(s): 203f3f3

Added more information on mAbs

Browse files
Files changed (1) hide show
  1. about.py +4 -1
about.py CHANGED
@@ -20,7 +20,7 @@ Here we invite the community to submit and develop better predictors, which will
20
  #### 🏆 Prizes
21
 
22
  For each of the 5 properties in the competition, there is a prize for the model with the highest performance for that property on the private test set.
23
- There is also an 'open-source' prize for the best model trained on the GDPa1 dataset (reporting cross-validation results) and assessed on the private test set where authors provide all training code and data.
24
  For each of these 6 prizes, participants have the choice between **$10k in data generation credits** with [Ginkgo Datapoints](https://datapoints.ginkgo.bio/) or a **cash prize** with a value of $2000.
25
 
26
  See the "{FAQ_TAB_NAME}" tab above (you are currently on the "{ABOUT_TAB_NAME}" tab) or the [competition terms]({TERMS_URL}) for more details.
@@ -71,6 +71,9 @@ FAQS = {
71
  "Note: These assays may differ from previously published methods, and these correlations between literature data and experimental data are also described in the preprint. "
72
  "These same methods are used to generate the heldout test data."
73
  ),
 
 
 
74
  "How were the heldout sequences designed?": (
75
  "We sampled 80 paired antibody sequences from [OAS](https://opig.stats.ox.ac.uk/webapps/oas/). We tried to represent the range of germline variants, sequence identities to germline, and CDR3 lengths. "
76
  "The sequences in the dataset are quite diverse as measured by pairwise sequence identity."
 
20
  #### 🏆 Prizes
21
 
22
  For each of the 5 properties in the competition, there is a prize for the model with the highest performance for that property on the private test set.
23
+ There is also an 'open-source' prize for the best model trained on the GDPa1 dataset of monoclonal antibodies (reporting cross-validation results) and assessed on the private test set where authors provide all training code and data.
24
  For each of these 6 prizes, participants have the choice between **$10k in data generation credits** with [Ginkgo Datapoints](https://datapoints.ginkgo.bio/) or a **cash prize** with a value of $2000.
25
 
26
  See the "{FAQ_TAB_NAME}" tab above (you are currently on the "{ABOUT_TAB_NAME}" tab) or the [competition terms]({TERMS_URL}) for more details.
 
71
  "Note: These assays may differ from previously published methods, and these correlations between literature data and experimental data are also described in the preprint. "
72
  "These same methods are used to generate the heldout test data."
73
  ),
74
+ "What do the datasets contain?": (
75
+ "Both the GDPa1 and heldout test set contain the VH and VL sequences, as well as the full heavy chain sequence. The GDPa1 dataset is a mix of IgG1, IgG2, and IgG4 antibodies while the heldout test set only contains IgG1 antibodies. We also include the light chain subtype (lambda or kappa)."
76
+ ),
77
  "How were the heldout sequences designed?": (
78
  "We sampled 80 paired antibody sequences from [OAS](https://opig.stats.ox.ac.uk/webapps/oas/). We tried to represent the range of germline variants, sequence identities to germline, and CDR3 lengths. "
79
  "The sequences in the dataset are quite diverse as measured by pairwise sequence identity."