zhongwang commited on
Commit
2bc4e14
·
verified ·
1 Parent(s): ea7185c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -0
README.md CHANGED
@@ -1,5 +1,14 @@
1
  ---
2
  license: bsd
 
 
 
 
 
 
 
 
 
3
  ---
4
 
5
  This is GenomeOcean-bgcFM, a model finetuned on biosynthetic gene clusters (BGC) sequences. It is built on GenomeOcean-4B. It is trained with Causal Language Modeling (CLM) and uses a BPE tokenizer with 4096 tokens. It supports a maximum sequence length of 10240 tokens (~50kbp).
 
1
  ---
2
  license: bsd
3
+ tags:
4
+ - biology
5
+ - genomics
6
+ - metagenomics
7
+ - DNA
8
+ - microbiome
9
+ - biosynthetic gene clusters
10
+ base_model:
11
+ - pGenomeOcean/GenomeOcean-4B
12
  ---
13
 
14
  This is GenomeOcean-bgcFM, a model finetuned on biosynthetic gene clusters (BGC) sequences. It is built on GenomeOcean-4B. It is trained with Causal Language Modeling (CLM) and uses a BPE tokenizer with 4096 tokens. It supports a maximum sequence length of 10240 tokens (~50kbp).