Pranabit commited on
Commit
b260baa
1 Parent(s): 95dd3a9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -20,7 +20,7 @@ This model is a fine-tuned version of [bigcode/starcoder2-3b](https://huggingfac
20
 
21
  ## Model description
22
 
23
- More information needed
24
 
25
  ## Intended uses & limitations
26
 
 
20
 
21
  ## Model description
22
 
23
+ StarCoder2-3B model is a 3B parameter model trained on 17 programming languages from The Stack v2, with opt-out requests excluded. The model uses Grouped Query Attention, a context window of 16,384 tokens with a sliding window attention of 4,096 tokens, and was trained using the Fill-in-the-Middle objective on 3+ trillion tokens.
24
 
25
  ## Intended uses & limitations
26