Update README.md
Browse files
README.md
CHANGED
@@ -20,7 +20,7 @@ This model is a fine-tuned version of [bigcode/starcoder2-3b](https://huggingfac
|
|
20 |
|
21 |
## Model description
|
22 |
|
23 |
-
|
24 |
|
25 |
## Intended uses & limitations
|
26 |
|
|
|
20 |
|
21 |
## Model description
|
22 |
|
23 |
+
StarCoder2-3B model is a 3B parameter model trained on 17 programming languages from The Stack v2, with opt-out requests excluded. The model uses Grouped Query Attention, a context window of 16,384 tokens with a sliding window attention of 4,096 tokens, and was trained using the Fill-in-the-Middle objective on 3+ trillion tokens.
|
24 |
|
25 |
## Intended uses & limitations
|
26 |
|