prithivMLmods
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -9,6 +9,7 @@ tags:
|
|
9 |
- trl
|
10 |
- llama3.2
|
11 |
- Reinforcement learning
|
|
|
12 |
---
|
13 |
# **Bellatrix-Tiny-3B-R1**
|
14 |
|
@@ -65,4 +66,4 @@ Despite its capabilities, Bellatrix has some limitations:
|
|
65 |
2. **Dependence on Training Data**: It is only as good as the quality and diversity of its training data, which may lead to biases or inaccuracies.
|
66 |
3. **Computational Resources**: The model’s optimized transformer architecture can be resource-intensive, requiring significant computational power for fine-tuning and inference.
|
67 |
4. **Language Coverage**: While multilingual, some languages or dialects may have limited support or lower performance compared to widely used ones.
|
68 |
-
5. **Real-World Contexts**: It may struggle with understanding nuanced or ambiguous real-world scenarios not covered during training.
|
|
|
9 |
- trl
|
10 |
- llama3.2
|
11 |
- Reinforcement learning
|
12 |
+
- SFT
|
13 |
---
|
14 |
# **Bellatrix-Tiny-3B-R1**
|
15 |
|
|
|
66 |
2. **Dependence on Training Data**: It is only as good as the quality and diversity of its training data, which may lead to biases or inaccuracies.
|
67 |
3. **Computational Resources**: The model’s optimized transformer architecture can be resource-intensive, requiring significant computational power for fine-tuning and inference.
|
68 |
4. **Language Coverage**: While multilingual, some languages or dialects may have limited support or lower performance compared to widely used ones.
|
69 |
+
5. **Real-World Contexts**: It may struggle with understanding nuanced or ambiguous real-world scenarios not covered during training.
|