prithivMLmods commited on
Commit
7813dfb
·
verified ·
1 Parent(s): f539049

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -9,6 +9,7 @@ tags:
9
  - trl
10
  - llama3.2
11
  - Reinforcement learning
 
12
  ---
13
  # **Bellatrix-Tiny-3B-R1**
14
 
@@ -65,4 +66,4 @@ Despite its capabilities, Bellatrix has some limitations:
65
  2. **Dependence on Training Data**: It is only as good as the quality and diversity of its training data, which may lead to biases or inaccuracies.
66
  3. **Computational Resources**: The model’s optimized transformer architecture can be resource-intensive, requiring significant computational power for fine-tuning and inference.
67
  4. **Language Coverage**: While multilingual, some languages or dialects may have limited support or lower performance compared to widely used ones.
68
- 5. **Real-World Contexts**: It may struggle with understanding nuanced or ambiguous real-world scenarios not covered during training.
 
9
  - trl
10
  - llama3.2
11
  - Reinforcement learning
12
+ - SFT
13
  ---
14
  # **Bellatrix-Tiny-3B-R1**
15
 
 
66
  2. **Dependence on Training Data**: It is only as good as the quality and diversity of its training data, which may lead to biases or inaccuracies.
67
  3. **Computational Resources**: The model’s optimized transformer architecture can be resource-intensive, requiring significant computational power for fine-tuning and inference.
68
  4. **Language Coverage**: While multilingual, some languages or dialects may have limited support or lower performance compared to widely used ones.
69
+ 5. **Real-World Contexts**: It may struggle with understanding nuanced or ambiguous real-world scenarios not covered during training.