CreitinGameplays's picture
Update README.md
bf43f6a verified
|
raw
history blame
2.21 kB
---
license: mit
datasets:
- Xilabs/instructmix
- CreitinGameplays/small-chat-assistant-for-bloom
- sahil2801/CodeAlpaca-20k
language:
- en
---
## BLOOM 3b Fine-tuned for Chat Assistant
**Model Name:** BLOOM 3b
**Model Architecture:** bloom
**Short Description:** This model is a fine-tuned version of the BLOOM 3b large language model, focusing on conversational interactions between a user and an AI assistant. The fine-tuning process leveraged two datasets: "vicgalle/alpaca-gpt4" and "CreitinGameplays/small-chat-assistant-for-bloom". These datasets provided examples of question-and-answer exchanges and dialogues between users and AI assistants.
**Intended Use:** This model is intended for research purposes and exploration of conversational AI applications. It can be used for tasks like:
* Generating responses to user prompts in a chat assistant setting.
* Creating examples of chatbot interactions for further development.
* Studying the capabilities of large language models for conversation.
**Limitations:**
* **Fine-tuning Focus:** The model's performance is optimized for the specific format and context of the fine-tuning data. It may not generalize well to significantly different conversation styles or topics.
* **Potential Biases:** The model may inherit biases from the training data. It's important to be aware of these potential biases and use the model responsibly.
* **Limited Factual Accuracy:** Large language models are still under development and may generate responses that are not entirely factually accurate. It's important to verify information generated by the model with other sources.
**Specific Input Format:**
The model was fine-tuned using a specific input format that separates the system prompt, user prompt, and assistant response with special tokens:
```
<|system|> {system prompt} </s> <|prompter|> {user prompt} </s> <|assistant|> </s> {model prediction} ```
Using this format when interacting with the model can improve its performance and generate more relevant responses.
**Disclaimer:** This model is for research and exploration purposes only. It should not be used in any applications that require high levels of accuracy or reliability.