---
license: mit
datasets:
- Xilabs/instructmix
- CreitinGameplays/small-chat-assistant-for-bloom
- sahil2801/CodeAlpaca-20k
language:
- en
---

## BLOOM 3b Fine-tuned for Chat Assistant

**Model Name:** BLOOM 3b

**Model Architecture:** bloom

**Short Description:** This model is a fine-tuned version of the BLOOM 3b large language model, tuned for conversational exchanges between a user and an AI assistant. Fine-tuning used two datasets, "vicgalle/alpaca-gpt4" and "CreitinGameplays/small-chat-assistant-for-bloom", which provide examples of question-and-answer exchanges and dialogues between users and AI assistants.

**Intended Use:** This model is intended for research and for exploring conversational AI applications. It can be used for tasks such as:

* Generating responses to user prompts in a chat assistant setting.
* Creating examples of chatbot interactions for further development.
* Studying the capabilities of large language models for conversation.


**Limitations:**

* **Fine-tuning Focus:** The model is tuned to the format and context of its fine-tuning data. It may not generalize well to significantly different conversation styles or topics.
* **Potential Biases:** The model may inherit biases from its training data. Be aware of these potential biases and use the model responsibly.
* **Limited Factual Accuracy:** Like other large language models, this model may generate responses that are factually inaccurate. Verify information it generates against other sources.


**Specific Input Format:**

The model was fine-tuned using a specific input format that separates the system prompt, user prompt, and assistant response with special tokens:

```
<|system|> {system prompt} </s> <|prompter|> {user prompt} </s> <|assistant|> </s> {model prediction}
```

Using this format when prompting the model matches its fine-tuning data and can yield more relevant responses.
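
As a rough illustration, the template above can be assembled and the reply recovered with plain string handling. This is a minimal sketch, not part of the model's own tooling: the helper names `build_prompt` and `extract_reply` are hypothetical, and the exact whitespace around each token is assumed from the template as written.

```python
# Token strings copied from the input format shown above.
SYSTEM = "<|system|>"
PROMPTER = "<|prompter|>"
ASSISTANT = "<|assistant|>"
EOS = "</s>"


def build_prompt(system_prompt: str, user_prompt: str) -> str:
    """Return the text to feed the model, ending where its prediction begins.

    Hypothetical helper: single spaces between tokens are an assumption
    taken from the template, not a documented requirement.
    """
    return (
        f"{SYSTEM} {system_prompt} {EOS} "
        f"{PROMPTER} {user_prompt} {EOS} "
        f"{ASSISTANT} {EOS}"
    )


def extract_reply(generated_text: str) -> str:
    """Return the text after the last assistant marker, with </s> removed.

    Hypothetical helper: assumes the decoded output still contains the
    special tokens as plain text.
    """
    reply = generated_text.rsplit(ASSISTANT, 1)[-1]
    return reply.replace(EOS, "").strip()


prompt = build_prompt("You are a helpful assistant.", "What is BLOOM?")
print(prompt)
```

The resulting string would be passed to whatever tokenizer and generation call you use to run the model. `extract_reply` can then be applied to the decoded output to keep only the assistant's turn.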

**Disclaimer:** This model is for research and exploration purposes only. It should not be used in applications that require high accuracy or reliability.