license: mit
datasets:
- Xilabs/instructmix
- CreitinGameplays/small-chat-assistant-for-bloom
- sahil2801/CodeAlpaca-20k
language:
- en
BLOOM 3b Fine-tuned for Chat Assistant
Model Name: BLOOM 3b
Model Architecture: bloom
Short Description: This model is a fine-tuned version of the BLOOM 3b large language model, focusing on conversational interactions between a user and an AI assistant. The fine-tuning process leveraged two datasets: "vicgalle/alpaca-gpt4" and "CreitinGameplays/small-chat-assistant-for-bloom". These datasets provided examples of question-and-answer exchanges and dialogues between users and AI assistants.
Intended Use: This model is intended for research purposes and exploration of conversational AI applications. It can be used for tasks like:
- Generating responses to user prompts in a chat assistant setting.
- Creating examples of chatbot interactions for further development.
- Studying the capabilities of large language models for conversation.
Limitations:
- Fine-tuning Focus: The model's performance is optimized for the specific format and context of the fine-tuning data. It may not generalize well to significantly different conversation styles or topics.
- Potential Biases: The model may inherit biases from the training data. It's important to be aware of these potential biases and use the model responsibly.
- Limited Factual Accuracy: Large language models are still under development and may generate responses that are not entirely factually accurate. It's important to verify information generated by the model with other sources.
Specific Input Format:
The model was fine-tuned using a specific input format that separates the system prompt, user prompt, and assistant response with special tokens:
<|system|> {system prompt} </s> <|prompter|> {user prompt} </s> <|assistant|> </s> {model prediction} ```
Using this format when interacting with the model can improve its performance and generate more relevant responses.
**Disclaimer:** This model is for research and exploration purposes only. It should not be used in any applications that require high levels of accuracy or reliability.