Update README.md
Browse files
README.md
CHANGED
@@ -10,7 +10,7 @@ license: apache-2.0
|
|
10 |
|
11 |
# BiMediX: Bilingual Medical Mixture of Experts LLM
|
12 |
|
13 |
-
Welcome to the official HuggingFace repository for BiMediX, the bilingual medical Large Language Model (LLM) designed for English and Arabic interactions. BiMediX facilitates a broad range of medical interactions
|
14 |
|
15 |
## Key Features
|
16 |
|
@@ -18,7 +18,7 @@ Welcome to the official HuggingFace repository for BiMediX, the bilingual medica
|
|
18 |
- **BiMed1.3M Dataset**: Unique dataset with 1.3 million bilingual medical interactions across English and Arabic, including 250k synthesized multi-turn doctor-patient chats for instruction tuning.
|
19 |
- **High-Quality Translation** : Utilizes a semi-automated English-to-Arabic translation pipeline with human refinement to ensure accuracy and quality in translations.
|
20 |
- **Evaluation Benchmark for Arabic Medical LLMs**: Comprehensive benchmark for evaluating Arabic medical language models, setting a new standard in the field.
|
21 |
-
- **State-of-the-Art Performance**:
|
22 |
|
23 |
|
24 |
## Getting Started
|
@@ -53,18 +53,10 @@ print(tokenizer.decode(outputs[0], skip_special_tokens=True))
|
|
53 |
|
54 |
## Limitations and Ethical Considerations
|
55 |
|
56 |
-
Users are urged to employ BiMediX responsibly, especially when applying its outputs in real-world medical scenarios.
|
57 |
It is imperative to verify the model's advice with qualified healthcare professionals and not to rely on AI for medical diagnoses or treatment decisions.
|
58 |
Despite the overall advancements BiMediX brings to the field of medical NLP, it shares common challenges with other language models,
|
59 |
-
including hallucinations, toxicity, and stereotypes.
|
60 |
-
|
61 |
-
### Recognized Limitations
|
62 |
-
|
63 |
-
- **Hallucinations**: BiMediX, like many advanced LLMs, can sometimes generate information that, while plausible, may not be factually accurate or relevant to the given context.
|
64 |
-
- **Toxicity and Stereotypes**: The model may inadvertently propagate stereotypes or produce responses that could be considered toxic, reflecting biases present in the training data.
|
65 |
-
- **Accuracy of Medical Diagnoses**: While we have conducted both automatic and qualitative evaluations with medical professionals, we must acknowledge that BiMediX's medical diagnoses and recommendations are not infallible. Users should treat the model's output as informational rather than definitive medical advice.
|
66 |
-
|
67 |
-
We are committed to continuous improvement and ethical AI research, and we look forward to collaborating with the community to address these challenges.
|
68 |
|
69 |
## License and Citation
|
70 |
|
@@ -86,5 +78,4 @@ If you use BiMediX in your research, please cite our work as follows:
|
|
86 |
}
|
87 |
```
|
88 |
|
89 |
-
|
90 |
-
Visit our [GitHub](#) for more information and resources.
|
|
|
10 |
|
11 |
# BiMediX: Bilingual Medical Mixture of Experts LLM
|
12 |
|
13 |
+
Welcome to the official HuggingFace repository for BiMediX, the bilingual medical Large Language Model (LLM) designed for English and Arabic interactions. BiMediX facilitates a broad range of **medical interactions**, including multi-turn chats, multiple-choice Q&A, and open-ended question answering.
|
14 |
|
15 |
## Key Features
|
16 |
|
|
|
18 |
- **BiMed1.3M Dataset**: Unique dataset with 1.3 million bilingual medical interactions across English and Arabic, including 250k synthesized multi-turn doctor-patient chats for instruction tuning.
|
19 |
- **High-Quality Translation** : Utilizes a semi-automated English-to-Arabic translation pipeline with human refinement to ensure accuracy and quality in translations.
|
20 |
- **Evaluation Benchmark for Arabic Medical LLMs**: Comprehensive benchmark for evaluating Arabic medical language models, setting a new standard in the field.
|
21 |
+
- **State-of-the-Art Performance**: Outperforms existing models in medical benchmarks, while 8-times faster than comparable existing models.
|
22 |
|
23 |
|
24 |
## Getting Started
|
|
|
53 |
|
54 |
## Limitations and Ethical Considerations
|
55 |
|
56 |
+
**This release, intended for research, is not ready for clinical or commercial use.** Users are urged to employ BiMediX responsibly, especially when applying its outputs in real-world medical scenarios.
|
57 |
It is imperative to verify the model's advice with qualified healthcare professionals and not to rely on AI for medical diagnoses or treatment decisions.
|
58 |
Despite the overall advancements BiMediX brings to the field of medical NLP, it shares common challenges with other language models,
|
59 |
+
including hallucinations, toxicity, and stereotypes. BiMediX's medical diagnoses and recommendations are not infallible.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
60 |
|
61 |
## License and Citation
|
62 |
|
|
|
78 |
}
|
79 |
```
|
80 |
|
81 |
+
Visit our [GitHub](https://github.com/mbzuai-oryx/BiMediX) for more information and resources.
|
|