|
--- |
|
library_name: transformers |
|
base_model: 152334H/miqu-1-70b-sf |
|
tags: |
|
- medical |
|
license: cc-by-nc-2.0 |
|
pipeline_tag: text-generation |
|
language: |
|
- en |
|
--- |
|
|
|
# Dr. Niko (70B) |
|
|
|
> This repository contains the full merged model. |
|
|
|
![image/png](https://cdn-uploads.huggingface.co/production/uploads/639a31227ecb808549a0d18d/zo8Qi2hVnnurxyGfbU2rA.png) |
|
|
|
The Dr-Niko model is designed to assist medical professionals, researchers, and students with a wide range of tasks, including answering medical and scientific questions, summarizing research papers and clinical notes, generating medical reports and documentation, providing medical advice and recommendations (with appropriate disclaimers), and assisting with medical decision-making and diagnosis (in a supporting role). |
|
|
|
|
|
## Model Details |
|
|
|
- **Model Name**: Dr-Niko |
|
- **Model Type**: Medical Large Language Model (LLM) |
|
- **Model Size**: 70 billion parameters (base model is miqu-70B) |
|
- **Training Data**: The model was fine-tuned on a curated dataset of high-quality medical and scientific literature. |
|
- **Fine-Tuning Approach**: The model was then fine-tuned on a medical and scientific dataset using [LLaMa-Factory ](https://github.com/hiyouga/LLaMA-Factory) for 1.5 epochs. |
|
- **Intended Use**: The Dr-Niko model is designed to assist medical professionals, researchers, and students with a wide range of tasks, including: |
|
- Answering medical and scientific questions |
|
- Summarizing research papers and clinical notes (in a supporting role) |
|
- Generating medical reports and documentation (in a supporting role) |
|
- Providing medical advice and recommendations (with appropriate disclaimers) |
|
- Assisting with medical decision-making and diagnosis (in a supporting role) |
|
|
|
### Model Description |
|
|
|
This model is next in a series of medical finetuning attempts, following medfalcon and medguanaco. |
|
|
|
- **Developed by:** [Nick Mitchko](https://www.linkedin.com/in/nmitchko/) |
|
- **Funded by :** [My Bank Account] |
|
- **Shared by :** [My Internet] |
|
- **Model type:** [LLaMa-70B variant] |
|
- **Language(s) (NLP):** [English] |
|
- **License:** [See [here](#license---nomerge)] |
|
- **Finetuned from model [optional]:** [Miqu-70B](https://huggingface.co/152334H/miqu-1-70b-sf) |
|
|
|
### Examples |
|
|
|
|
|
![image/png](https://cdn-uploads.huggingface.co/production/uploads/639a31227ecb808549a0d18d/kHbTSRw-tcp5vPEmtLl1Q.png) |
|
![image/png](https://cdn-uploads.huggingface.co/production/uploads/639a31227ecb808549a0d18d/cy46MfNI_bHnnxQB_caIe.png) |
|
|
|
### Model Sources |
|
|
|
<!-- Provide the basic links for the model. --> |
|
|
|
- **Repository:** [Coming Soon] |
|
- **Paper []:** [Maybe] |
|
- **Demo []:** Maybe |
|
<!-- |
|
## Uses |
|
|
|
|
|
### Direct Use |
|
|
|
|
|
|
|
[More Information Needed] |
|
|
|
### Downstream Use [optional] |
|
|
|
|
|
|
|
[More Information Needed] |
|
|
|
### Out-of-Scope Use |
|
|
|
|
|
|
|
[More Information Needed] |
|
|
|
## Bias, Risks, and Limitations |
|
|
|
|
|
[More Information Needed] |
|
|
|
### Recommendations |
|
|
|
|
|
|
|
Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations. |
|
|
|
## How to Get Started with the Model |
|
|
|
Use the code below to get started with the model. |
|
|
|
[More Information Needed] |
|
|
|
## Training Details |
|
|
|
### Training Data |
|
|
|
[More Information Needed] |
|
|
|
### Training Procedure |
|
|
|
#### Preprocessing [optional] |
|
|
|
[More Information Needed] |
|
|
|
|
|
#### Training Hyperparameters |
|
|
|
#### Speeds, Sizes, Times [optional] |
|
|
|
[More Information Needed] |
|
|
|
## Evaluation |
|
|
|
|
|
|
|
### Testing Data, Factors & Metrics |
|
|
|
#### Testing Data |
|
|
|
[More Information Needed] |
|
|
|
#### Factors |
|
|
|
[More Information Needed] |
|
--> |
|
#### Metrics |
|
|
|
[Formal Evaluation Coming Soon] |
|
|
|
<!-- |
|
### Results |
|
|
|
[More Information Needed] |
|
|
|
#### Summary |
|
|
|
--> |
|
<!-- |
|
## Model Examination [optional] |
|
--> |
|
<!-- Relevant interpretability work for the model goes here --> |
|
<!-- |
|
[More Information Needed] |
|
|
|
## Environmental Impact |
|
--> |
|
<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly --> |
|
<!-- |
|
Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700). |
|
|
|
- **Hardware Type:** [More Information Needed] |
|
- **Hours used:** [More Information Needed] |
|
- **Cloud Provider:** [More Information Needed] |
|
- **Compute Region:** [More Information Needed] |
|
- **Carbon Emitted:** [More Information Needed] |
|
|
|
## Technical Specifications [optional] |
|
|
|
### Model Architecture and Objective |
|
|
|
[More Information Needed] |
|
|
|
### Compute Infrastructure |
|
|
|
[More Information Needed] |
|
|
|
#### Hardware |
|
|
|
[More Information Needed] |
|
|
|
#### Software |
|
|
|
[More Information Needed] |
|
--> |
|
|
|
## Citation [optional] |
|
|
|
Information Coming Soon |
|
|
|
<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. --> |
|
<!-- |
|
**BibTeX:** |
|
|
|
[More Information Needed] |
|
|
|
**APA:** |
|
|
|
[More Information Needed] |
|
|
|
## Glossary [optional] |
|
--> |
|
<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. --> |
|
<!-- |
|
[More Information Needed] |
|
|
|
## More Information [optional] |
|
|
|
[More Information Needed] |
|
|
|
## Model Card Authors [optional] |
|
|
|
[More Information Needed] |
|
|
|
## Model Card Contact |
|
|
|
[More Information Needed] |
|
|
|
--> |
|
|
|
## License - NOMERGE |
|
|
|
``` |
|
NOMERGE License |
|
|
|
Copyright (c) 2024 152334H |
|
|
|
Permission is hereby granted, free of charge, to any person obtaining a copy |
|
of this software and associated documentation files (the "Software"), to deal |
|
in the Software without restriction, including without limitation the rights |
|
to use, copy, modify, NOT merge, publish, distribute, sublicense, and/or sell |
|
copies of the Software, and to permit persons to whom the Software is |
|
furnished to do so, subject to the following conditions: |
|
|
|
The above copyright notice and this permission notice shall be included in all |
|
copies or substantial portions of the Software. |
|
|
|
All tensors ("weights") provided by the Software shall not be conjoined with |
|
other tensors ("merging") unless given explicit permission by the license holder. |
|
Utilities including but not limited to "mergekit", "MergeMonster", are forbidden |
|
from use in conjunction with this Software. |
|
|
|
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR |
|
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, |
|
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE |
|
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER |
|
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, |
|
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE |
|
SOFTWARE. |
|
``` |