Original model: https://huggingface.co/DAMO-NLP-MT/polylm-multialpaca-13b

Model Card for PolyLM-Multialpaca

This model is finetuned on polyLM-13b using multialpaca (a self-instruction dataset)

Demo

Open

Bias, Risks, and Limitations

The information below in this section are copied from the model's official model card:

Our contributions are fully methodological: adding the support of multilingualism to LLM during training and SFT phases. It is unavoidable that PolyLM might exhibit several common deficiencies of language models, e.g. hallucination and toxicity. PolyLM should not be used directly in any application, without a prior assessment of safety and fairness concerns specific to the application.

This version activates the instruction-following capability of PolyLM through self-instruction, but currently, the training instructions are relatively simple and the support for abilities such as multi-turn dialogue, context understanding, CoT, Plugin, etc. is not very friendly. We are making efforts to develop a new version.

Citation

BibTeX:

@misc{wei2023polylm,
    title={PolyLM: An Open Source Polyglot Large Language Model},
    author={Xiangpeng Wei and Haoran Wei and Huan Lin and Tianhao Li and Pei Zhang and Xingzhang Ren and Mei Li and Yu Wan and Zhiwei Cao and Binbin Xie and Tianxiang Hu and Shangjie Li and Binyuan Hui and Bowen Yu and Dayiheng Liu and Baosong Yang and Fei Huang and Jun Xie},
    year={2023},
    eprint={2307.06018},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}
Downloads last month
20
GGUF
Model size
15.2B params
Architecture
gpt2

5-bit

6-bit

Inference API
Unable to determine this model's library. Check the docs .