|
--- |
|
license: apache-2.0 |
|
--- |
|
|
|
# **GeM2-Llamion-14B** |
|
|
|
We have released **Llamion** as **GeM 2.0**, the second series of generative models developed by VAIV Company to address the our principal business needs. |
|
|
|
**Llamion** (Llamafied Orion) is derived from transforming the [Orion model](https://huggingface.co/OrionStarAI/Orion-14B-Base) |
|
into [the standard LLaMA architecture](https://github.com/huggingface/transformers/blob/main/src/transformers/models/llama/modeling_llama.py) |
|
through parameter mapping and offline knowledge transfer. |
|
Further technical specifications and study results will be detailed in our upcoming paper, available on this page. |
|
|
|
<!-- Note that this model has NOT been contaminated to artificially inflate its scores for the Open LLM Leaderboards, |
|
unlike some recent models which have been intentionally tainted. --> |
|
|
|
![vaiv_png](./vaiv.png) |
|
|
|
### Contributors |
|
|
|
- VAIV Company AI Lab ([vaiv.kr](https://www.vaiv.kr/)) |