--- language: - en license: apache-2.0 tags: - sound language model --- ## Model Details We have developed and released the family [Jan-Llama3-Sound](https://huggingface.co/collections/jan-hq/jan-llama3-668e4dad446c8736208dca4f). This family is natively understanding audio and text input. We continue to expand [Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) with sound understanding capabilities. This is the initial checkpoint with average weight initialization applied only to new vocabulary. **Model developers** Homebrew Research. **Input** Text and sound. **Output** Text. **Model Architecture** Llama-3. **Language(s):** English. ## Intended Use **Intended Use Cases** This family is primarily intended for research applications. This version aims to further improve the LLM on sound understanding capabilities. **Out-of-scope** The use of Jan-Llama3-Sound in any manner that violates applicable laws or regulations is strictly prohibited. ## Citation Information **BibTeX:** ``` @article{Llama-3-Sound: Sound Instruction LLM 2024, title={Llama-3-Sound}, author={Homebrew Research}, year=2024, month=July}, url={https://huggingface.co/jan-hq/Jan-Llama3-Init} ``` ## Acknowledgement - **[WhisperSpeech](https://github.com/collabora/WhisperSpeech)** - **[Encodec](https://github.com/facebookresearch/encodec)** - **[Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)**