---
language:
- en
license: apache-2.0
tags:
- sound language model
---

## Model Details

We have developed and released the family [Jan-Llama3-Sound](https://huggingface.co/collections/jan-hq/jan-llama3-668e4dad446c8736208dca4f). This family is natively understanding audio and text input.

We continue to expand [Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) with sound understanding capabilities.

This is the initial checkpoint with average weight initialization applied only to new vocabulary.


**Model developers** Homebrew Research.

**Input** Text and sound.

**Output** Text.

**Model Architecture** Llama-3.

**Language(s):** English.

## Intended Use

**Intended Use Cases** This family is primarily intended for research applications. This version aims to further improve the LLM on sound understanding capabilities.

**Out-of-scope** The use of Jan-Llama3-Sound in any manner that violates applicable laws or regulations is strictly prohibited.

## Citation Information

**BibTeX:**

```
@article{Llama-3-Sound: Sound Instruction LLM 2024,
  title={Llama-3-Sound},
  author={Homebrew Research},
  year=2024,
  month=July},
  url={https://huggingface.co/jan-hq/Jan-Llama3-Init}
```

## Acknowledgement

- **[WhisperSpeech](https://github.com/collabora/WhisperSpeech)**

- **[Encodec](https://github.com/facebookresearch/encodec)**

- **[Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)**