|
--- |
|
title: README |
|
emoji: π |
|
colorFrom: red |
|
colorTo: red |
|
sdk: static |
|
pinned: false |
|
--- |
|
|
|
# ALM: Audio Language and Multimodal |
|
|
|
ALM is a collaborative research group focused on deep learning for audio, language, and multimodal data. |
|
|
|
### About Us |
|
|
|
- **Alkis Koudounas** - PhD Student at Politecnico di Torino ([Profile](https://huggingface.co/alkiskoudounas) | [polito.it](https://www.polito.it)) |
|
- **Lorenzo Vaiani** - PhD Student at Politecnico di Torino ([Profile](https://huggingface.co/VaianiLorenzo) | [polito.it](https://www.polito.it)) |
|
- **Moreno La Quatra** - Research Fellow at Kore University of Enna ([Profile](https://huggingface.co/morenolq) | [unikore.it](https://www.unikore.it)) |
|
|
|
### Projects |
|
|
|
- **ARCH** - [Audio Representation Benchmark](https://huggingface.co/spaces/ALM/ARCH) ([Repo](https://github.com/MorenoLaQuatra/ARCH)): A platform dedicated to benchmarking models for audio representations. [Resaerch Paper](https://huggingface.co/papers/2405.00934) |
|
- **CALM** - [Contrastive Alignment of Language and Music](https://github.com/ALM-LAB/CALM): A project from the 1st Sound of AI Hackathon. CALM aligns songs with natural language descriptions, enabling music searches via text, voice, or facial expressions. |
|
- **PACE** - [Podcast AI for Chapters and Episodes](https://github.com/ALM-LAB/PACE): PACE is a semantic search engine for podcasts. It enables users to search for specific parts of a podcast using natural language. The project was created for the AssemblyAI 50K Hackathon - Winter 2022. |
|
--- |
|
|
|
|