Moreno La Quatra's picture

Moreno La Quatra

morenolq

·

https://mlaquatra.me/

AI & ML interests

NLP, Multimodal Learning, Audio Processing

Recent Activity

liked a model about 1 month ago

nvidia/personaplex-7b-v1

updated a model about 2 months ago

ALM/hubert-large-audioset

updated a model about 2 months ago

ALM/wav2vec2-large-audioset

View all activity

Organizations

upvoted a collection 7 months ago

gpt-oss

Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7, 2025 • 416

upvoted a collection 9 months ago

FAMA

The First Large-Scale Open-Science Speech Foundation Model for English and Italian • 5 items • Updated May 30, 2025 • 10

upvoted 3 papers 12 months ago

FlanEC: Exploring Flan-T5 for Post-ASR Error Correction

Paper • 2501.12979 • Published Jan 22, 2025 • 1

Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection from Speech in Real-World Operative Conditions

Paper • 2406.16128 • Published Jun 23, 2024 • 1

voc2vec: A Foundation Model for Non-Verbal Vocalization

Paper • 2502.16298 • Published Feb 22, 2025 • 1

upvoted 4 collections about 1 year ago

Text Style Transfer

Model checkpoints of the paper "Self-supervised Text Style Transfer Using Cycle-Consistent Adversarial Networks" • 33 items • Updated Dec 1, 2024 • 2

SEAHORSE release

The SEAHORSE metrics (as described in https://arxiv.org/abs/2305.13194). • 12 items • Updated Jul 10, 2025 • 20

MT5 release

The MT5 release follows the T5 family, but is pretrained on multilingual data. The update UMT5 models are pretrained on an updated corpus. • 10 items • Updated Jul 10, 2025 • 23

Health AI Developer Foundations (HAI-DEF)

Groups models released for use in health AI by Google. Read more about HAI-DEF at http://goo.gle/hai-def • 22 items • Updated Jan 12 • 199

upvoted 4 collections over 1 year ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated May 1, 2025 • 574

mHuBERT-147 models

Compact yet powerful multilingual speech representation models based on the HuBERT architecture. • 3 items • Updated Jun 4, 2024 • 8

Salamandra 🦎

15 items • Updated 15 days ago • 62

LLaVa-NeXT

LLaVa-NeXT (also known as LLaVa-1.6) improves upon the 1.5 series by incorporating higher image resolutions and more reasoning/OCR datasets. • 8 items • Updated Jul 19, 2024 • 32

upvoted 2 papers over 1 year ago

Benchmarking Representations for Speech, Music, and Acoustic Events

Paper • 2405.00934 • Published May 2, 2024 • 1

Speech Analysis of Language Varieties in Italy

Paper • 2406.15862 • Published Jun 22, 2024 • 2

upvoted a paper about 2 years ago

Neural Network Diffusion

Paper • 2402.13144 • Published Feb 20, 2024 • 100

upvoted a collection about 2 years ago

XLSR

A collection of multilingual Wav2Vec 2.0 checkpoints pre-trained on 53 languages and fine-tuned for CTC speech recognition. • 12 items • Updated Jan 16, 2024 • 8

upvoted a paper about 2 years ago

Understanding LLMs: A Comprehensive Overview from Training to Inference

Paper • 2401.02038 • Published Jan 4, 2024 • 65

upvoted a collection over 2 years ago

🇮🇹 Italian NLP Resources

Collection of models, datasets and demos relevant to Italian NLP 🇮🇹 • 308 items • Updated 8 days ago • 31

upvoted a paper over 2 years ago

CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages

Paper • 2309.09400 • Published Sep 17, 2023 • 87