Sara Papi's picture

1 20 3

Sara Papi

sarapapi

·

sarapapi

AI & ML interests

None yet

Organizations

sarapapi's activity

upvoted 2 papers 6 months ago

Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming

Paper • 2408.16725 • Published Aug 29, 2024 • 54

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Paper • 2408.16532 • Published Aug 29, 2024 • 51

upvoted 18 papers over 1 year ago

Visualization: the missing factor in Simultaneous Speech Translation

Paper • 2111.00514 • Published Oct 31, 2021 • 1

Over-Generation Cannot Be Rewarded: Length-Adaptive Average Lagging for Simultaneous Speech Translation

Paper • 2206.05807 • Published Jun 12, 2022 • 1

Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments

Paper • 2307.03354 • Published Jul 7, 2023 • 1

When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLP

Paper • 2303.16166 • Published Mar 28, 2023 • 1

Direct Models for Simultaneous Translation and Automatic Subtitling: FBK@IWSLT2023

Paper • 2309.15554 • Published Sep 27, 2023 • 1

Simultaneous Speech Translation for Live Subtitling: from Delay to Display

Paper • 2107.08807 • Published Jul 19, 2021 • 2

Mixtures of Deep Neural Experts for Automated Speech Scoring

Paper • 2106.12475 • Published Jun 23, 2021 • 2

Integrating Language Models into Direct Speech Translation: An Inference-Time Solution to Control Gender Inflection

Paper • 2310.15752 • Published Oct 24, 2023 • 1

Dealing with training and test segmentation mismatch: FBK@IWSLT2021

Paper • 2106.12607 • Published Jun 23, 2021 • 1

Joint Speech Translation and Named Entity Recognition

Paper • 2210.11987 • Published Oct 21, 2022 • 1

Dodging the Data Bottleneck: Automatic Subtitling with Automatically Segmented ST Corpora

Paper • 2209.10608 • Published Sep 21, 2022 • 1

Does Simultaneous Speech Translation need Simultaneous Models?

Paper • 2204.03783 • Published Apr 8, 2022 • 1

Speechformer: Reducing Information Loss in Direct Speech Translation

Paper • 2109.04574 • Published Sep 9, 2021 • 1

Efficient yet Competitive Speech Translation: FBK@IWSLT2022

Paper • 2205.02629 • Published May 5, 2022 • 1

Direct Speech Translation for Automatic Subtitling

Paper • 2209.13192 • Published Sep 27, 2022 • 1

Attention as a Guide for Simultaneous Speech Translation

Paper • 2212.07850 • Published Dec 15, 2022 • 1

AlignAtt: Using Attention-based Audio-Translation Alignments as a Guide for Simultaneous Speech Translation

Paper • 2305.11408 • Published May 19, 2023 • 1

Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 59