Easily Accessible Text-to-Image Generation Amplifies Demographic Stereotypes at Large Scale Paper • 2211.03759 • Published Nov 7, 2022
Contrastive Language-Image Pre-training for the Italian Language Paper • 2108.08688 • Published Aug 19, 2021 • 2
XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models Paper • 2308.01263 • Published Aug 2, 2023
Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions Paper • 2309.07875 • Published Sep 14, 2023
When and why vision-language models behave like bags-of-words, and what to do about it? Paper • 2210.01936 • Published Oct 4, 2022
Introducing v0.5 of the AI Safety Benchmark from MLCommons Paper • 2404.12241 • Published Apr 18, 2024 • 10
XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models Paper • 2308.01263 • Published Aug 2, 2023
Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions Paper • 2309.07875 • Published Sep 14, 2023
Explaining Speech Classification Models via Word-Level Audio Segments and Paralinguistic Features Paper • 2309.07733 • Published Sep 14, 2023
A Tale of Pronouns: Interpretability Informs Gender Bias Mitigation for Fairer Instruction-Tuned Machine Translation Paper • 2310.12127 • Published Oct 18, 2023 • 1
Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists Paper • 2203.09192 • Published Mar 17, 2022
Contrastive Language-Image Pre-training for the Italian Language Paper • 2108.08688 • Published Aug 19, 2021 • 2