Evaluating Social Impacts of Generative AI

community

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

evijit authored a paper about 9 hours ago

Stop treating `AGI' as the north-star goal of AI research

evijit authored a paper about 9 hours ago

"It's not a representation of me": Examining Accent Bias and Digital Exclusion in Synthetic AI Voice Services

evijit authored a paper 24 days ago

Fully Autonomous AI Agents Should Not be Developed

View all activity

evaleval's activity

evijit

authored 2 papers about 9 hours ago

Stop treating `AGI' as the north-star goal of AI research

Paper • 2502.03689 • Published Feb 6 • 3

"It's not a representation of me": Examining Accent Bias and Digital Exclusion in Synthetic AI Voice Services

Paper • 2504.09346 • Published 6 days ago • 2

yjernite

posted an update 2 days ago

Post

2454

Today in Privacy & AI Tooling - introducing a nifty new tool to examine where data goes in open-source apps on 🤗

HF Spaces have tons (100Ks!) of cool demos leveraging or examining AI systems - and because most of them are OSS we can see exactly how they handle user data 📚🔍

That requires actually reading the code though, which isn't always easy or quick! Good news: code LMs have gotten pretty good at automatic review, so we can offload some of the work - here I'm using Qwen/Qwen2.5-Coder-32B-Instruct to generate reports and it works pretty OK 🙌

The app works in three stages:
1. Download all code files
2. Use the Code LM to generate a detailed report pointing to code where data is transferred/(AI-)processed (screen 1)
3. Summarize the app's main functionality and data journeys (screen 2)
4. Build a Privacy TLDR with those inputs

It comes with a bunch of pre-reviewed apps/Spaces, great to see how many process data locally or through (private) HF endpoints 🤗

Note that this is a POC, lots of exciting work to do to make it more robust, so:
- try it: yjernite/space-privacy
- reach out to collab: yjernite/space-privacy

evijit

authored a paper 24 days ago

Fully Autonomous AI Agents Should Not be Developed

Paper • 2502.02649 • Published Feb 4 • 33

meg

authored a paper 24 days ago

Fully Autonomous AI Agents Should Not be Developed

Paper • 2502.02649 • Published Feb 4 • 33

yjernite

authored a paper about 2 months ago

Beyond Release: Access Considerations for Generative AI Systems

Paper • 2502.16701 • Published Feb 23 • 16

irenesolaiman

authored 2 papers about 2 months ago

Process for Adapting Language Models to Society (PALMS) with Values-Targeted Datasets

Paper • 2106.10328 • Published Jun 18, 2021

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Paper • 2211.05100 • Published Nov 9, 2022 • 31

canyuchen

authored a paper about 2 months ago

SearchRAG: Can Search Engines Be Helpful for LLM-based Medical Question Answering?

Paper • 2502.13233 • Published Feb 18 • 14

alinaleidinger

authored 2 papers about 2 months ago

Probing LLMs for Joint Encoding of Linguistic Categories

Paper • 2310.18696 • Published Oct 28, 2023 • 1

How far can bias go? -- Tracing bias from pretraining data to alignment

Paper • 2411.19240 • Published Nov 28, 2024

felfri

authored a paper 3 months ago

MSTS: A Multimodal Safety Test Suite for Vision-Language Models

Paper • 2501.10057 • Published Jan 17 • 8

meg

posted an update 3 months ago

Post

3354

💫...And we're live!💫 Seasonal newsletter from ethicsy folks at Hugging Face, exploring the ethics of "AI Agents"
https://huggingface.co/blog/ethics-soc-7
Our analyses found:
- There's a spectrum of "agent"-ness
- *Safety* is a key issue, leading to many other value-based concerns
Read for details & what to do next!
With @evijit , @giadap , and @sasha

yjernite

posted an update 3 months ago

Post

2412

🤗👤 💻 Speaking of AI agents ...
...Is easier with the right words ;)

My colleagues @meg @evijit @sasha and @giadap just published a wonderful blog post outlining some of the main relevant notions with their signature blend of value-informed and risk-benefits contrasting approach. Go have a read!

https://huggingface.co/blog/ethics-soc-7

alinaleidinger

authored a paper 3 months ago

CIVICS: Building a Dataset for Examining Culturally-Informed Values in Large Language Models

Paper • 2405.13974 • Published May 22, 2024 • 9

felfri

authored a paper 4 months ago

LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps

Paper • 2412.15035 • Published Dec 19, 2024 • 4

yjernite

posted an update 4 months ago

Post

2244

🇪🇺 Policy Thoughts in the EU AI Act Implementation 🇪🇺

There is a lot to like in the first draft of the EU GPAI Code of Practice, especially as regards transparency requirements. The Systemic Risks part, on the other hand, is concerning for both smaller developers and for external stakeholders.

I wrote more on this topic ahead of the next draft. TLDR: more attention to immediate large-scale risks and to collaborative solutions supported by evidence can help everyone - as long as developers disclose sufficient information about their design choices and deployment contexts.

Full blog here, based on our submitted response with @frimelle and @brunatrevelin :

https://huggingface.co/blog/yjernite/eu-draft-cop-risks#on-the-proposed-taxonomy-of-systemic-risks

2 replies

canyuchen

authored 2 papers 5 months ago

From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge

Paper • 2411.16594 • Published Nov 25, 2024 • 41

ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?

Paper • 2411.06469 • Published Nov 10, 2024 • 17

canyuchen

authored a paper 6 months ago

Can Knowledge Editing Really Correct Hallucinations?

Paper • 2410.16251 • Published Oct 21, 2024 • 56

AI & ML interests

Recent Activity

Team members 11

evaleval's activity