CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models. arXiv:2406.12257, published Jun 18, 2024.
Stronger Models are NOT Stronger Teachers for Instruction Tuning. arXiv:2411.07133, published Nov 11, 2024.
ChatBug: A Common Vulnerability of Aligned LLMs Induced by Chat Templates. arXiv:2406.12935, published Jun 17, 2024.
SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding. arXiv:2402.08983, published Feb 14, 2024.
ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs. arXiv:2402.11753, published Feb 19, 2024.
BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models. arXiv:2401.12242, published Jan 20, 2024.
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. arXiv:2406.08464, published Jun 12, 2024.