Whom to Query for What: Adaptive Group Elicitation via Multi-Turn LLM Interactions Paper • 2602.14279 • Published Feb 15 • 1
Whom to Query for What: Adaptive Group Elicitation via Multi-Turn LLM Interactions Paper • 2602.14279 • Published Feb 15 • 1
Rubrics as an Attack Surface: Stealthy Preference Drift in LLM Judges Paper • 2602.13576 • Published Feb 14 • 2
Rubrics as an Attack Surface (RIPD) Collection This collection releases the official artifacts accompanying “Rubrics as an Attack Surface: Stealthy Preference Drift in LLM Judges.” • 10 items • Updated Mar 2 • 1
ZDCSlab/ripd-anthropic-saferlhf-gemma-2b-uncensored-v1-biased-bt Text Generation • 3B • Updated Feb 21 • 41
ZDCSlab/ripd-anthropic-saferlhf-gemma-2b-uncensored-v1-biased-bt Text Generation • 3B • Updated Feb 21 • 41
ZDCSlab/ripd-anthropic-saferlhf-gemma-2b-uncensored-v1-seed-bt Text Generation • 3B • Updated Feb 21 • 43
ZDCSlab/ripd-anthropic-saferlhf-gemma-2b-uncensored-v1-seed-bt Text Generation • 3B • Updated Feb 21 • 43