A country-grounded cross-cultural benchmark for LLM safety and cultural sensitivity across 10 country-language pairs.
AI & ML interests
AI Safety & AI Security
Recent Activity
View all activity
Papers
XL-SafetyBench: A Country-Grounded Cross-Cultural Benchmark for LLM Safety and Cultural Sensitivity
COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs
A Framework for Evaluating Organization-Specific Policy Alignment in LLMs
-
COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs
Paper • 2601.01836 • Published • 10 -
AIM-Intelligence/COMPASS-Policy-Alignment-Testbed-Dataset
Viewer • Updated • 5.92k • 64 • 11 -
AIM-Intelligence/COMPASS-Policy-aware-SFT-Dataset
Viewer • Updated • 4.12k • 23 • 2 -
AIM-Intelligence/COMPASS_Qwen2.5-7B-Instruct_LoRA
Text Generation • 8B • Updated • 2 • 2
A country-grounded cross-cultural benchmark for LLM safety and cultural sensitivity across 10 country-language pairs.
A Framework for Evaluating Organization-Specific Policy Alignment in LLMs
-
COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs
Paper • 2601.01836 • Published • 10 -
AIM-Intelligence/COMPASS-Policy-Alignment-Testbed-Dataset
Viewer • Updated • 5.92k • 64 • 11 -
AIM-Intelligence/COMPASS-Policy-aware-SFT-Dataset
Viewer • Updated • 4.12k • 23 • 2 -
AIM-Intelligence/COMPASS_Qwen2.5-7B-Instruct_LoRA
Text Generation • 8B • Updated • 2 • 2
Fine-tuned Model Collections using the "Representation Bending" (REPBEND) approach described in Representation Bending for Large Language Model Safety