Explore a GDPval multi-agent task-generation pipeline
Generate a tiered rubric from a JSON task record