Beat the AI: Investigating Adversarial Human Annotation for Reading Comprehension
Paper
•
2002.00293
•
Published
Language models, reasoning, robustness, question answering, evaluation, theorem proving, knowledge graphs, mechanistic interpretability, adversarial training, dynamic adversarial data collection, in-context learning, natural language explanations, safety and security, self-training, knowledge distillation, natural language processing