PhantomWiki: On-Demand Datasets for Reasoning and Retrieval Evaluation (Paper • arXiv:2502.20377 • Published Feb 27)