CrowdMath exposes the math gap AI agents still miss

CrowdMath is a dataset of 164 annotated math research chains. Use it to test whether models understand progress, not just answers.

Lars Cornelissen · Jun 8, 2026