---
title: README
emoji: 🏢
colorFrom: purple
colorTo: blue
sdk: static
pinned: false
---
Welcome to the official repository for DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision-Language Models. This repository contains the code, resources, and documentation supporting our paper, which introduces DynaMath, a benchmark designed to rigorously evaluate the robustness of mathematical reasoning in vision-language models (VLMs).

For further details, including the benchmark leaderboard, please visit our [project website](https://dynamath.github.io) and read our [preprint](https://huan-zhang.com/DynaMath.pdf).

![image/png](https://cdn-uploads.huggingface.co/production/uploads/64d45451c34a346181b130dd/vK6Z0E8Qz4xV3yAZlKxq1.png)