File size: 1,272 Bytes
8eeb638
 
20fc13a
 
8eeb638
 
20fc13a
8eeb638
 
b339694
 
 
 
 
 
 
 
20fc13a
 
4a7ba06
20fc13a
0b2b1d4
 
5a28714
0b2b1d4
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
---
title: README
emoji: 🚀
colorFrom: blue
colorTo: gray
sdk: static
pinned: true
---

# 🐙 Breaking Language Barriers in Multilingual Mathematical Reasoning: Insights and Observations

Project Page: [https://mathoctopus.github.io/](https://mathoctopus.github.io/)

Paper: [https://arxiv.org/abs/2310.20246.pdf](https://arxiv.org/abs/2310.20246.pdf)

Code: [https://github.com/microsoft/MathOctopus](https://github.com/microsoft/MathOctopus)

## About Mathoctopus

Mathoctopus is a series of multilingual math reasoning large language models based on LLaMA.

This work pioneers exploring and building powerful  Multilingual Math Reasoning (xMR) LLMs. To accomplish this, we make the following works:

- **MGSM8KInstruct**,  the  multilingual math reasoning instruction dataset,  encompassing ten distinct languages, thus addressing the issue of training data scarcity in xMR tasks.
- **MSVAMP**, an out-of-domain xMR test dataset, to conduct a more exhaustive and comprehensive evaluation of the model’s multilingual mathematical capabilities.
- **MathOctopus**, our effective Multilingual Math Reasoning  LLMs,  training with  different strategies, which notably outperform conventional open-source LLMs and exhibit superiority over ChatGPT in few-shot scenarios.