MathInstruct

Introduced by Yue et al. in MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning

MathInstruct is a meticulously curated instruction tuning dataset that combines data from 13 mathematical rationale datasets. It uniquely focuses on the hybrid use of chain-of-thought (CoT) and program-of-thought (PoT) rationales, ensuring extensive coverage of diverse mathematical fields¹²³.

Here are some key points about the MathInstruct dataset:

Compilation: MathInstruct is compiled from 13 math rationale datasets, six of which are newly curated by this work.
Instruction Types: It emphasizes both CoT (chain-of-thought) and PoT (program-of-thought) rationales, providing a rich foundation of intermediate reasoning.
Coverage: The dataset spans various mathematical topics, making it valuable for training and evaluating models in mathematical reasoning.

For more details, you can explore the MathInstruct dataset on Hugging Face or visit the project page¹⁴. 📚🧮

(1) TIGER-Lab/MathInstruct · Datasets at Hugging Face. https://huggingface.co/datasets/TIGER-Lab/MathInstruct. (2) Mathematical Reasoning: Open-Source LLMs with Hybrid Instructional .... https://news.superagi.com/2023/09/12/mathematical-reasoning-mammoth-models-elevate-open-source-llms-with-hybrid-instructional-techniques/. (3) OpenDataLab 引领AI大模型时代的开放数据平台. https://opendatalab.com/OpenDataLab/MathInstruct. (4) MathInstruct. https://www.modelscope.cn/datasets/AI-ModelScope/MathInstruct/summary. (5) undefined. https://tiger-ai-lab.github.io/MAmmoTH/.

Homepage

Benchmarks

Add a new result Link an existing benchmark

No benchmarks yet. Start a new benchmark or link an existing one.

Papers

Paper	Code	Results	Date	Stars

MathInstruct

Benchmarks

Add a new result Link an existing benchmark

Papers

Dataset Loaders

Add Remove

Tasks

Similar Datasets

SVAMP

NumGLUE

MATH

MATHWELL Human Annotation Dataset

Usage

License

Modalities

Languages

MathInstruct

Benchmarks Edit Add a new result Link an existing benchmark

Papers

Dataset Loaders Edit Add Remove

Tasks Edit

Similar Datasets

SVAMP

NumGLUE

MATH

MATHWELL Human Annotation Dataset

Usage

License Edit

Modalities Edit

Languages Edit

Benchmarks

Add a new result Link an existing benchmark

Dataset Loaders

Add Remove

Tasks

License

Modalities

Languages