Learning by Analogy: Diverse Questions Generation in Math Word Problem

Zihao Zhou; Maizhen Ning; Qiufeng Wang; Jie Yao; Wei Wang; Xiaowei Huang; Kaizhu Huang

doi:10.18653/v1/2023.findings-acl.705

Learning by Analogy: Diverse Questions Generation in Math Word Problem

Zihao Zhou, Maizhen Ning, Qiufeng Wang^*, Jie Yao, Wei Wang, Xiaowei Huang, Kaizhu Huang

^*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

11 Citations (Scopus)

Abstract

Solving math word problem (MWP) with AI techniques has recently made great progress with the success of deep neural networks (DNN), but it is far from being solved. We argue that the ability of learning by analogy is essential for an MWP solver to better understand same problems which may typically be formulated in diverse ways. However most existing works exploit the shortcut learning to train MWP solvers simply based on samples with a single question. In lack of diverse questions, these methods merely learn shallow heuristics. In this paper, we make a first attempt to solve MWPs by generating diverse yet consistent questions/equations. Given a typical MWP including the scenario description, question, and equation (i.e., answer), we first generate multiple consistent equations via a group of heuristic rules. We then feed them to a question generator together with the scenario to obtain the corresponding diverse questions, forming a new MWP with a variety of questions and equations. Finally we engage a data filter to remove those unreasonable MWPs, keeping the high-quality augmented ones. To evaluate the ability of learning by analogy for an MWP solver, we generate a new MWP dataset (called DiverseMath23K) with diverse questions by extending the current benchmark Math23K. Extensive experimental results demonstrate that our proposed method can generate high-quality diverse questions with corresponding equations, further leading to performance improvement on Diverse-Math23K. The code and dataset is available at: https://github.com/zhouzihao501/DiverseMWP.

Original language	English
Title of host publication	Findings of the Association for Computational Linguistics, ACL 2023
Publisher	Association for Computational Linguistics (ACL)
Pages	11091-11104
Number of pages	14
ISBN (Electronic)	9781959429623
DOIs	https://doi.org/10.18653/v1/2023.findings-acl.705
Publication status	Published - 2023
Event	61st Annual Meeting of the Association for Computational Linguistics, ACL 2023 - Toronto, Canada Duration: 9 Jul 2023 → 14 Jul 2023

Publication series

Name	Proceedings of the Annual Meeting of the Association for Computational Linguistics
ISSN (Print)	0736-587X

Conference

Conference	61st Annual Meeting of the Association for Computational Linguistics, ACL 2023
Country/Territory	Canada
City	Toronto
Period	9/07/23 → 14/07/23

Access to Document

10.18653/v1/2023.findings-acl.705

Cite this

Zhou, Z., Ning, M., Wang, Q., Yao, J., Wang, W., Huang, X., & Huang, K. (2023). Learning by Analogy: Diverse Questions Generation in Math Word Problem. In Findings of the Association for Computational Linguistics, ACL 2023 (pp. 11091-11104). (Proceedings of the Annual Meeting of the Association for Computational Linguistics). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.findings-acl.705

@inproceedings{5a5b349f343e4135bd208adca011868d,

title = "Learning by Analogy: Diverse Questions Generation in Math Word Problem",

abstract = "Solving math word problem (MWP) with AI techniques has recently made great progress with the success of deep neural networks (DNN), but it is far from being solved. We argue that the ability of learning by analogy is essential for an MWP solver to better understand same problems which may typically be formulated in diverse ways. However most existing works exploit the shortcut learning to train MWP solvers simply based on samples with a single question. In lack of diverse questions, these methods merely learn shallow heuristics. In this paper, we make a first attempt to solve MWPs by generating diverse yet consistent questions/equations. Given a typical MWP including the scenario description, question, and equation (i.e., answer), we first generate multiple consistent equations via a group of heuristic rules. We then feed them to a question generator together with the scenario to obtain the corresponding diverse questions, forming a new MWP with a variety of questions and equations. Finally we engage a data filter to remove those unreasonable MWPs, keeping the high-quality augmented ones. To evaluate the ability of learning by analogy for an MWP solver, we generate a new MWP dataset (called DiverseMath23K) with diverse questions by extending the current benchmark Math23K. Extensive experimental results demonstrate that our proposed method can generate high-quality diverse questions with corresponding equations, further leading to performance improvement on Diverse-Math23K. The code and dataset is available at: https://github.com/zhouzihao501/DiverseMWP.",

author = "Zihao Zhou and Maizhen Ning and Qiufeng Wang and Jie Yao and Wei Wang and Xiaowei Huang and Kaizhu Huang",

note = "Publisher Copyright: {\textcopyright} 2023 Association for Computational Linguistics.; 61st Annual Meeting of the Association for Computational Linguistics, ACL 2023 ; Conference date: 09-07-2023 Through 14-07-2023",

year = "2023",

doi = "10.18653/v1/2023.findings-acl.705",

language = "English",

series = "Proceedings of the Annual Meeting of the Association for Computational Linguistics",

publisher = "Association for Computational Linguistics (ACL)",

pages = "11091--11104",

booktitle = "Findings of the Association for Computational Linguistics, ACL 2023",

}

Zhou, Z, Ning, M, Wang, Q, Yao, J, Wang, W, Huang, X & Huang, K 2023, Learning by Analogy: Diverse Questions Generation in Math Word Problem. in Findings of the Association for Computational Linguistics, ACL 2023. Proceedings of the Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics (ACL), pp. 11091-11104, 61st Annual Meeting of the Association for Computational Linguistics, ACL 2023, Toronto, Canada, 9/07/23. https://doi.org/10.18653/v1/2023.findings-acl.705

Learning by Analogy: Diverse Questions Generation in Math Word Problem. / Zhou, Zihao; Ning, Maizhen; Wang, Qiufeng et al.
Findings of the Association for Computational Linguistics, ACL 2023. Association for Computational Linguistics (ACL), 2023. p. 11091-11104 (Proceedings of the Annual Meeting of the Association for Computational Linguistics).

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

TY - GEN

T1 - Learning by Analogy: Diverse Questions Generation in Math Word Problem

AU - Zhou, Zihao

AU - Ning, Maizhen

AU - Wang, Qiufeng

AU - Yao, Jie

AU - Wang, Wei

AU - Huang, Xiaowei

AU - Huang, Kaizhu

PY - 2023

Y1 - 2023

N2 - Solving math word problem (MWP) with AI techniques has recently made great progress with the success of deep neural networks (DNN), but it is far from being solved. We argue that the ability of learning by analogy is essential for an MWP solver to better understand same problems which may typically be formulated in diverse ways. However most existing works exploit the shortcut learning to train MWP solvers simply based on samples with a single question. In lack of diverse questions, these methods merely learn shallow heuristics. In this paper, we make a first attempt to solve MWPs by generating diverse yet consistent questions/equations. Given a typical MWP including the scenario description, question, and equation (i.e., answer), we first generate multiple consistent equations via a group of heuristic rules. We then feed them to a question generator together with the scenario to obtain the corresponding diverse questions, forming a new MWP with a variety of questions and equations. Finally we engage a data filter to remove those unreasonable MWPs, keeping the high-quality augmented ones. To evaluate the ability of learning by analogy for an MWP solver, we generate a new MWP dataset (called DiverseMath23K) with diverse questions by extending the current benchmark Math23K. Extensive experimental results demonstrate that our proposed method can generate high-quality diverse questions with corresponding equations, further leading to performance improvement on Diverse-Math23K. The code and dataset is available at: https://github.com/zhouzihao501/DiverseMWP.

AB - Solving math word problem (MWP) with AI techniques has recently made great progress with the success of deep neural networks (DNN), but it is far from being solved. We argue that the ability of learning by analogy is essential for an MWP solver to better understand same problems which may typically be formulated in diverse ways. However most existing works exploit the shortcut learning to train MWP solvers simply based on samples with a single question. In lack of diverse questions, these methods merely learn shallow heuristics. In this paper, we make a first attempt to solve MWPs by generating diverse yet consistent questions/equations. Given a typical MWP including the scenario description, question, and equation (i.e., answer), we first generate multiple consistent equations via a group of heuristic rules. We then feed them to a question generator together with the scenario to obtain the corresponding diverse questions, forming a new MWP with a variety of questions and equations. Finally we engage a data filter to remove those unreasonable MWPs, keeping the high-quality augmented ones. To evaluate the ability of learning by analogy for an MWP solver, we generate a new MWP dataset (called DiverseMath23K) with diverse questions by extending the current benchmark Math23K. Extensive experimental results demonstrate that our proposed method can generate high-quality diverse questions with corresponding equations, further leading to performance improvement on Diverse-Math23K. The code and dataset is available at: https://github.com/zhouzihao501/DiverseMWP.

UR - http://www.scopus.com/inward/record.url?scp=85169073529&partnerID=8YFLogxK

U2 - 10.18653/v1/2023.findings-acl.705

DO - 10.18653/v1/2023.findings-acl.705

M3 - Conference Proceeding

AN - SCOPUS:85169073529

T3 - Proceedings of the Annual Meeting of the Association for Computational Linguistics

SP - 11091

EP - 11104

BT - Findings of the Association for Computational Linguistics, ACL 2023

PB - Association for Computational Linguistics (ACL)

T2 - 61st Annual Meeting of the Association for Computational Linguistics, ACL 2023

Y2 - 9 July 2023 through 14 July 2023

ER -

Zhou Z, Ning M, Wang Q, Yao J, Wang W, Huang X et al. Learning by Analogy: Diverse Questions Generation in Math Word Problem. In Findings of the Association for Computational Linguistics, ACL 2023. Association for Computational Linguistics (ACL). 2023. p. 11091-11104. (Proceedings of the Annual Meeting of the Association for Computational Linguistics). doi: 10.18653/v1/2023.findings-acl.705

Learning by Analogy: Diverse Questions Generation in Math Word Problem

Abstract

Publication series

Conference

Access to Document

Other files and links

Fingerprint

Cite this