Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3397271.3401227acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
short-paper

Neural Mathematical Solver with Enhanced Formula Structure

Published: 25 July 2020 Publication History

Abstract

Automatically answering mathematical problems is a challenging task since it requires not only the ability of linguistic understanding but also mathematical comprehension. Existing studies usually explore solutions on the elementary math word problems that aim to understand the questions described in natural language narratives, which are not capable of solving more general problems containing structural formulas. To this end, in this paper, we propose a novel Neural Mathematical Solver (NMS) with enhanced formula structures. Specifically, we first frame the formulas in a certain problem as a TeX dependency graph to preserve formula-enriched structures. Then, we design a formula graph network (FGN) to capture its mathematical relations. Next, we develop a novel architecture with two GRU models, connecting tokens from both word space and formula space together, to learn the linguistic semantics for the answers. Extensive experiments on a large-scale dataset demonstrate that NMS not only achieves better answer prediction but also visualizes reasonable mathematical representations of problems.

References

[1]
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2015. Neural machine translation by jointly learning to align and translate. In 3rd International Conference on Learning Representations, ICLR 2015.
[2]
Junyoung Chung, Caglar Gulcehre, Kyunghyun Cho, and Yoshua Bengio. 2014. Empirical evaluation of gated recurrent neural networks on sequence modeling. In NIPS 2014 Workshop on Deep Learning, December 2014.
[3]
Zhenya Huang, Qi Liu, Chengxiang Zhai, Yu Yin, Enhong Chen, Weibo Gao, and Guoping Hu. 2019. Exploring Multi-Objective Exercise Recommendations in Online Education Systems. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management. 1261--1270.
[4]
Zhiheng Huang, Wei Xu, and Kai Yu. 2015. Bidirectional LSTM-CRF models for sequence tagging. arXiv preprint arXiv:1508.01991 (2015).
[5]
Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of machine learning research, Vol. 9, Nov (2008), 2579--2605.
[6]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In Advances in neural information processing systems. 3111--3119.
[7]
Subhro Roy and Dan Roth. 2015. Solving General Arithmetic Word Problems. In Proceedings of the 2015 Conference on EMNLP. 1743--1752.
[8]
Adam Santoro, Ryan Faulkner, David Raposo, Jack Rae, Mike Chrzanowski, Theophane Weber, Daan Wierstra, Oriol Vinyals, Razvan Pascanu, and Timothy Lillicrap. 2018. Relational recurrent neural networks. In Advances in neural information processing systems. 7299--7310.
[9]
David Saxton, Edward Grefenstette, Felix Hill, and Pushmeet Kohli. 2019. Analysing mathematical reasoning abilities of neural models. arXiv preprint arXiv:1904.01557 (2019).
[10]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in neural information processing systems. 5998--6008.
[11]
Hao Wang, Enhong Chen, Qi Liu, Tong Xu, Dongfang Du, Wen Su, and Xiaopeng Zhang. 2018. A United Approach to Learning Sparse Attributed Network Embedding. In 2018 IEEE International Conference on Data Mining (ICDM). IEEE, 557--566.
[12]
Hao Wang, Tong Xu, Qi Liu, Defu Lian, Enhong Chen, Dongfang Du, Han Wu, and Wen Su. 2019. MCNE: An End-to-End Framework for Learning Multiple Conditional Network Representations of Social Network. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 1064--1072.
[13]
Zhipeng Xie and Shichao Sun. 2019. A goal-driven tree-structured neural model for math word problems. In Proceedings of the 28th International Joint Conference on Artificial Intelligence. AAAI Press, 5299--5305.
[14]
Yu Yin, Zhenya Huang, Enhong Chen, Qi Liu, Fuzheng Zhang, Xing Xie, and Guoping Hu. 2018. Transcribing content from structural images with spotlight mechanism. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2643--2652.
[15]
Dongxiang Zhang, Lei Wang, Luming Zhang, Bing Tian Dai, and Heng Tao Shen. 2019. The gap of semantic parsing: A survey on automatic math word problem solvers. IEEE transactions on pattern analysis and machine intelligence (2019).

Cited By

View all
  • (2024)Learning to solve geometry problems via simulating human dual-reasoning processProceedings of the Thirty-Third International Joint Conference on Artificial Intelligence10.24963/ijcai.2024/725(6559-6568)Online publication date: 3-Aug-2024
  • (2024)Learning Relation-Enhanced Hierarchical Solver for Math Word ProblemsIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.327211435:10(13830-13844)Online publication date: Oct-2024
  • (2024)Knowledge-Associated Embedding for Memory-Aware Knowledge TracingIEEE Transactions on Computational Social Systems10.1109/TCSS.2023.330690911:3(4016-4028)Online publication date: Jun-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '20: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval
July 2020
2548 pages
ISBN:9781450380164
DOI:10.1145/3397271
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 July 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. formula
  2. mathematical comprehension
  3. mathematical problem

Qualifiers

  • Short-paper

Funding Sources

Conference

SIGIR '20
Sponsor:

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)20
  • Downloads (Last 6 weeks)0
Reflects downloads up to 20 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Learning to solve geometry problems via simulating human dual-reasoning processProceedings of the Thirty-Third International Joint Conference on Artificial Intelligence10.24963/ijcai.2024/725(6559-6568)Online publication date: 3-Aug-2024
  • (2024)Learning Relation-Enhanced Hierarchical Solver for Math Word ProblemsIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.327211435:10(13830-13844)Online publication date: Oct-2024
  • (2024)Knowledge-Associated Embedding for Memory-Aware Knowledge TracingIEEE Transactions on Computational Social Systems10.1109/TCSS.2023.330690911:3(4016-4028)Online publication date: Jun-2024
  • (2024)EBERT: A lightweight expression-enhanced large-scale pre-trained language model for mathematics educationKnowledge-Based Systems10.1016/j.knosys.2024.112118300(112118)Online publication date: Sep-2024
  • (2023)A Mathematical Word Problem Generator with Structure Planning and Knowledge EnhancementProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3539618.3591937(1750-1754)Online publication date: 19-Jul-2023
  • (2023)PQSCT: Pseudo-Siamese BERT for Concept Tagging With Both Questions and SolutionsIEEE Transactions on Learning Technologies10.1109/TLT.2023.327570716:5(831-846)Online publication date: 1-Oct-2023
  • (2023)STSN: A Skim-Then-Scrutinize Network for Understanding Multimodal Educational Questions2023 3rd International Conference on Digital Society and Intelligent Systems (DSInS)10.1109/DSInS60115.2023.10455419(405-409)Online publication date: 10-Nov-2023
  • (2022)Expression Syntax Information Bottleneck for Math Word ProblemsProceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3477495.3531824(2166-2171)Online publication date: 6-Jul-2022
  • (2022)HGEN: Learning Hierarchical Heterogeneous Graph Encoding for Math Word Problem SolvingIEEE/ACM Transactions on Audio, Speech, and Language Processing10.1109/TASLP.2022.314531430(816-828)Online publication date: 2022
  • (2022)A Cognitive Solver with Autonomously Knowledge Learning for Reasoning Mathematical Answers2022 IEEE International Conference on Data Mining (ICDM)10.1109/ICDM54844.2022.00037(269-278)Online publication date: Nov-2022
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media