MMSR: Symbolic regression is a multi-modal information fusion task

Published: 01 February 2025

Abstract

Mathematical formulas are the crystallization of human wisdom accumulated over thousands of years of exploring the laws of nature. Describing the complex laws of nature with a concise mathematical formula is a constant pursuit of scientists and a great challenge for artificial intelligence; this field is called symbolic regression (SR). Symbolic regression was originally formulated as a combinatorial optimization problem, and Genetic Programming (GP) and reinforcement learning algorithms were used to solve it. However, GP is sensitive to hyperparameters, and both types of algorithms are inefficient. To address this, researchers have treated the mapping from data to expressions as a translation problem and introduced corresponding large-scale pre-trained models. However, data and expression skeletons do not share the clear word correspondences that two natural languages do; they are more like two modalities (e.g., image and text). Therefore, in this paper we propose MMSR, which solves the SR problem as a pure multi-modal problem and introduces contrastive learning into the training process for modal alignment, facilitating later modal feature fusion. Notably, to better promote modal feature fusion, we train the contrastive learning loss and the other losses at the same time, which requires only one-stage training, instead of first training the contrastive loss and then the other losses; our experiments show that training them together allows the feature extraction module and the feature fusion module to adapt to each other better. Experimental results show that, compared with multiple large-scale pre-trained baselines, MMSR achieves state-of-the-art results on multiple mainstream datasets, including SRBench. Our code is open source at https://github.com/1716757342/MMSR.
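To make the joint training strategy concrete, here is a minimal PyTorch sketch (an illustration, not the paper's implementation; the encoder/decoder modules `data_enc`, `expr_enc`, `expr_dec` and the batch shapes are hypothetical placeholders). A CLIP-style InfoNCE contrastive loss aligns the data features with the expression-skeleton features, a standard cross-entropy loss trains the autoregressive expression decoder, and the two losses are summed so that one backward pass updates the feature extraction and feature fusion modules together:

```python
# Minimal sketch (not the authors' code) of one-stage joint training:
# an InfoNCE contrastive loss for modal alignment plus a generation
# loss for the expression decoder, optimized in a single step.
import torch
import torch.nn.functional as F

def joint_training_step(data_enc, expr_enc, expr_dec, optimizer,
                        data_batch, token_batch, temperature=0.07,
                        contrastive_weight=1.0):
    # Encode both modalities into a shared embedding space.
    z_data = F.normalize(data_enc(data_batch), dim=-1)   # (B, d)
    z_expr = F.normalize(expr_enc(token_batch), dim=-1)  # (B, d)

    # InfoNCE: matching (data, expression) pairs lie on the diagonal.
    logits = z_data @ z_expr.t() / temperature           # (B, B)
    targets = torch.arange(logits.size(0), device=logits.device)
    loss_con = (F.cross_entropy(logits, targets) +
                F.cross_entropy(logits.t(), targets)) / 2

    # Generation loss: predict each skeleton token from the data
    # features and the previous tokens (teacher forcing).
    token_logits = expr_dec(z_data, token_batch[:, :-1])  # (B, T-1, V)
    loss_gen = F.cross_entropy(
        token_logits.reshape(-1, token_logits.size(-1)),
        token_batch[:, 1:].reshape(-1))

    # One-stage training: both losses share a single backward pass.
    loss = loss_gen + contrastive_weight * loss_con
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```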

Highlights

Propose a new multi-modal approach for symbolic regression.
Achieve effective modality alignment through contrastive learning.
Design a novel joint training strategy based on a multi-loss objective.
Our code is open source at https://github.com/1716757342/MMSR.

Published In

Information Fusion, Volume 114, Issue C (February 2025), 1192 pages

Publisher

Elsevier Science Publishers B.V., Netherlands

Author Tags

  1. Symbolic regression
  2. Multi-modal
  3. Information fusion
  4. Contrastive learning
  5. Modal alignment

Qualifiers

  • Research-article
