research-article

Open access

ARM-Net: Adaptive Relation Modeling Network for Structured Data

Authors:

Meihui ZhangAuthors Info & Claims

SIGMOD '21: Proceedings of the 2021 International Conference on Management of Data

Pages 207 - 220

https://doi.org/10.1145/3448016.3457321

Published: 18 June 2021 Publication History

PDF eReader

Abstract

Relational databases are the de facto standard for storing and querying structured data, and extracting insights from structured data requires advanced analytics. Deep neural networks (DNNs) have achieved super-human prediction performance in particular data types, e.g., images. However, existing DNNs may not produce meaningful results when applied to structured data. The reason is that there are correlations and dependencies across combinations of attribute values in a table, and these do not follow simple additive patterns that can be easily mimicked by a DNN. The number of possible such cross features is combinatorial, making them computationally prohibitive to model. Furthermore, the deployment of learning models in real-world applications has also highlighted the need for interpretability, especially for high-stakes applications, which remains another issue of concern to DNNs. In this paper, we present ARM-Net, an adaptive relation modeling network tailored for structured data, and a lightweight framework ARMOR based on ARM-Net for relational data analytics. The key idea is to model feature interactions with cross features selectively and dynamically, by first transforming the input features into exponential space, and then determining the interaction order and interaction weights adaptively for each cross feature. We propose a novel sparse attention mechanism to dynamically generate the interaction weights given the input tuple, so that we can explicitly model cross features of arbitrary orders with noisy features filtered selectively. Then during model inference, ARM-Net can specify the cross features being used for each prediction for higher accuracy and better interpretability. Our extensive experiments on real-world datasets demonstrate that ARM-Net consistently outperforms existing models and provides more interpretable predictions for data-driven decision making.

Supplementary Material

MP4 File (3448016.3457321.mp4)

Relational databases are the de facto standard for storing and querying structured data, and extracting insights from structured data requires advanced analytics. Deep neural networks (DNNs) have achieved super-human prediction performance in particular data types, e.g., images. However, existing DNNs may not produce meaningful results when applied to structured data. The reason is that there are correlations and dependencies across combinations of attribute values in a table, and these do not follow a simple geometric pattern that can be mimicked by a DNN. The number of possible such ``cross features'' is combinatorial, making them computationally prohibitive to model. Furthermore, the deployment of learning models in real-world applications has highlighted the need for interpretability, especially for high-stakes applications, which remains a major drawback for many DNNs.In this paper, we present ARM-Net, an adaptive relation modeling network tailored for structured data, and a lightweight framework ARMOR based on ARM-Net for relational data analytics, which is designed to be accurate, efficient and interpretable. The key idea is to model feature interactions with cross features selectively and dynamically, by first transforming the input features into exponential space, and then determining interaction weights and the interaction order adaptively for each cross feature. We propose a novel sparse attention mechanism to dynamically generate the interaction weights given the input tuple, so that we can model cross features of arbitrary orders and selectively filter noisy features. Then during model inference, ARM-Net can identify the most informative cross features in an input-aware manner for more accurate prediction and better interpretability. Our extensive experiments on real-world datasets show that ARM-Net consistently outperforms existing models and provides interpretable predictions for data-driven decision making.

Download
43.81 MB

References

[1]

Dario Amodei, Sundaram Ananthanarayanan, Rishita Anubhai, Jingliang Bai, Eric Battenberg, Carl Case, Jared Casper, Bryan Catanzaro, Qiang Cheng, Guoliang Chen, et almbox. 2016. Deep speech 2: End-to-end speech recognition in english and mandarin. In International conference on machine learning. 173--182.

Abstract

Supplementary Material

References

Cited By

Index Terms

Recommendations

Regularized Pairwise Relationship based Analytics for Structured Data

LW-Net: an interpretable network with smart lifting wavelet kernel for mechanical feature extraction and fault diagnosis

Towards web-scale structured web data extraction

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Funding Sources

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

View options

PDF

eReader

Login options

Full Access

Share

Share this Publication link

Share on social media

Affiliations