research-article

Public Access

Multi-Type Itemset Embedding for Learning Behavior Success

Authors:

Zachary Eberhart,

Nitesh V. ChawlaAuthors Info & Claims

KDD '18: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Pages 2397 - 2406

https://doi.org/10.1145/3219819.3219949

Published: 19 July 2018 Publication History

Abstract

Contextual behavior modeling uses data from multiple contexts to discover patterns for predictive analysis. However, existing behavior prediction models often face difficulties when scaling for massive datasets. In this work, we formulate a behavior as a set of context items of different types (such as decision makers, operators, goals and resources), consider an observable itemset as a behavior success, and propose a novel scalable method, "multi-type itemset embedding", to learn the context items' representations preserving the success structures. Unlike most of existing embedding methods that learn pair-wise proximity from connection between a behavior and one of its items, our method learns item embeddings collectively from interaction among all multi-type items of a behavior, based on which we develop a novel framework, LearnSuc, for (1) predicting the success rate of any set of items and (2) finding complementary items which maximize the probability of success when incorporated into an itemset. Extensive experiments demonstrate both effectiveness and efficency of the proposed framework.

Supplementary Material

MP4 File (wang_multi-type_success.mp4)

Download
306.52 MB

References

[1]

Mart'ın Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, and others . 2016. TensorFlow: A System for Large-Scale Machine Learning OSDI, Vol. Vol. 16. 265--283.

Digital Library

[2]

Deepak Agarwal, Bee-Chung Chen, and Bo Long . 2011. Localized factor models for multi-context recommendation KDD. 609--617.

Digital Library

[3]

Alex Beutel, Kenton Murray, Christos Faloutsos, and Alexander J Smola . 2014. Cobafi: collaborative bayesian filtering. In WWW. 97--108.

Digital Library

[4]

Shaosheng Cao, Wei Lu, and Qiongkai Xu . 2015. Grarep: Learning graph representations with global structural information CIKM. ACM, 891--900.

Digital Library

[5]

Pablo Castells, Miriam Fernandez, and David Vallet . 2007. An adaptation of the vector-space model for ontology-based information retrieval. TKDE, Vol. 19, 2 (2007).

Digital Library

[6]

Shiyu Chang, Wei Han, Jiliang Tang, Guo-Jun Qi, Charu C Aggarwal, and Thomas S Huang . 2015. Heterogeneous network embedding via deep architectures KDD. 119--128.

Digital Library

[7]

Ting Chen and Yizhou Sun . 2017. Task-Guided and Path-Augmented Heterogeneous Network Embedding for Author Identification WSDM. 295--304.

Digital Library

[8]

Weizheng Chen, Xianling Mao, Xiangyu Li, Yan Zhang, and Xiaoming Li . 2017. PNE: Label Embedding Enhanced Network Embedding. Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer, 547--560.

[9]

Nemanja Djuric, Jing Zhou, Robin Morris, Mihajlo Grbovic, Vladan Radosavljevic, and Narayan Bhamidipati . 2015. Hate speech detection with comment embeddings. In WWW. 29--30.

Digital Library

[10]

Yuxiao Dong, Nitesh V Chawla, and Ananthram Swami . 2017. metapath2vec: Scalable Representation Learning for Heterogeneous Networks KDD. 135--144.

Digital Library

[11]

Beyza Ermics, Evrim Acar, and A Taylan Cemgil . 2015. Link prediction in heterogeneous data via generalized coupled tensor factorization. DMKD, Vol. 29, 1 (2015), 203--236.

Digital Library

[12]

Yoav Goldberg and Omer Levy . 2014. word2vec Explained: deriving Mikolov et al.'s negative-sampling word-embedding method. arXiv:1402.3722 (2014).

[13]

Aditya Grover and Jure Leskovec . 2016. node2vec: Scalable feature learning for networks. KDD. 855--864.

Digital Library

[14]

Huan Gui, Jialu Liu, Fangbo Tao, Meng Jiang, Brandon Norick, Lance Kaplan, and Jiawei Han . 2017. Embedding Learning with Events in Heterogeneous Information Networks. TKDE (2017).

[15]

Nathan Halko, Per-Gunnar Martinsson, and Joel A Tropp . 2011. Finding structure with randomness: Probabilistic algorithms for constructing approximate matrix decompositions. SIAM review, Vol. 53, 2 (2011), 217--288.

Digital Library

[16]

Yuheng Hu, Fei Wang, and Subbarao Kambhampati . 2013. Listening to the Crowd: Automated Analysis of Events via Aggregated Twitter Sentiment. IJCAI. 2640--2646.

Digital Library

[17]

Zhipeng Huang and Nikos Mamoulis . 2017. Heterogeneous Information Network Embedding for Meta Path based Proximity. arXiv:1701.05291 (2017).

[18]

Mohsen Jamali and Laks Lakshmanan . 2013. HeteroMF: recommendation in heterogeneous information networks using context dependent factor models. In WWW. 643--654.

Digital Library

[19]

Meng Jiang, Peng Cui, Fei Wang, Xinran Xu, Wenwu Zhu, and Shiqiang Yang . 2014. Fema: flexible evolutionary multi-faceted analysis for dynamic behavioral pattern discovery KDD. 1186--1195.

Digital Library

[20]

Meng Jiang, Peng Cui, Nicholas Jing Yuan, Xing Xie, and Shiqiang Yang . 2016 a. Little Is Much: Bridging Cross-Platform Behaviors through Overlapped Crowds. AAAI. 13--19.

Digital Library

[21]

Meng Jiang, Christos Faloutsos, and Jiawei Han . 2016 b. Catchtartan: Representing and summarizing dynamic multicontextual behaviors KDD. 945--954.

Digital Library

[22]

Quoc Le and Tomas Mikolov . 2014. Distributed representations of sentences and documents ICML. 1188--1196.

Digital Library

[23]

Defu Lian, Zhenyu Zhang, Yong Ge, Fuzheng Zhang, Nicholas Jing Yuan, and Xing Xie . 2016. Regularized Content-Aware Tensor Factorization Meets Temporal-Aware Location Recommendation ICDM. 1029--1034.

[24]

David Matsumoto . 2007. Culture, context, and behavior. Journal of personality Vol. 75, 6 (2007), 1285--1320.

[25]

Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean . 2013 a. Efficient estimation of word representations in vector space. arXiv:1301.3781 (2013).

[26]

Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean . 2013 b. Distributed representations of words and phrases and their compositionality NIPS. 3111--3119.

Digital Library

[27]

Maximilian Nickel and Douwe Kiela . 2017. Poincaré Embeddings for Learning Hierarchical Representations. arXiv:1705.08039 (2017).

[28]

Jeffrey Pennington, Richard Socher, and Christopher Manning . 2014. Glove: Global vectors for word representation. In EMNLP. 1532--1543.

[29]

Bryan Perozzi, Rami Al-Rfou, and Steven Skiena . 2014. Deepwalk: Online learning of social representations KDD. 701--710.

Digital Library

[30]

Ioakeim Perros, Evangelos E Papalexakis, Fei Wang, Richard Vuduc, Elizabeth Searles, Michael Thompson, and Jimeng Sun . 2017. SPARTan: Scalable PARAFAC2 for Large & Sparse Data KDD.

Digital Library

[31]

Benjamin Recht, Christopher Re, Stephen Wright, and Feng Niu . 2011. Hogwild: A lock-free approach to parallelizing stochastic gradient descent NIPS. 693--701.

Digital Library

[32]

Steffen Rendle and Lars Schmidt-Thieme . 2010. Pairwise interaction tensor factorization for personalized tag recommendation WSDM. 81--90.

Digital Library

[33]

Alan Said, Shlomo Berkovsky, and Ernesto W De Luca . 2010. Putting things in context: Challenge on context-aware movie recommendation Workshop on Context-Aware Movie Recommendation. 2--6.

Digital Library

[34]

Jie Tang, Tiancheng Lou, and Jon Kleinberg . 2012. Inferring social ties across heterogeneous networks WSDM. 743--752.

Digital Library

[35]

Jian Tang, Meng Qu, and Qiaozhu Mei . 2015 a. Pte: Predictive text embedding through large-scale heterogeneous text networks KDD. 1165--1174.

Digital Library

[36]

Jian Tang, Meng Qu, Mingzhe Wang, Ming Zhang, Jun Yan, and Qiaozhu Mei . 2015 b. Line: Large-scale information network embedding. WWW. 1067--1077.

Digital Library

[37]

Laurens Van Der Maaten, Eric Postma, and Jaap Van den Herik . 2009. Dimensionality reduction: a comparative. J Mach Learn Res Vol. 10 (2009), 66--71.

[38]

Daixin Wang, Peng Cui, and Wenwu Zhu . 2016. Structural deep network embedding. In KDD. 1225--1234.

Digital Library

[39]

Cheng Yang, Zhiyuan Liu, Deli Zhao, Maosong Sun, and Edward Y Chang . 2015. Network Representation Learning with Rich Text Information. IJCAI. 2111--2117.

Digital Library

[40]

Kai Yang, Xiang Li, Haifeng Liu, Jing Mei, Guo Tong Xie, Junfeng Zhao, Bing Xie, and Fei Wang . 2017. TaGiTeD: Predictive Task Guided Tensor Decomposition for Representation Learning from Electronic Health Records. AAAI. 2824--2830.

Cited By

Li HGu JLu XShen DLiu YDeng YShi GXiong H(2024)Beyond Relevance: Factor-level Causal Explanation for User Travel Decisions with Counterfactual Data AugmentationACM Transactions on Information Systems10.1145/365367342:5(1-31)Online publication date: 29-Apr-2024
https://dl.acm.org/doi/10.1145/3653673
Wang DZhao TYu WChawla NJiang M(2023)Deep Multimodal Complementarity LearningIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.316518034:12(10213-10224)Online publication date: Dec-2023
https://doi.org/10.1109/TNNLS.2022.3165180
Zhao TJiang TShah NJiang M(2022)A Synergistic Approach for Graph Anomaly Detection With Pattern Mining and Feature LearningIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2021.310260933:6(2393-2405)Online publication date: Jun-2022
https://doi.org/10.1109/TNNLS.2021.3102609
Show More Cited By

Index Terms

Multi-Type Itemset Embedding for Learning Behavior Success
1. Computing methodologies
  1. Machine learning
    1. Learning paradigms
      1. Unsupervised learning
        Dimensionality reduction and manifold learning
    2. Machine learning approaches
      1. Learning latent representations
2. Information systems
  1. Information systems applications
    1. Decision support systems

Recommendations

TUBE: Embedding Behavior Outcomes for Predicting Success
KDD '19: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Given a project plan and the goal, can we predict the plan's success rate? The key challenge is to learn the feature vectors of billions of the plan's components for effective prediction. However, existing methods did not model the behavior outcomes but ...
Modeling Complementarity in Behavior Data with Multi-Type Itemset Embedding
People are looking for complementary contexts, such as team members of complementary skills for project team building and/or reading materials of complementary knowledge for effective student learning, to make their behaviors more likely to be successful. ...
Behavior Informatics to Discover Behavior Insight for Active and Tailored Client Management
KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Behavior is ubiquitous, and behavior intelligence and insight play an important role in data understanding and business problem-solving. Behavior Informatics [1,2] emerges as an important tool for discovering behavior intelligence and behavior insight. ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

KDD '18: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

July 2018

2925 pages

ISBN:9781450355520

DOI:10.1145/3219819

General Chairs:
Yike Guo
Imperial College London
,
Faisal Farooq
IBM

Copyright © 2018 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 July 2018

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Science Foundation
Army Research Laboratory

Conference

KDD '18

Sponsor:

KDD '18: The 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

August 19 - 23, 2018

London, United Kingdom

Acceptance Rates

KDD '18 Paper Acceptance Rate 107 of 983 submissions, 11%;

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

16
Total Citations
View Citations
2,091
Total Downloads

Downloads (Last 12 months)111
Downloads (Last 6 weeks)18

Reflects downloads up to 13 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Li HGu JLu XShen DLiu YDeng YShi GXiong H(2024)Beyond Relevance: Factor-level Causal Explanation for User Travel Decisions with Counterfactual Data AugmentationACM Transactions on Information Systems10.1145/365367342:5(1-31)Online publication date: 29-Apr-2024
https://dl.acm.org/doi/10.1145/3653673
Wang DZhao TYu WChawla NJiang M(2023)Deep Multimodal Complementarity LearningIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.316518034:12(10213-10224)Online publication date: Dec-2023
https://doi.org/10.1109/TNNLS.2022.3165180
Zhao TJiang TShah NJiang M(2022)A Synergistic Approach for Graph Anomaly Detection With Pattern Mining and Feature LearningIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2021.310260933:6(2393-2405)Online publication date: Jun-2022
https://doi.org/10.1109/TNNLS.2021.3102609
Wang YShen JSong YWang SZhang M(2022)HE-SNE: Heterogeneous Event Sequence-based Streaming Network Embedding for Dynamic Behaviors2022 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN55064.2022.9892872(1-8)Online publication date: 18-Jul-2022
https://doi.org/10.1109/IJCNN55064.2022.9892872
Wang DZeng QChawla NJiang M(2021)Modeling Complementarity in Behavior Data with Multi-Type Itemset EmbeddingACM Transactions on Intelligent Systems and Technology10.1145/345872412:4(1-25)Online publication date: 28-Jun-2021
https://dl.acm.org/doi/10.1145/3458724
Wang DZhang ZMa YZhao TJiang TChawla NJiang M(2021)Modeling Co-evolution of Attributed and Structural Information in Graph SequenceIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2021.3094332(1-1)Online publication date: 2021
https://doi.org/10.1109/TKDE.2021.3094332
Ji YOhsawa Y(2021)Mining Frequent and Rare Itemsets With Weighted Supports Using Additive Neural Itemset Embedding2021 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN52387.2021.9534070(1-8)Online publication date: 2021
https://doi.org/10.1109/IJCNN52387.2021.9534070
Wang WPan C(2021)Collectively Learned Multi-level Spatial Embeddings for Residential Rental Price Prediction2021 IEEE International Conference on Big Data (Big Data)10.1109/BigData52589.2021.9671927(274-283)Online publication date: 15-Dec-2021
https://doi.org/10.1109/BigData52589.2021.9671927
Zhu CZhang QCao LAbrahamyan A(2020)Mix2Vec: Unsupervised Mixed Data Representation2020 IEEE 7th International Conference on Data Science and Advanced Analytics (DSAA)10.1109/DSAA49011.2020.00024(118-127)Online publication date: Oct-2020
https://doi.org/10.1109/DSAA49011.2020.00024
Alaphat AJiang M(2020)SmartFund: Predicting Research Outcomes with Machine Learning and Natural Language Processing2020 IEEE International Conference on Big Data (Big Data)10.1109/BigData50022.2020.9378206(2857-2865)Online publication date: 10-Dec-2020
https://doi.org/10.1109/BigData50022.2020.9378206
Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Media

Figures

Other

Tables

View Table of Contents