Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3219819.3219949acmotherconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article
Public Access

Multi-Type Itemset Embedding for Learning Behavior Success

Published: 19 July 2018 Publication History

Abstract

Contextual behavior modeling uses data from multiple contexts to discover patterns for predictive analysis. However, existing behavior prediction models often face difficulties when scaling for massive datasets. In this work, we formulate a behavior as a set of context items of different types (such as decision makers, operators, goals and resources), consider an observable itemset as a behavior success, and propose a novel scalable method, "multi-type itemset embedding", to learn the context items' representations preserving the success structures. Unlike most of existing embedding methods that learn pair-wise proximity from connection between a behavior and one of its items, our method learns item embeddings collectively from interaction among all multi-type items of a behavior, based on which we develop a novel framework, LearnSuc, for (1) predicting the success rate of any set of items and (2) finding complementary items which maximize the probability of success when incorporated into an itemset. Extensive experiments demonstrate both effectiveness and efficency of the proposed framework.

Supplementary Material

MP4 File (wang_multi-type_success.mp4)

References

[1]
Mart'ın Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, and others . 2016. TensorFlow: A System for Large-Scale Machine Learning OSDI, Vol. Vol. 16. 265--283.
[2]
Deepak Agarwal, Bee-Chung Chen, and Bo Long . 2011. Localized factor models for multi-context recommendation KDD. 609--617.
[3]
Alex Beutel, Kenton Murray, Christos Faloutsos, and Alexander J Smola . 2014. Cobafi: collaborative bayesian filtering. In WWW. 97--108.
[4]
Shaosheng Cao, Wei Lu, and Qiongkai Xu . 2015. Grarep: Learning graph representations with global structural information CIKM. ACM, 891--900.
[5]
Pablo Castells, Miriam Fernandez, and David Vallet . 2007. An adaptation of the vector-space model for ontology-based information retrieval. TKDE, Vol. 19, 2 (2007).
[6]
Shiyu Chang, Wei Han, Jiliang Tang, Guo-Jun Qi, Charu C Aggarwal, and Thomas S Huang . 2015. Heterogeneous network embedding via deep architectures KDD. 119--128.
[7]
Ting Chen and Yizhou Sun . 2017. Task-Guided and Path-Augmented Heterogeneous Network Embedding for Author Identification WSDM. 295--304.
[8]
Weizheng Chen, Xianling Mao, Xiangyu Li, Yan Zhang, and Xiaoming Li . 2017. PNE: Label Embedding Enhanced Network Embedding. Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer, 547--560.
[9]
Nemanja Djuric, Jing Zhou, Robin Morris, Mihajlo Grbovic, Vladan Radosavljevic, and Narayan Bhamidipati . 2015. Hate speech detection with comment embeddings. In WWW. 29--30.
[10]
Yuxiao Dong, Nitesh V Chawla, and Ananthram Swami . 2017. metapath2vec: Scalable Representation Learning for Heterogeneous Networks KDD. 135--144.
[11]
Beyza Ermics, Evrim Acar, and A Taylan Cemgil . 2015. Link prediction in heterogeneous data via generalized coupled tensor factorization. DMKD, Vol. 29, 1 (2015), 203--236.
[12]
Yoav Goldberg and Omer Levy . 2014. word2vec Explained: deriving Mikolov et al.'s negative-sampling word-embedding method. arXiv:1402.3722 (2014).
[13]
Aditya Grover and Jure Leskovec . 2016. node2vec: Scalable feature learning for networks. KDD. 855--864.
[14]
Huan Gui, Jialu Liu, Fangbo Tao, Meng Jiang, Brandon Norick, Lance Kaplan, and Jiawei Han . 2017. Embedding Learning with Events in Heterogeneous Information Networks. TKDE (2017).
[15]
Nathan Halko, Per-Gunnar Martinsson, and Joel A Tropp . 2011. Finding structure with randomness: Probabilistic algorithms for constructing approximate matrix decompositions. SIAM review, Vol. 53, 2 (2011), 217--288.
[16]
Yuheng Hu, Fei Wang, and Subbarao Kambhampati . 2013. Listening to the Crowd: Automated Analysis of Events via Aggregated Twitter Sentiment. IJCAI. 2640--2646.
[17]
Zhipeng Huang and Nikos Mamoulis . 2017. Heterogeneous Information Network Embedding for Meta Path based Proximity. arXiv:1701.05291 (2017).
[18]
Mohsen Jamali and Laks Lakshmanan . 2013. HeteroMF: recommendation in heterogeneous information networks using context dependent factor models. In WWW. 643--654.
[19]
Meng Jiang, Peng Cui, Fei Wang, Xinran Xu, Wenwu Zhu, and Shiqiang Yang . 2014. Fema: flexible evolutionary multi-faceted analysis for dynamic behavioral pattern discovery KDD. 1186--1195.
[20]
Meng Jiang, Peng Cui, Nicholas Jing Yuan, Xing Xie, and Shiqiang Yang . 2016 a. Little Is Much: Bridging Cross-Platform Behaviors through Overlapped Crowds. AAAI. 13--19.
[21]
Meng Jiang, Christos Faloutsos, and Jiawei Han . 2016 b. Catchtartan: Representing and summarizing dynamic multicontextual behaviors KDD. 945--954.
[22]
Quoc Le and Tomas Mikolov . 2014. Distributed representations of sentences and documents ICML. 1188--1196.
[23]
Defu Lian, Zhenyu Zhang, Yong Ge, Fuzheng Zhang, Nicholas Jing Yuan, and Xing Xie . 2016. Regularized Content-Aware Tensor Factorization Meets Temporal-Aware Location Recommendation ICDM. 1029--1034.
[24]
David Matsumoto . 2007. Culture, context, and behavior. Journal of personality Vol. 75, 6 (2007), 1285--1320.
[25]
Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean . 2013 a. Efficient estimation of word representations in vector space. arXiv:1301.3781 (2013).
[26]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean . 2013 b. Distributed representations of words and phrases and their compositionality NIPS. 3111--3119.
[27]
Maximilian Nickel and Douwe Kiela . 2017. Poincaré Embeddings for Learning Hierarchical Representations. arXiv:1705.08039 (2017).
[28]
Jeffrey Pennington, Richard Socher, and Christopher Manning . 2014. Glove: Global vectors for word representation. In EMNLP. 1532--1543.
[29]
Bryan Perozzi, Rami Al-Rfou, and Steven Skiena . 2014. Deepwalk: Online learning of social representations KDD. 701--710.
[30]
Ioakeim Perros, Evangelos E Papalexakis, Fei Wang, Richard Vuduc, Elizabeth Searles, Michael Thompson, and Jimeng Sun . 2017. SPARTan: Scalable PARAFAC2 for Large & Sparse Data KDD.
[31]
Benjamin Recht, Christopher Re, Stephen Wright, and Feng Niu . 2011. Hogwild: A lock-free approach to parallelizing stochastic gradient descent NIPS. 693--701.
[32]
Steffen Rendle and Lars Schmidt-Thieme . 2010. Pairwise interaction tensor factorization for personalized tag recommendation WSDM. 81--90.
[33]
Alan Said, Shlomo Berkovsky, and Ernesto W De Luca . 2010. Putting things in context: Challenge on context-aware movie recommendation Workshop on Context-Aware Movie Recommendation. 2--6.
[34]
Jie Tang, Tiancheng Lou, and Jon Kleinberg . 2012. Inferring social ties across heterogeneous networks WSDM. 743--752.
[35]
Jian Tang, Meng Qu, and Qiaozhu Mei . 2015 a. Pte: Predictive text embedding through large-scale heterogeneous text networks KDD. 1165--1174.
[36]
Jian Tang, Meng Qu, Mingzhe Wang, Ming Zhang, Jun Yan, and Qiaozhu Mei . 2015 b. Line: Large-scale information network embedding. WWW. 1067--1077.
[37]
Laurens Van Der Maaten, Eric Postma, and Jaap Van den Herik . 2009. Dimensionality reduction: a comparative. J Mach Learn Res Vol. 10 (2009), 66--71.
[38]
Daixin Wang, Peng Cui, and Wenwu Zhu . 2016. Structural deep network embedding. In KDD. 1225--1234.
[39]
Cheng Yang, Zhiyuan Liu, Deli Zhao, Maosong Sun, and Edward Y Chang . 2015. Network Representation Learning with Rich Text Information. IJCAI. 2111--2117.
[40]
Kai Yang, Xiang Li, Haifeng Liu, Jing Mei, Guo Tong Xie, Junfeng Zhao, Bing Xie, and Fei Wang . 2017. TaGiTeD: Predictive Task Guided Tensor Decomposition for Representation Learning from Electronic Health Records. AAAI. 2824--2830.

Cited By

View all
  • (2024)Beyond Relevance: Factor-level Causal Explanation for User Travel Decisions with Counterfactual Data AugmentationACM Transactions on Information Systems10.1145/365367342:5(1-31)Online publication date: 29-Apr-2024
  • (2023)Deep Multimodal Complementarity LearningIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.316518034:12(10213-10224)Online publication date: Dec-2023
  • (2022)A Synergistic Approach for Graph Anomaly Detection With Pattern Mining and Feature LearningIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2021.310260933:6(2393-2405)Online publication date: Jun-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
KDD '18: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
July 2018
2925 pages
ISBN:9781450355520
DOI:10.1145/3219819
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 July 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. behavior data embedding
  2. behavior modeling
  3. itemset embedding
  4. recommender systems
  5. representation learning

Qualifiers

  • Research-article

Funding Sources

Conference

KDD '18
Sponsor:

Acceptance Rates

KDD '18 Paper Acceptance Rate 107 of 983 submissions, 11%;
Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)111
  • Downloads (Last 6 weeks)18
Reflects downloads up to 13 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Beyond Relevance: Factor-level Causal Explanation for User Travel Decisions with Counterfactual Data AugmentationACM Transactions on Information Systems10.1145/365367342:5(1-31)Online publication date: 29-Apr-2024
  • (2023)Deep Multimodal Complementarity LearningIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.316518034:12(10213-10224)Online publication date: Dec-2023
  • (2022)A Synergistic Approach for Graph Anomaly Detection With Pattern Mining and Feature LearningIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2021.310260933:6(2393-2405)Online publication date: Jun-2022
  • (2022)HE-SNE: Heterogeneous Event Sequence-based Streaming Network Embedding for Dynamic Behaviors2022 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN55064.2022.9892872(1-8)Online publication date: 18-Jul-2022
  • (2021)Modeling Complementarity in Behavior Data with Multi-Type Itemset EmbeddingACM Transactions on Intelligent Systems and Technology10.1145/345872412:4(1-25)Online publication date: 28-Jun-2021
  • (2021)Modeling Co-evolution of Attributed and Structural Information in Graph SequenceIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2021.3094332(1-1)Online publication date: 2021
  • (2021)Mining Frequent and Rare Itemsets With Weighted Supports Using Additive Neural Itemset Embedding2021 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN52387.2021.9534070(1-8)Online publication date: 2021
  • (2021)Collectively Learned Multi-level Spatial Embeddings for Residential Rental Price Prediction2021 IEEE International Conference on Big Data (Big Data)10.1109/BigData52589.2021.9671927(274-283)Online publication date: 15-Dec-2021
  • (2020)Mix2Vec: Unsupervised Mixed Data Representation2020 IEEE 7th International Conference on Data Science and Advanced Analytics (DSAA)10.1109/DSAA49011.2020.00024(118-127)Online publication date: Oct-2020
  • (2020)SmartFund: Predicting Research Outcomes with Machine Learning and Natural Language Processing2020 IEEE International Conference on Big Data (Big Data)10.1109/BigData50022.2020.9378206(2857-2865)Online publication date: 10-Dec-2020
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media