Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3297001.3297060acmotherconferencesArticle/Chapter ViewAbstractPublication PagescodsConference Proceedingsconference-collections
short-paper

Network Sampling Using k-hop Random Walks for Heterogeneous Network Embedding

Published: 03 January 2019 Publication History

Abstract

Capturing neighborhood information by generating node sequences or node samples is an important prerequisite step for many of the neural network embedding approaches. Majority of the recent studies on neural network embedding exploit random walk as a sampling method, which traverses through adjacent neighbors to generate the node sequences. Traversing through only immediate neighbor may not be suitable particularly for heterogeneous information networks (HIN) where adjacent nodes tend to belong to different types. Therefore, this paper proposes a random walk based sampling approach (RW-k) which generates the node sequences such that adjacent nodes in the sequence are separated by k edges preserving the k-hop proximity characteristics. We exploit the node sequences generated using RW-k sampling for network embedding using skip-gram model. Thereafter, the performance of network embedding is evaluated on future co-authorship prediction task over three heterogeneous bibliographic networks. We compare the efficacy of network embedding using proposed RW-k sampling with recently proposed network embedding models based on random walks namely, Metapath2vec, Node2vec and VERSE. It is evident that the RW-k yields better quality of embedding and out-performs baselines in majority of the cases.

References

[1]
Hongyun Cai, Vincent W Zheng, and Kevin Chang. 2018. A comprehensive survey of graph embedding: problems, techniques and applications. TKDE (2018).
[2]
Yuxiao Dong, Nitesh V Chawla, and Ananthram Swami. 2017. metapath2vec: Scalable representation learning for heterogeneous networks. In SIGKDD. 135--144.
[3]
Aditya Grover and Jure Leskovec. 2016. Node2Vec: Scalable Feature Learning for Networks. In SIGKDD. ACM, 855--864.
[4]
Taher H Haveliwala. 2002. Topic-sensitive pagerank. In WWW. ACM, 517--526.
[5]
Xiangnan Kong, Philip S Yu, Ying Ding, and David J Wild. 2012. Meta path-based collective classification in heterogeneous information networks. In CIKM. 1567--1571.
[6]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In NIPS. 3111--3119.
[7]
Bryan Perozzi, Rami Al-Rfou, and Steven Skiena. 2014. Deepwalk: Online learning of social representations. In SIGKDD. 701--710.
[8]
Anton Tsitsulin, Davide Mottin, Panagiotis Karras, and Emmanuel Müller. 2018. VERSE: Versatile Graph Embeddings from Similarity Measures. In WWW. 539--548.
[9]
Yubao Wu, Xiang Zhang, Yuchen Bian, Zhipeng Cai, Xiang Lian, Xueting Liao, and Fengpan Zhao. 2018. Second-order random walk-based proximity measures in graph analysis: formulations and algorithms. VLDB 27, 1 (2018), 127--152.
[10]
Deqing Yang, Yanghua Xiao, Bo Xu, Hanghang Tong, Wei Wang, and Sheng Huang. 2012. Which topic will you follow?. In ECML PKDD. 597--612.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
CODS-COMAD '19: Proceedings of the ACM India Joint International Conference on Data Science and Management of Data
January 2019
380 pages
ISBN:9781450362078
DOI:10.1145/3297001
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 January 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Co-authorship
  2. DBLP
  3. Heterogeneous Network
  4. Network Embedding
  5. Network Sampling
  6. Random Walk

Qualifiers

  • Short-paper
  • Research
  • Refereed limited

Conference

CoDS-COMAD '19
CoDS-COMAD '19: 6th ACM IKDD CoDS and 24th COMAD
January 3 - 5, 2019
Kolkata, India

Acceptance Rates

CODS-COMAD '19 Paper Acceptance Rate 62 of 198 submissions, 31%;
Overall Acceptance Rate 197 of 680 submissions, 29%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 212
    Total Downloads
  • Downloads (Last 12 months)6
  • Downloads (Last 6 weeks)0
Reflects downloads up to 01 Mar 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media