Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1835804.1835855acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Cold start link prediction

Published: 25 July 2010 Publication History

Abstract

In the traditional link prediction problem, a snapshot of a social network is used as a starting point to predict, by means of graph-theoretic measures, the links that are likely to appear in the future. In this paper, we introduce cold start link prediction as the problem of predicting the structure of a social network when the network itself is totally missing while some other information regarding the nodes is available. We propose a two-phase method based on the bootstrap probabilistic graph. The first phase generates an implicit social network under the form of a probabilistic graph. The second phase applies probabilistic graph-based measures to produce the final prediction. We assess our method empirically over a large data collection obtained from Flickr, using interest groups as the initial information. The experiments confirm the effectiveness of our approach.

Supplementary Material

JPG File (kdd2010_leroy_cslp_01.jpg)
MOV File (kdd2010_leroy_cslp_01.mov)

References

[1]
L. Adamic and E. Adar. Friends and neighbors on the web. Soc. Networks, 25(3), 2003.
[2]
L. Backstrom, C. Dwork, and J. Kleinberg. Wherefore art thou r3579x?: anonymized social networks, hidden patterns, and structural steganography. In Proc. of the Sixteenth Int. Conf. on World Wide Web (WWW'07). pages 181--190, 2007.
[3]
J. Baumes, M. Goldberg, M. Hayvanovych, M. Magdon-Ismail, W. Wallace, and M. Zaki. Finding hidden group structure in a stream of communications. In Proc. of the IEEE Int. Conf. on Intelligence and Security Informatics, (ISI'06), pages 201--212, 2006.
[4]
M. Cha, A. Mislove, and K. Gummadi. A measurement-driven analysis of information propagation in the Flickr social network. In Proc. of the Eighteenth Int. Conf. on World Wide Web (WWW'09), pages 721--730, 2009.
[5]
R. Chellappa and A. Jain. Markov random fields: theory and application. Boston Academic Press, 1993.
[6]
A. Clauset, C. Moore, and M. Newman. Hierarchical structure and the prediction of missing links in networks. Nature, 453:98--101, 2008.
[7]
P. Domingos and M. Richardson. Mining the network value of customers. In Proc. of the Seventh ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD'01), pages 57--66, 2001.
[8]
P. Domingos and M. Richardson. Markov Logic: A unifying framework for statistical relational learning. In Proc. of the ICML'04 Workshop on Statistical Relational Learning and its Connections to Other Fields, pages 49--54, 2004.
[9]
N. Eagle and A. Pentland. Reality mining: Sensing complex social systems. Personal and Ubiquitous Computing, 10(4):255--268, 2006.
[10]
D. Fogaras, B. Racz, K. Csalogany, and T. Sarlos. Towards scaling fully personalized pagerank: Algorithms, lower bounds, and experiments. Journal of Internet Mathematics, 2(3):333--358, 2005.
[11]
J. Gao and P.-N. Tan. Converting output scores from outlier detection algorithms into probability estimates. In Proc. of the Sixth IEEE Int. Conf. on Data Mining (ICDM'06), pages 212--221, 2006.
[12]
L. Getoor, N. Friedman, D. Koller, and B. Taskar. Learning probabilistic models of link structure. Machine Learning, 3:679--707, 2003.
[13]
M. Hay, G. Miklau, D. Jensen, D. F. Towsley, and P. Weis. Resisting structural re-identification in anonymized social networks. PVLDB, 1(1):102--114, 2008.
[14]
D. Kempe, J. M. Kleinberg, and E. Tardos. Maximizing the spread of influence through a social network. In Proc. of the Ninth ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD'03), pages 137--146, 2003.
[15]
D. Liben-Nowell and J. Kleinberg. The link-prediction problem for social networks. Journal of the American Society for Information Science and Technology, 58(7):1019--1031, 2007.
[16]
A. Narayanan and V. Shmatikov. De-anonymizing social networks. In Proc. of the Thirtieth IEEE Symp. on Security and Privacy, pages 173--187, 2009.
[17]
J. O'Madadhain, J. Hutchins, and P. Smyth. Prediction and ranking algorithms for event-based network data. ACM SIGKDD Exploration Newsletter, 7(2):23--30, 2005.
[18]
M. Potamias, F. Bonchi, A. Gionis, and G. Kollios. k-nearest neighbors in uncertain graphs. In Proc. of the VLDB, the Thirty-Sixth Int. Conf. on Very Large Databases (PVLDB 2010), volume 3, 2010.
[19]
F. J. Provost, T. Fawcett, and R. Kohavi. The case against accuracy estimation for comparing induction algorithms. In Proceedings of the Fifteenth Int. Conf. on Machine Learning (ICML'98), pages 445--453, 1998.
[20]
A. Singla and I. Weber. Camera brand congruence in the Flickr social graph. In Proc. of the Second ACM Int. Conf. on Web Search and Data Mining (WSDM'09), pages 252--261, 2009.
[21]
B. Taskar, M. Wong, P. Abbeel, and D. Koller. Link prediction in relational data. In Neural Information Processing Systems, volume 15, 2003.
[22]
W. Van Der Aalst, H. Reijers, and M. Song. Discovering social networks from event logs. Computer Supported Cooperative Work, 14(6):549--593, 2005.
[23]
B. Zadrozny and C. Elkan. Obtaining calibrated probability estimates from decision trees and naive bayesian classifiers. In Proc. of the Eighteenth Int. Conf. on Machine Learning (ICML'01), pages 609--616, 2001.
[24]
B. Zadrozny and C. Elkan. Transforming classifier scores into accurate multiclass probability estimates. In Proc. of the Eighth ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD'02), pages 694--699, 2002.
[25]
J. Zhang and Y. Yang. Probabilistic score estimation with piecewise logistic regression. In Proc. of the Twenty-first Int. Conf. on Machine Learning (ICML'04), page 115, 2004.
[26]
E. Zheleva and L. Getoor. To join or not to join: the illusion of privacy in social networks with mixed public and private user profiles. In Proc. of the Eighteenth Int. Conf. on World Wide Web (WWW'09), pages 531--540, 2009.

Cited By

View all
  • (2024)Predicting hyperlinks via weighted hypernetwork loop structureThe European Physical Journal Special Topics10.1140/epjs/s11734-024-01145-y233:4(863-877)Online publication date: 14-Mar-2024
  • (2024)IM-META: Influence Maximization Using Node Metadata in Networks With Unknown TopologyIEEE Transactions on Network Science and Engineering10.1109/TNSE.2024.336290311:3(3148-3160)Online publication date: May-2024
  • (2024)Disentangled Hierarchical Attention Graph Neural Network for RecommendationAdvanced Intelligent Computing Technology and Applications10.1007/978-981-97-5663-6_35(415-426)Online publication date: 1-Aug-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '10: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
July 2010
1240 pages
ISBN:9781450300551
DOI:10.1145/1835804
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 July 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. link prediction
  2. probabilistic graph
  3. social networks

Qualifiers

  • Research-article

Conference

KDD '10
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)17
  • Downloads (Last 6 weeks)1
Reflects downloads up to 22 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Predicting hyperlinks via weighted hypernetwork loop structureThe European Physical Journal Special Topics10.1140/epjs/s11734-024-01145-y233:4(863-877)Online publication date: 14-Mar-2024
  • (2024)IM-META: Influence Maximization Using Node Metadata in Networks With Unknown TopologyIEEE Transactions on Network Science and Engineering10.1109/TNSE.2024.336290311:3(3148-3160)Online publication date: May-2024
  • (2024)Disentangled Hierarchical Attention Graph Neural Network for RecommendationAdvanced Intelligent Computing Technology and Applications10.1007/978-981-97-5663-6_35(415-426)Online publication date: 1-Aug-2024
  • (2023)Health insurance fraud detection by using an attributed heterogeneous information network with a hierarchical attention mechanismBMC Medical Informatics and Decision Making10.1186/s12911-023-02152-023:1Online publication date: 6-Apr-2023
  • (2023)TDAN: Transferable Domain Adversarial Network for Link Prediction in Heterogeneous Social NetworksACM Transactions on Knowledge Discovery from Data10.1145/361022918:1(1-22)Online publication date: 6-Sep-2023
  • (2023)Search Behavior Prediction: A Hypergraph PerspectiveProceedings of the Sixteenth ACM International Conference on Web Search and Data Mining10.1145/3539597.3570403(697-705)Online publication date: 27-Feb-2023
  • (2023)Heterogeneous Graph Neural Network Based on Hierarchical Attention Mechanism2023 IEEE International Conference on Electrical, Automation and Computer Engineering (ICEACE)10.1109/ICEACE60673.2023.10441928(485-489)Online publication date: 29-Dec-2023
  • (2022)Scatter-GNN: A Scatter Graph Neural Network for Prediction of High-Speed Railway Station—A Case Study of Yinchuan–Chongqing HSRApplied Sciences10.3390/app1301015013:1(150)Online publication date: 22-Dec-2022
  • (2022)META-CODE: Community Detection via Exploratory Learning in Topologically Unknown NetworksProceedings of the 31st ACM International Conference on Information & Knowledge Management10.1145/3511808.3557639(4034-4038)Online publication date: 17-Oct-2022
  • (2022)Heterogeneous graph neural networks with denoising for graph embeddingsKnowledge-Based Systems10.1016/j.knosys.2021.107899238:COnline publication date: 28-Feb-2022
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media