Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Social Link Prediction in Online Social Tagging Systems

Published: 01 November 2013 Publication History
  • Get Citation Alerts
  • Abstract

    Social networks have become a popular medium for people to communicate and distribute ideas, content, news, and advertisements. Social content annotation has naturally emerged as a method of categorization and filtering of online information. The unrestricted vocabulary users choose from to annotate content has often lead to an explosion of the size of space in which search is performed. In this article, we propose latent topic models as a principled way of reducing the dimensionality of such data and capturing the dynamics of collaborative annotation process. We propose three generative processes to model latent user tastes with respect to resources they annotate with metadata. We show that latent user interests combined with social clues from the immediate neighborhood of users can significantly improve social link prediction in the online music social media site Last.fm. Most link prediction methods suffer from the high class imbalance problem, resulting in low precision and/or recall. In contrast, our proposed classification schemes for social link recommendation achieve high precision and recall with respect to not only the dominant class (nonexistence of a link), but also with respect to sparse positive instances, which are the most vital in social tie prediction.

    References

    [1]
    Lars Backstrom and Jure Leskovec. 2011. Supervised random walks: Predicting and recommending links in social networks. In Proceedings of the 4th ACM International Conference on Web Search and Data Mining (WSDM’11). ACM, New York, 635--644.
    [2]
    David M. Blei, Andrew Y. Ng, and Michael I. Jordan. 2003. Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993--1022.
    [3]
    Markus Bundschus, Shipeng Yu, Volker Tresp, Achim Rettinger, Mathaeus Dejori, and Hans-Peter Kriegel. 2009. Hierarchical Bayesian models for collaborative tagging systems. In Proceedings of the 9th IEEE International Conference on Data Mining (ICDM’09). IEEE, 728--733.
    [4]
    Iván Cantador, Peter Brusilovsky, and Tsvi Kuflik. 2011. 2nd Workshop on Information Heterogeneity and Fusion in Recommender Systems (HetRec’11). In Proceedings of the 5th ACM Conference on Recommender Systems (RecSys’11). ACM, New York.
    [5]
    Jonathan Chang and David Blei. 2009. Relational topic models for document networks. In Proceedings of the Conference on AI and Statistics.
    [6]
    Nello Cristianini and John Shawe-Taylor. 2010. An Introduction to Support Vector Machines and Other Kernel-based Learning Methods. Cambridge University Press.
    [7]
    Darcy Davis, Ryan Lichtenwalter, and Nitesh V. Chawla. 2011. Multi-relational link prediction in heterogeneous information networks. In Proceedings of the 2011 International Conference on Advances in Social Networks Analysis and Mining (ASONAM’11). IEEE, 281--288.
    [8]
    Laura Dietz. 2009. Modeling shared tastes in online communities. In Proceedings of the NIPS Workshop on Applications for Topic Models: Text and Beyond.
    [9]
    Seyda Ertekin, Jian Huang, Leon Bottou, and Lee Giles. 2007. Learning on the border: Active learning in imbalanced data classification. In Proceedings of the 16th ACM Conference on Information and Knowledge Management (CIKM’07). ACM, New York. 127--136.
    [10]
    Liang Ge and Aidong Zhang. 2012. Pseudo cold start link prediction with multiple sources in social networks. In Proceedings of the SIAM International Conference on Data Mining. SIAM/Omnipress, 768--779.
    [11]
    Scott Golder and Bernardo A. Huberman. 2006. The structure of collaborative tagging systems. J. Inf. Sci. 32, 2, 198--208.
    [12]
    Mark Granovetter. 1983. The strength of weak ties: A network theory revisited. Sociol. Theory 1, 201--233.
    [13]
    Thomas L. Griffiths and Mark Steyvers. 2004. Finding scientific topics. Proc. Nat. Acad. Sci. 101, Suppl 1, 5228--5235.
    [14]
    Manish Gupta, Rui Li, Zhijun Yin, and Jiawei Han. 2010. Survey on social tagging techniques. SIGKDD Explor. Newsl. 12, 1, 58--72.
    [15]
    Harry Halpin, Valentin Robu, and Hana Shepherd. 2007. The complex dynamics of collaborative tagging. In Proceedings of the 16th International Conference on World Wide Web (WWW’07). ACM, New York, 211--220.
    [16]
    Negar Hariri, Bamshad Mobasher, and Robin Burke. 2012. Context-aware music recommendation based on latenttopic sequential patterns. In Proceedings of the 6th ACM Conference on Recommender Systems (RecSys’12). ACM, New York, 131--138.
    [17]
    Morgan Harvey, Ian Ruthven, and Mark J. Carman. 2011. Improving social bookmark search using personalised latent variable language models. In Proceedings of the 4th ACM International Conference on Web Search and Data Mining (WSDM’11). ACM, New York, 485--494.
    [18]
    Peter D. Hoff. 2009. Multiplicative latent factor models for description and prediction of social networks. Comput. Math. Organ. Theory 15, 4, 261--272.
    [19]
    Donald B. Johnson. 1977. Efficient algorithms for shortest paths in sparse networks. J. ACM 24, 1, 1--13.
    [20]
    Leo Katz. 1953. A new status index derived from sociometric analysis. Psychometrika 18, 1, 39--43.
    [21]
    S. Sathiya Keerthi, Olivier Chapelle, and Dennis DeCoste. 2006. Building support vector machines with reduced classifier complexity. J. Mach. Learn. Res. 7, 1493--1515.
    [22]
    Tamara G. Kolda and Brett W. Bader. 2009. Tensor decompositions and applications. SIAM Rev. 51, 3, 455--500.
    [23]
    Ralf Krestel, Peter Fankhauser, and Wolfgang Nejdl. 2009. Latent Dirichlet allocation for tag recommendation. In Proceedings of the 3rd ACM Conference on Recommender Systems (RecSys’09). ACM, New York, 61--68.
    [24]
    Kristina Lerman and Anon Plangprasopchok. 2009. Handbook of Research on Web 2.0, 3.0, and X.0: Technologies, Business, and Social Applications. IGI Global, Chapter Leveraging user-specified metadata to personalize image search.
    [25]
    Vincent Leroy, B. Barla Cambazoglu, and Francesco Bonchi. 2010. Cold start link prediction. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’10). ACM, New York, 393--402.
    [26]
    Nan Lin, Daifeng Li, Ying Ding, Bing He, Zheng Qin, Jie Tang, Juanzi Li, and Tianxi Dong. 2012. The dynamic features of delicious, Flickr, and YouTube. J. Amer. Soc. Inf. Sci. Technol. 63, 1, 139--162.
    [27]
    Marek Lipczak, Borkur Sigurbjornsson, and Alejandro Jaimes. 2012. Understanding and leveraging tag-based relations in on-line social networks. In Proceedings of the 23rd ACM Conference on Hypertext and Social Media (HT’12). ACM, New York, 229--238.
    [28]
    Lu Liu, Feida Zhu, Lei Zhang, and Shiqiang Yang. 2012. A probabilistic graphical model for topic and preference discovery on social media. Neurocomput. 95, 78--88.
    [29]
    Yan Liu, Alexandru Niculescu-Mizil, and Wojciech Gryc. 2009. Topic-link LDA: Joint models of topic and author community. In Proceedings of the 26th Annual International Conference on Machine Learning (ICML’09). ACM, New York, 665--672.
    [30]
    Zhiyuan Liu, Yuzhou Zhang, Edward Y. Chang, and Maosong Sun. 2011. PLDA+: Parallel latent Dirichlet allocation with data placement and pipeline processing. ACM Trans. Intell. Syst. Technol. 2, 3, Article 26.
    [31]
    Bo Long, Xiaoyun Wu, Zhongfei (Mark) Zhang, and Philip S. Yu. 2006. Unsupervised learning on k-partite graphs. In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’06). ACM, New York, 317--326.
    [32]
    Caimei Lu, Xiaohua Hu, Xin Chen, Jung-Ran Park, TingTing He, and Zhoujun Li. 2010. The topicperspectivemodel for social tagging systems. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’10). ACM, New York, 683--692.
    [33]
    Linyuan Lu and Tao Zhou. 2011. Link prediction in complex networks: A survey. Physica A: 390, 6, 1150--1170.
    [34]
    Masoud Makrehchi. 2011. Social link recommendation by learning hidden topics. In Proceedings of the 5th ACM Conference on Recommender Systems (RecSys’11). ACM, New York, 189--196.
    [35]
    Miller McPherson, Lynn Smith-Lovin, and James M Cook. 2001. Birds of a Feather: Homophily in Social Networks. Ann. Rev. Sociol. 27, 1, 415--444.
    [36]
    Aditya Krishna Menon and Charles Elkan. 2011. Link prediction via matrix factorization. In Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases (ECML PKDD’11). Springer, 437--452.
    [37]
    Alan Mislove, Bimal Viswanath, Krishna P. Gummadi, and Peter Druschel. 2010. You are who you know: Inferring user profiles in online social networks. In Proceedings of the 3rd ACM International Conference on Web Search and Data Mining (WSDM’10). ACM, New York, 251--260.
    [38]
    Rohit Parimi and Doina Caragea. 2011. Predicting friendship links in social networks using a topic modeling approach. In Proceedings of the 15th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining (PAKDD’11). Springer, 75--86.
    [39]
    Marco Pennacchiotti and Siva Gurumurthy. 2011. Investigating topic models for social media user recommendation. In Proceedings of the 20th International Conference on World Wide Web (WWW’11). ACM, New York, 101--102.
    [40]
    John C. Platt. 1999. Advances in Kernel Methods. MIT Press, Cambridge, MA, 185--208.
    [41]
    Michal Rosen-Zvi, Chaitanya Chemudugunta, Thomas Griffiths, Padhraic Smyth, and Mark Steyvers. 2010. Learning author-topic models from text corpora. ACM Trans. Inf. Syst. 28, 1, Article 4.
    [42]
    Adam Sadilek, Henry Kautz, and Jeffrey P. Bigham. 2012. Finding your friends and following them to where you are. In Proceedings of the 5th ACM International Conference on Web Search and Data Mining (WSDM’12). ACM, New York, 723--732.
    [43]
    Rossano Schifanella, Alain Barrat, Ciro Cattuto, Benjamin Markines, and Filippo Menczer. 2010. Folks in folksonomies: Social link prediction from shared metadata. In Proceedings of the 3rd ACM International Conference on Web Search and Data Mining (WSDM’10). ACM, New York, 271--280.
    [44]
    Shai Shalev-Shwartz and Nathan Srebro. 2008. SVM optimization: inverse dependence on training set size. In Proceedings of the 25th International Conference on Machine Learning (ICML’08). ACM, New York, 928--935.
    [45]
    Yizhou Sun, Jiawei Han, Xifeng Yan, Philip S. Yu, and Tianyi Wu. 2011. Pathsim: Meta path-based top-k similarity search in heterogeneous information networks. In Proceedings of the International Conference on Very Large Databases.
    [46]
    Ben Taskar, Ming fai Wong, Pieter Abbeel, and Daphne Koller. 2003. Link prediction in relational data. In Neural Information Processing Systems.
    [47]
    Ivor W. Tsang, James T. Kwok, and Pak-Ming Cheung. 2005. Core vector machines: Fast SVM training on very large data sets. J. Mach. Learn. Res. 6, 363--392.
    [48]
    S. Wasserman and K. Faust. 1994. Social Network Analysis: Methods and Applications. Cambridge University Press, Cambridge, UK.
    [49]
    Rongjing Xiang, Jennifer Neville, and Monica Rogati. 2010. Modeling relationship strength in online social networks. In Proceedings of the 19th International Conference on World Wide Web (WWW’10). ACM, New York, 981--990.

    Cited By

    View all
    • (2023)Federated search techniques: an overview of the trends and state of the artKnowledge and Information Systems10.1007/s10115-023-01922-665:12(5065-5095)Online publication date: 10-Jul-2023
    • (2021)PILHNB: Popularity, interests, location used hidden Naive Bayesian-based model for link prediction in dynamic social networksNeurocomputing10.1016/j.neucom.2021.02.101Online publication date: Jun-2021
    • (2020)Learning Semantic Representations from Directed Social Links to Tag Microblog Users at ScaleACM Transactions on Information Systems10.1145/337755038:2(1-30)Online publication date: 7-Mar-2020
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Information Systems
    ACM Transactions on Information Systems  Volume 31, Issue 4
    November 2013
    192 pages
    ISSN:1046-8188
    EISSN:1558-2868
    DOI:10.1145/2536736
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 November 2013
    Accepted: 01 June 2013
    Revised: 01 May 2013
    Received: 01 January 2013
    Published in TOIS Volume 31, Issue 4

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Annotation
    2. Last.fm
    3. collaborative tagging
    4. graphical models
    5. link prediction
    6. link recommendation
    7. machine learning
    8. social bookmarking
    9. social media
    10. topic models
    11. unsupervised learning

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    • Chevron
    • Center for Interactive Smart Oilfield Technologies (CiSoft)

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)13
    • Downloads (Last 6 weeks)1

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Federated search techniques: an overview of the trends and state of the artKnowledge and Information Systems10.1007/s10115-023-01922-665:12(5065-5095)Online publication date: 10-Jul-2023
    • (2021)PILHNB: Popularity, interests, location used hidden Naive Bayesian-based model for link prediction in dynamic social networksNeurocomputing10.1016/j.neucom.2021.02.101Online publication date: Jun-2021
    • (2020)Learning Semantic Representations from Directed Social Links to Tag Microblog Users at ScaleACM Transactions on Information Systems10.1145/337755038:2(1-30)Online publication date: 7-Mar-2020
    • (2020)Mining User Interests from Social MediaProceedings of the 29th ACM International Conference on Information & Knowledge Management10.1145/3340531.3412167(3519-3520)Online publication date: 19-Oct-2020
    • (2019)Operationalizing regulatory focus in the digital ageMIS Quarterly10.25300/MISQ/2019/1442043:3(745-764)Online publication date: 1-Sep-2019
    • (2019)Extracting, Mining and Predicting Users' Interests from Social NetworksProceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3331184.3331383(1407-1408)Online publication date: 18-Jul-2019
    • (2019)Detecting and Assessing Anomalous Evolutionary Behaviors of Nodes in Evolving Social NetworksACM Transactions on Knowledge Discovery from Data10.1145/329988613:1(1-24)Online publication date: 23-Jan-2019
    • (2019)Social User Interest MiningProceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining10.1145/3292500.3332279(3235-3236)Online publication date: 25-Jul-2019
    • (2019)User behavior prediction in social networks using weighted extreme learning machine with distribution optimizationFuture Generation Computer Systems10.1016/j.future.2018.04.08593:C(1023-1035)Online publication date: 1-Apr-2019
    • (2018)A tag-based recommender system framework for social bookmarking websitesInternational Journal of Web Based Communities10.5555/3282769.328277514:3(303-322)Online publication date: 1-Jan-2018
    • Show More Cited By

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media