Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Community Discovery from Social Media by Low-Rank Matrix Recovery

Published: 23 January 2015 Publication History

Abstract

The pervasive usage and reach of social media have attracted a surge of attention in the multimedia research community. Community discovery from social media has therefore become an important yet challenging issue. However, due to the subjective generating process, the explicitly observed communities (e.g., group-user and user-user relationship) are often noisy and incomplete in nature. This paper presents a novel approach to discovering communities from social media, including the group membership and user friend structure, by exploring a low-rank matrix recovery technique. In particular, we take Flickr as one exemplary social media platform. We first model the observed indicator matrix of the Flickr community as a summation of a low-rank true matrix and a sparse error matrix. We then formulate an optimization problem by regularizing the true matrix to coincide with the available rich context and content (i.e., photos and their associated tags). An iterative algorithm is developed to recover the true community indicator matrix. The proposed approach leads to a variety of social applications, including community visualization, interest group refinement, friend suggestion, and influential user identification. The evaluations on a large-scale testbed, consisting of 4,919 Flickr users, 1,467 interest groups, and over five million photos, show that our approach opens a new yet effective perspective to solve social network problems with sparse learning technique. Despite being focused on Flickr, our technique can be applied in any other social media community.

References

[1]
Lada A. Adamic and Eytan Adar. 2001. Friends and neighbors on the web. SOCIAL NETWORKS 25 (2001), 211--230.
[2]
Lars Backstrom, Daniel P. Huttenlocher, Jon M. Kleinberg, and Xiangyang Lan. 2006. Group formation in large social networks: Membership, growth, and evolution. In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 44--54.
[3]
Jian-Feng Cai, Emmanuel J. Candès, and Zuowei Shen. 2010. A singular value thresholding algorithm for matrix completion. SIAM Journal on Optimization 20, 4 (2010), 1956--1982.
[4]
Munmun De Choudhury, Winter A. Mason, Jake M. Hofman, and Duncan J. Watts. 2010. Inferring relevant social networks from interpersonal communication. In Proceedings of the 19th International Conference on World Wide Web. 301--310.
[5]
David J. Crandall, Dan Cosley, Daniel P. Huttenlocher, Jon M. Kleinberg, and Siddharth Suri. 2008. Feedback effects between similarity and social influence in online communities. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 160--168.
[6]
Eric Gilbert and Karrie Karahalios. 2009. Predicting tie strength with social media. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. 211--220.
[7]
Ido Guy, Naama Zwerdling, Inbal Ronen, David Carmel, and Erel Uziel. 2010. Social media recommendation based on people and tags. In Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 194--201.
[8]
Lyndon S. Kennedy, Mor Naaman, Shane Ahern, Rahul Nair, and Tye Rattenbury. 2007. How Flickr helps us make sense of the world: Context and content in community-contributed media collections. In Proceedings of the 15th International Conference on Multimedia. 631--640.
[9]
Irwin King, Michael R. Lyu, and Hao Ma. 2010. Introduction to social recommendation. In Proceedings of the 19th International Conference on World Wide Web. 1355--1356.
[10]
Ioannis Konstas, Vassilios Stathopoulos, and Joemon M. Jose. 2009. On social networks and collaborative recommendation. In Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval. 195--202.
[11]
Jure Leskovec, Daniel P. Huttenlocher, and Jon M. Kleinberg. 2010. Predicting positive and negative links in online social networks. In Proceedings of the 19th International Conference on World Wide Web. 641--650.
[12]
Z. Lin, A. Ganesh, J. Wright, L. Wu, M. Chen, and Y. Ma. 2009. Fast Convex Optimization Algorithms for Exact Recovery of a Corrupted Low-Rank Matrix. Technical Report UILU-ENG-09-2214. UIUC. http://perception.csl.illinois.edu/matrix-rank/Files/rpca_algorithms.pdf.
[13]
David G. Lowe. 2004. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60, 2 (2004), 91--110.
[14]
Tao Mei, Yong Rui, Shipeng Li, and Qi Tian. 2014. Multimedia search reranking: A literature survey. Computer Surveys 46, 3 (Sept. 2014).
[15]
Tao Mei, Bo Yang, Xian-Sheng Hua, and Shipeng Li. 2011. Contextual video recommendation by multimodal relevance and user feedback. ACM Transactions on Information Systems 29, 2 (April 2011).
[16]
Radu Andrei Negoescu, Brett Adams, Dinh Q. Phung, Svetha Venkatesh, and Daniel Gatica-Perez. 2009. Flickr hypergroups. In Proceedings of the 17th ACM International Conference on Multimedia. 813--816.
[17]
Radu Andrei Negoescu and Daniel Gatica-Perez. 2008. Analyzing Flickr groups. In Proceedings of the 2008 International Conference on Content-Based Image and Video Retrieval. 417--426.
[18]
Suman Deb Roy, Tao Mei, Wenjun Zeng, and Shipeng Li. 2013. Towards cross-domain learning for social video popularity prediction. IEEE Transactions on Multimedia 15, 6 (2013), 1255--1267.
[19]
Fabrizio Sebastiani. 2001. Machine learning in automated text categorization. CoRR cs.IR/0110053 (2001). 1--47.
[20]
Pavel Serdyukov, Vanessa Murdock, and Roelof van Zwol. 2009. Placing flickr photos on a map. In Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval. 484--491.
[21]
P. Singla, H. Kautz, J. Luo, and A. Gallagher. 2008. Discovery of social relationships in consumer photo collections using Markov logic. In Computer Vision and Pattern Recognition Workshops. 1--7.
[22]
Parag Singla and Matthew Richardson. 2008. Yes, there is a correlation: From social networks to personal behavior on the web. In Proceedings of the 17th International Conference on World Wide Web. 655--664.
[23]
Benjamin Taskar, Ming Fai Wong, Pieter Abbeel, and Daphne Koller. 2003. Link prediction in relational data. In Neural Information Processing Systems (NIPS'O3).
[24]
Changhu Wang, Shuicheng Yan, Lei Zhang, and Hong-Jiang Zhang. 2009. Multi-label sparse coding for automatic image annotation. In Proceeding of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'9). 1643--1650.
[25]
John Wright, Arvind Ganesh, Shankar Rao, and Yi Ma. 2009. Robust principal component analysis: Exact recovery of corrupted low-rank matrices. CoRR abs/0905.0233 (2009).
[26]
Lei Wu, Steven C. H. Hoi, Rong Jin, Jianke Zhu, and Nenghai Yu. 2009. Distance metric learning from uncertain side information with application to automated photo tagging. In Proceedings of the 17th ACM International Conference on Multimedia (MM'08). 135--144.
[27]
Lei Wu, Xian-Sheng Hua, Nenghai Yu, Wei-Ying Ma, and Shipeng Li. 2008. Flickr distance. In Proceedings of the 16th ACM International Conference on Multimedia (MM'08). 31--40.
[28]
Peng Wu and Daniel Tretter. 2009. Close & closer: Social cluster and closeness from photo collections. In Proceedings of the 17th ACM International Conference on Multimedia (MM'09). 709--712.
[29]
T. Yang, R. Jin, Y. Chi, and S. Zhu. 2009. Combining link and content for community detection: A discriminative approach. In KDD. 927--935.
[30]
Ting Yao, Yuan Liu, Chong-Wah Ngo, and Tao Mei. 2013. Unified entity search in social media community. In Proceedings of the 22nd International Conference on World Wide Web (WWW'13). 1457--1466.
[31]
Jie Yu, Dhiraj Joshi, and Jiebo Luo. 2009. Connecting people in photo-sharing sites by photo content and user annotations. In Proceeding of the IEEE International Conference on Multimedia and Expo (ICME'09). 1464--1467.
[32]
Guangyu Zhu, Shucheng Yan, and Yi Ma. 2010. Image tag refinement towards low-rank, content-tag prior and error sparsity. In Proceedings of the International Conference on Multimedia (MM'10). 461--470.
[33]
Xiaojin Zhu. 2005. Semi-Supervised Learning Literature Survey. Technical Report 1530. Computer Sciences, University of Wisconsin-Madison. http://pages.cs.wisc.edu/∼jerryzhu/pub/ssl_survey.pdf.
[34]
Jinfeng Zhuang and Steven C. H. Hoi. 2011. A two-view learning approach for image tag ranking. In Proceedings of the fourth ACM International Conference on Web Search and Data Mining (WSDM'11). 625--634.
[35]
Jinfeng Zhuang, Tao Mei, Steven C. H. Hoi, Xian-Sheng Hua, and Shipeng Li. 2011. Modeling social strength in social media community via kernel-based learning. In Proceedings of the 19th ACM International Conference on Multimedia (MM'11). 113--122.

Cited By

View all

Index Terms

  1. Community Discovery from Social Media by Low-Rank Matrix Recovery

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Intelligent Systems and Technology
    ACM Transactions on Intelligent Systems and Technology  Volume 5, Issue 4
    Special Sections on Diversity and Discovery in Recommender Systems, Online Advertising and Regular Papers
    January 2015
    390 pages
    ISSN:2157-6904
    EISSN:2157-6912
    DOI:10.1145/2699158
    • Editor:
    • Huan Liu
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 23 January 2015
    Revised: 01 October 2014
    Accepted: 01 January 2014
    Received: 01 March 2013
    Published in TIST Volume 5, Issue 4

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Social networks
    2. community discovery
    3. context information
    4. low-rank matrix
    5. social media

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 15 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Real-Time Focused Extraction of Social Media UsersIEEE Access10.1109/ACCESS.2022.316897710(42607-42622)Online publication date: 2022
    • (2020)Predicting the popularity of micro-videos via a feature-discrimination transductive modelMultimedia Systems10.1007/s00530-020-00660-xOnline publication date: 14-Jun-2020
    • (2019)Multimodal Learning toward Micro-Video UnderstandingSynthesis Lectures on Image, Video, and Multimedia Processing10.2200/S00938ED1V01Y201907IVM0209:4(1-186)Online publication date: 17-Sep-2019
    • (2019)A Visual Perspective for User Identification Based on Camera FingerprintImage and Graphics10.1007/978-3-030-34110-7_5(52-63)Online publication date: 23-Aug-2019
    • (2018)Tibetan Weibo User Group Division Based on Semantic Information in the Era of Big DataComplexity10.1155/2018/58090432018Online publication date: 5-Aug-2018
    • (2018)Low-Rank Multi-View Embedding Learning for Micro-Video Popularity PredictionIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2017.278578430:8(1519-1532)Online publication date: 1-Aug-2018
    • (2018)Network Estimation from Point Process DataIEEE Transactions on Information Theory10.1109/TIT.2018.2875766(1-1)Online publication date: 2018
    • (2018)An improved density peaks-based clustering method for social circle discovery in social networksNeurocomputing10.1016/j.neucom.2015.11.091179:C(219-227)Online publication date: 31-Dec-2018
    • (2018)A new measure for community structures through indirect social connectionsExpert Systems with Applications10.1016/j.eswa.2018.07.040114(196-209)Online publication date: Dec-2018
    • (2016)Atom Decomposition with Adaptive Basis Selection Strategy for Matrix CompletionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/290371612:3(1-25)Online publication date: 15-Jun-2016

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media