Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Inferring tag co-occurrence relationship across heterogeneous social networks

Published: 01 May 2018 Publication History

Graphical abstract

Display Omitted

Highlights

Study the problem of the tag co-occurrence relationship prediction in Flickr heterogeneous social network.
Present a novel data structure to capture the correlation between images and tags in the Flickr network.
Leverage a weight path-based feature space to illustrate structural information in co-occurrence prediction.
Denote a new similarity measure to capture the subtle similarity semantics between peer objects in heterogeneous networks.
Our method can predict co-occurrence relationship with high accuracy compared with the state-of-the-art link prediction methods.

Abstract

Predicting the occurrence of links or interactions between objects in a network is a fundamental problem in network analysis. In this work, we address a novel problem about tag co-occurrence relationship prediction across heterogeneous networks. Although tag co-occurrence has recently become a hot research topic, many studies mainly focus on how to produce the personalized recommendation leveraging the tag co-occurrence relationship and most of them are considered in a homogeneous network. So far, few studies pay attention to how to predict tag co-occurrence relationship across heterogeneous networks. In order to solve this novel problem mentioned previously, we propose a novel three-step prediction approach. First, image-tag bins are generated by utilizing the TF-IDF like method, which help reduce the search space. And then, weight path-based topological features are systematically extracted from the network. At last, a supervised model is used to learn the best weights associated with different topological features in deciding the co-occurrence relationships. Experiments are performed on real-world dataset, the Flickr network, with comprehensive measurements. Experimental results demonstrate that weight path-based heterogeneous topological features have substantial advantages over commonly used link prediction approaches in predicting co-occurrence relations in Flickr networks.

References

[1]
A.-L. Barabsi, R. Albert, Emergence of scaling in random networks, Science 286 (1999) 509–512.
[2]
J. Leskovec, J.M. Kleinberg, C. Faloutsos, Graphs over time: densification laws, shrinking diameters and possible explanations, KDD’05 (2005) 177–187.
[3]
J. Leskovec, L. Backstrom, R. Kumar, A. Tomkins, Microscopic evolution of social networks, KDD’08 (2008) 462–470.
[4]
M. Sachan, D. Contractor, T. Faruquie, L. Subramaniam, Using content and interactions for discovering communities in social networks, WWW’12 (2012) 331–340.
[5]
W. Lin, X. Kong, P.S. Yu, Q. Wu, Y. Jia, C. Li, Community detection in incomplete information networks, in: Proceedings of the 21st International Conference on World Wide Web, ACM, 2012, pp. 341–350.
[6]
M. Hmimida, R. Kanawati, A graph-coarsening approach for tag recommendation, in: Proceedings of the 25th International Conference Companion on World Wide Web, International World Wide Web Conferences Steering Committee, 2016, pp. 43–44.
[7]
N. Ifada, R. Nayak, How relevant is the irrelevant data: leveraging the tagging data for a learning-to-rank model, in: Proceedings of the Ninth ACM International Conference on Web Search and Data Mining, ACM, 2016, pp. 23–32.
[8]
Z. Xu, C. Chen, T. Lukasiewicz, et al., Tag-aware personalized recommendation using a deep-semantic similarity model with negative sampling, in: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, ACM, 2016, pp. 1921–1924.
[9]
Y. Zuo, J. Zeng, M. Gong, et al., Tag-aware recommender systems based on deep neural networks, Neurocomputing 204 (2016) 51–60.
[10]
S. Negi, S. Chaudhury, Link prediction in heterogeneous social networks, in: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, ACM, 2016, pp. 609–617.
[11]
V. Priya, A. Vadivel, User behaviour pattern mining from WebLog, Int. J. Data Wareh. Min. 8 (2) (2012) 1–22.
[12]
M.S. Khan, M. Muyeba, F. Coenen, et al., Finding associations in composite data sets: the CFARM algorithm, Int. J. Data Wareh. Min. 7 (3) (2011) 1–29.
[13]
D. Liben-Nowell, J. Kleinberg, The link prediction problem for social networks, CIKM’03 (2003) 556–559.
[14]
B. Taskar, M. fai Wong, P. Abbeel, D. Koller, Link prediction in relational data, NIPS’03 (2003).
[15]
Y. Sun, R. Barber, M. Gupta, C. Aggarwal, J. Han, Co-author relationship prediction in heterogeneous bibliographic networks, in: Proceedings of 2011 Int. Conf. on Advances in Social Network Analysis and Mining, IEEE, 2011.
[16]
J. Tang, T. Lou, J. Kleinberg, Inferring social ties across heterogeneous networks, in: Proceedings of the Fifth ACM International Conference on Web Search and Data Mining, ACM, 2012, pp. 743–752.
[17]
S. Chang, W. Han, J. Tang, et al., Heterogeneous network embedding via deep architectures, in: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, 2015, pp. 119–128.
[18]
Y. Liu, X. Zeng, Z. He, et al., Inferring microRNA-disease associations by random walk on a heterogeneous network with multiple data sources, IEEE/ACM Trans. Comput. Biol. Bioinform. (2016) pp. (99): 1-1.
[19]
H. Halpin, V. Robu, H. Shepherd, The complex dynamics of collaborative tagging, in: Proceedings of WWW’07, New York, NY, USA, ACM, 2007, pp. 211–220.
[20]
Y. Sun, J. Han, X. Yan, P.S. Yu, T. Wu, PathSim: meta path-based top-k similarity search in heterogeneous information networks, PVLDB 4 (11) (2011) 992–1003.
[21]
J. Chen, H. Gao, Z. Wu, D. Li, Tag co-occurrence relationship prediction in heterogeneous information networks, in: 2013 International Conference on Parallel and Distributed Systems (ICPADS), IEEE, 2013, pp. 528–533.
[22]
F. Zhang, J. Ignatius, Y. Zhao, et al., An improved consensus-based group decision making model with heterogeneous information, Appl. Soft Comput. 35 (2015) 850–863.
[23]
C. Shi, Y. Li, J. Zhang, et al., A survey of heterogeneous information network analysis, IEEE Trans. Knowl. Data Eng. 29 (12) (2015) 87–99.
[24]
C.M. Au Yeung, N. Gibbins, N. Shadbolt, A study of user profile generation from folksonomies, Proceedings of the Workshop on Social Web and Knowledge Management at WWW2008 (2008).
[25]
K. Fukunaga, L. Hostetler, The estimation of the gradient of a density function, with applications in pattern recognition, IEEE Trans. Inf. Theory 21 (1) (1975) 32–40.
[26]
L. Cao, J. Luo, A. Gallagher, X. Jin, J. Han, T.S. Huang, A worldwide tourism recommendation system based on geotagged web photos, ICASSP (2010) 2274–2277.
[27]
D.J. Crandall, L. Backstrom, D. Huttenlocher, J. Kleinberg, Mapping the world's photos, in: Proceedings of the 18th International Conference on World Wide Web, Ser. WWW’09, New York, NY, USA, ACM, 2009, pp. 761–770.
[28]
R.A. Negoescu, D. Gatica-Perez, Analyzing flickr groups, in: Proceedings of the 2008 International Conference on Content-Based Image and Video Retrieval, ACM, July 2008, pp. 417–426.
[29]
K. Lerman, A. Plangprasopchok, C. Wong, Personalizing Image Search Results on Flickr, American Association for Artificial Intelligence, 2007.
[30]
S. Chakrabarti, Dynamic personalized pagerank in entity-relation graphs, WWW’07 (2007) 571–580.
[31]
R. Lichtenwalter, J.T. Lussier, N.V. Chawla, New perspectives and methods in link prediction, KDD’10 (2010) 243–252.
[33]
J. McAuley, J. Leskovec, Image labeling on a network: using social-network metadata for image classification, in: Computer Vision ECCV, Springer Berlin Heidelberg, 2012, pp. 828–841.
[35]
Y. Dong, J. Tang, S. Wu, J. Tian, N.V. Chawla, J. Rao, H. Cao, Link prediction and recommendation across heterogeneous social networks, in: 2012 IEEE 12th International Conference on Data Mining (ICDM), IEEE, 2012, pp. 181–190.
[36]
W. Tang, H. Zhuang, J. Tang, Learning to infer social ties in large networks, ECML/PKDD’11 (2011) 381–397.
[37]
J. Leskovec, D. Huttenlocher, J. Kleinberg, Predicting positive and negative links in online social networks, WWW’10 (2010) 641–650.
[38]
J. Hopcroft, T. Lou, J. Tang, Who will follow you back? Reciprocal relationship prediction, in: Proceedings of the 20th ACM International Conference on Information and Knowledge Management, ACM, 2011, pp. 1137–1146.
[39]
J.D. Lafferty, A. McCallum, F.C.N. Pereira, Conditional random fields: probabilistic models for segmenting and labeling sequence data, ICML’01 (2001) 282–289.
[40]
T. Takashita, Y. Abe, T. Itokawa, et al., Design and implementation of a system for finding appropriate tags to photos in Flickr from Web browsing behaviour, Int. J. Web Grid Serv. 7 (1) (2011) 75–90.
[41]
G. Begelman, P. Keller, F. Smadja, Automated tag clustering: improving search and exploration in the tag space, in: Collaborative Web Tagging Workshop at WWW2006, Edinburgh, Scotland, May 2006, pp. 15–33.
[42]
C. Wartena, R. Brussee, M. Wibbels, Using tag co-occurrence for recommendation, in: Ninth International Conference on Intelligent Systems Design and Applications, 2009, ISDA’09, IEEE, November 2009, pp. 273–278.
[43]
F.M. Belm, E.F. Martins, J.M. Almeida, M.A. Gonalves, G.L. Pappa, Exploiting co-occurrence and information quality metrics to recommend tags in web 2.0 applications, in: Proceedings of the 19th ACM International Conference on Information and Knowledge Management, ACM, October 2010, pp. 1793–1796.
[44]
B. Sigurbjrnsson, R. Van Zwol, Flickr tag recommendation based on collective knowledge, in: Proceedings of the 17th International Conference on World Wide Web, ACM, April 2008, pp. 327–336.
[45]
R. Abbasi, S. Staab, Introducing triple play for improved resource retrieval in collaborative tagging systems, in: Proceedings of the ECIR Workshop on Exploiting Semantic Annotations in Information Retrieval (ESAIR’08), Barcelona, March 30, 2008.
[46]
H. Halpin, V. Robu, H. Shepherd, The complex dynamics of collaborative tagging, in: Proceedings of WWW’07, New York, NY, USA, ACM, 2007, pp. 211–220.
[47]
A. Sun, S.S. Bhowmick, J.A. Chong, Social image tag recommendation by concept matching, in: Proceedings of the 19th ACM International Conference on Multimedia, ACM, November 2011, pp. 1181–1184.
[48]
R. Krestel, L. Chen, Using co-occurrence of tags and resources to identify spammers, Proceedings of 2008 ECML/PKDD Discovery Challenge Workshop (September 2008) 38–46.
[49]
J. Peng, D.D. Zeng, H. Zhao, F.Y. Wang, Collaborative filtering in social tagging systems based on joint item-tag recommendations, in: Proceedings of the 19th ACM International Conference on Information and Knowledge Management, ACM, October 2010, pp. 809–818.
[50]
W. Liu, A. Sidhu, A.M. Beacom, et al., Social network theory The International Encyclopedia of Media Effects, 2017.
[51]
J. Zhang, J. Chen, S. Zhi, et al., Link prediction across aligned networks with sparse and low rank matrix estimation, in: 2017 IEEE 33rd International Conference on Data Engineering (ICDE), IEEE, 2017, pp. 971–982.
[52]
M. Gao, L. Chen, B. Li, et al., Projection-based link prediction in a bipartite network, Inf. Sci. 376 (2017) 158–171.
[53]
C. Fan, Z. Liu, X. Lu, et al., An efficient link prediction index for complex military organization, Physica A: Stat. Mech. Appl. 469 (2017) 572–587.
[54]
M.E.J. Newman, Clustering and preferential attachment in growing networks, Phys. Rev. E 64 (2) (2001) 025102.
[55]
A. Jonnalagadda, L. Kuppusamy, A survey on game theoretic models for community detection in social networks, Soc. Netw. Anal. Min. 6 (1) (2016) 83.
[56]
L. Backstrom, J. Leskovec, Supervised random walks: predicting and recommending links in social networks, WSDM’11 (2011) 635–644.
[57]
J. Kunegis, A. Lommatzsch, Learning spectral graph transformations for link prediction, ICML’09 (2009) 561–568.
[58]
R.N. Lichtenwalter, N.V. Chawlai, Vertex collocation profiles: subgraph counting for link analysis and prediction, WWW’12 (2012) 1019–1028.
[59]
X. Yu, Q. Gu, M. Zhou, J. Han, Citation prediction in heterogeneous bibliographic networks, Proceedings of the 2012 SIAM Conference on Data Mining (SDM 2012) (2012).
[60]
Y. Sun, J. Han, C.C. Aggarwal, N.V. Chawla, When will it happen? Relationship prediction in heterogeneous information networks, in: Proceedings of the Fifth ACM International Conference on Web Search and Data Mining, ACM, February 2012, pp. 663–672.

Cited By

View all
  • (2023)Which account will you follow? Recommending influential accounts on social mediaMultimedia Tools and Applications10.1007/s11042-023-14538-382:22(34053-34074)Online publication date: 1-Sep-2023
  • (2022)Ranked enumeration of join queries with projectionsProceedings of the VLDB Endowment10.14778/3510397.351040115:5(1024-1037)Online publication date: 18-May-2022
  • (2020)Measure User Intimacy by Mining Maximum Information Transmission PathsComplexity10.1155/2020/23764512020Online publication date: 21-Mar-2020

Index Terms

  1. Inferring tag co-occurrence relationship across heterogeneous social networks
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image Applied Soft Computing
      Applied Soft Computing  Volume 66, Issue C
      May 2018
      564 pages

      Publisher

      Elsevier Science Publishers B. V.

      Netherlands

      Publication History

      Published: 01 May 2018

      Author Tags

      1. Tag co-occurrence
      2. Link prediction
      3. Heterogeneous network
      4. Flickr
      5. Weight path

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 24 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Which account will you follow? Recommending influential accounts on social mediaMultimedia Tools and Applications10.1007/s11042-023-14538-382:22(34053-34074)Online publication date: 1-Sep-2023
      • (2022)Ranked enumeration of join queries with projectionsProceedings of the VLDB Endowment10.14778/3510397.351040115:5(1024-1037)Online publication date: 18-May-2022
      • (2020)Measure User Intimacy by Mining Maximum Information Transmission PathsComplexity10.1155/2020/23764512020Online publication date: 21-Mar-2020

      View Options

      View options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media