Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2232817.2232833acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
research-article

Modeling and exploiting heterogeneous bibliographic networks for expertise ranking

Published: 10 June 2012 Publication History

Abstract

Recently expertise retrieval has received increasing interests in both academia and industry. Finding experts with demonstrated expertise for a given query is a nontrivial task especially from a large-scale Web 2.0 systems, such as question answering and bibliography data, where users are actively publishing useful content online, interacting with each other, and forming social networks in various ways, leading to heterogeneous networks in addition to the large amounts of textual content information. Many approaches have been proposed and shown to be useful for expertise ranking. However, most of these methods only consider the textual documents while ignoring heterogeneous network structures or can merely integrate with one additional kind of information. None of them can fully exploit the characteristics of heterogeneous networks. In this paper, we propose a joint regularization framework to enhance expertise retrieval by modeling heterogeneous networks as regularization constraints on top of document-centric model. We argue that multi-typed linking edges reveal valuable information which should be treated differently. Motivated by this intuition, we formulate three hypotheses to capture unique characteristics for different graphs, and mathematically model those hypotheses jointly with the document and other information. To illustrate our methodology, we apply the framework to expert finding applications using a bibliography dataset with 1.1 million papers and 0.7 million authors. The experimental results show that our proposed approach can achieve significantly better results than the baseline and other enhanced models.

References

[1]
K. Balog, L. Azzopardi, and M. de Rijke. Formal models for expert finding in enterprise corpora. In SIGIR, pages 43--50, 2006.
[2]
J. Bian, Y. Liu, D. Zhou, E. Agichtein, and H. Zha. Learning to recognize reliable users and content in social media with coupled mutual reinforcement. In WWW, pages 51--60, 2009.
[3]
S. Brin and L. Page. The anatomy of a large-scale hypertextual web search engine. Computer Networks, 30(1--7):107--117, 1998.
[4]
C. Buckley and E. M. Voorhees. Retrieval evaluation with incomplete information. In SIGIR, pages 25--32, 2004.
[5]
C. S. Campbell, P. P. Maglio, A. Cozzi, and B. Dom. Expertise identification using email communications. In CIKM, pages 528--531, 2003.
[6]
H. Deng, J. Han, B. Zhao, Y. Yu and C. X. Lin. Probabilistic topic models with biased propagation on heterogeneous information networks. In KDD, pages 1271--1279, 2011.
[7]
H. Deng, I. King, and M. R. Lyu. Formal models for expert finding on dblp bibliography data. In ICDM, pages 163--172, 2008.
[8]
H. Deng, I. King, and M. R. Lyu. Enhanced models for expertise retrieval using community-aware strategies. IEEE Trans. SMC-B, pages 93--106, 2012.
[9]
F. Diaz. Regularizing ad hoc retrieval scores. In CIKM, pages 672--679, 2005.
[10]
H. Fang and C. Zhai. Probabilistic models for expert finding. In ECIR, pages 418--430, 2007.
[11]
Z. Guan, J. Bu, Q. Mei, C. Chen, and C. Wang. Personalized tag recommendation using graph-based ranking on multi-type interrelated objects. In SIGIR, pages 540--547, 2009.
[12]
C.W. Hsu, C.C. Chang, and C.J. Lin. A practical guide to support vector classification. In Technical Report, 2003.
[13]
M. Karimzadehgan, R. W. White, and M. Richardson. Enhancing expert finding using organizational hierarchies. In ECIR, pages 177--188, 2009.
[14]
J. M. Kleinberg. Authoritative sources in a hyperlinked environment. JACM, 46(5):604--632, 1999.
[15]
X. Liu, J. Bollen, M. L. Nelson, and H. V. de Sompel. Co-authorship networks in the digital library research community. IPM, 41(6):1462--1480, 2005.
[16]
X. Liu, W. B. Croft, and M. B. Koll. Finding experts in community-based question-answering services. In CIKM, pages 315--316, 2005.
[17]
Y. Lu, P. Tsaparas, A. Ntoulas, and L. Polanyi. Exploiting social context for review quality prediction. In WWW, pages 691--700, 2010.
[18]
C. Macdonald and I. Ounis. Voting for candidates: adapting data fusion techniques for an expert search task. In CIKM, pages 387--396, 2006.
[19]
Q. Mei, D. Cai, D. Zhang, and C. Zhai. Topic modeling with network regularization. In WWW, pages 101--110, 2008.
[20]
Z. Nie, Y. Zhang, J.-R. Wen, and W.-Y. Ma. Object-level ranking: bringing order to web objects. In WWW, pages 567--574, 2005.
[21]
M. G. Noll, C. man Au Yeung, N. Gibbins, C. Meinel, and N. Shadbolt. Telling experts from spammers: expertise ranking in folksonomies. In SIGIR, pages 612--619, 2009.
[22]
J. M. Ponte and W. B. Croft. A language modeling approach to information retrieval. In SIGIR, pages 275--281, 1998.
[23]
E. Smirnova and K. Balog. A user-oriented model for expert finding. In ECIR, pages 580--592, 2011.
[24]
I. Soboroff, A. de Vries, and N. Craswell. Overview of the TREC-2006 enterprise track. In Proc. of TREC 2006.
[25]
Y. Sun, Y. Yu, and J. Han. Ranking-based clustering of heterogeneous information networks with star network schema. In KDD, pages 797--806, 2009.
[26]
C. Zhai and J. D. Lafferty. A study of smoothing methods for language models applied to information retrieval. ACM Trans. Inf. Syst., 22(2):179--214, 2004.
[27]
J. Zhang, M. S. Ackerman, and L. Adamic. Expertise networks in online communities: structure and algorithms. In WWW, pages 221--230, 2007.
[28]
J. Zhang, J. Tang, and J.-Z. Li. Expert finding in a social network. In DASFAA, pages 1066--1069, 2007.
[29]
D. Zhou, O. Bousquet, T. N. Lal, J. Weston, and B. Schölkopf. Learning with local and global consistency. In NIPS, 2003.
[30]
D. Zhou, S. A. Orshanskiy, H. Zha, and C. L. Giles. Co-ranking authors and documents in a heterogeneous network. In ICDM, pages 739--744, 2007.
[31]
D. Zhou, S. Zhu, K. Yu, X. Song, B. L. Tseng, H. Zha, and C. L.Giles. Learning multiple graphs for document recommendations. In WWW, pages 141--150, 2008.
[32]
X. Zhu, Z. Ghahramani, and J. D. Lafferty. Semi-supervised learning using gaussian fields and harmonic functions. In ICML, pages 912--919, 2003.

Cited By

View all
  • (2023)Optimized Doctor Recommendation System using Supervised Machine LearningProceedings of the 24th International Conference on Distributed Computing and Networking10.1145/3571306.3571372(360-365)Online publication date: 4-Jan-2023
  • (2023)Exploring the Significance of Publication-Age-Based Parameters for Evaluating Researcher ImpactIEEE Access10.1109/ACCESS.2023.330401311(86597-86610)Online publication date: 2023
  • (2022)Automated disease diagnosis and precaution recommender system using supervised machine learningMultimedia Tools and Applications10.1007/s11042-022-12897-x81:22(31929-31952)Online publication date: 11-Apr-2022
  • Show More Cited By

Index Terms

  1. Modeling and exploiting heterogeneous bibliographic networks for expertise ranking

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      JCDL '12: Proceedings of the 12th ACM/IEEE-CS joint conference on Digital Libraries
      June 2012
      458 pages
      ISBN:9781450311540
      DOI:10.1145/2232817
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 10 June 2012

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. expertise ranking
      2. graph regularization
      3. heterogeneous bibliographic network
      4. probabilistic model

      Qualifiers

      • Research-article

      Conference

      JCDL '12
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 415 of 1,482 submissions, 28%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)6
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 30 Aug 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Optimized Doctor Recommendation System using Supervised Machine LearningProceedings of the 24th International Conference on Distributed Computing and Networking10.1145/3571306.3571372(360-365)Online publication date: 4-Jan-2023
      • (2023)Exploring the Significance of Publication-Age-Based Parameters for Evaluating Researcher ImpactIEEE Access10.1109/ACCESS.2023.330401311(86597-86610)Online publication date: 2023
      • (2022)Automated disease diagnosis and precaution recommender system using supervised machine learningMultimedia Tools and Applications10.1007/s11042-022-12897-x81:22(31929-31952)Online publication date: 11-Apr-2022
      • (2021)Assessment of author ranking indices based on multi-authorshipScientometrics10.1007/s11192-021-03906-1Online publication date: 6-Mar-2021
      • (2020)Identifying collaboration dynamics of bipartite author-topic networks with the influences of interest changesScientometrics10.1007/s11192-019-03342-2Online publication date: 14-Jan-2020
      • (2019)Expert Finding Systems: A Systematic ReviewApplied Sciences10.3390/app92042509:20(4250)Online publication date: 11-Oct-2019
      • (2019)Automated Expertise RetrievalACM Computing Surveys10.1145/333100052:5(1-30)Online publication date: 13-Sep-2019
      • (2019)Productivity-based Features from Article Metadata for Fuzzy Rules to Classify Academic Expert2019 IEEE 10th International Conference on Awareness Science and Technology (iCAST)10.1109/ICAwST.2019.8923316(1-6)Online publication date: Oct-2019
      • (2019)Comprehensive evaluation of h-index and its extensions in the domain of mathematicsScientometrics10.1007/s11192-019-03007-0118:3(809-822)Online publication date: 1-Mar-2019
      • (2018)SCSMinerWorld Wide Web10.1007/s11280-018-0526-921:6(1523-1543)Online publication date: 1-Nov-2018
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media