Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Relational term-suggestion graphs incorporating multipartite concept and expertise networks

Published: 03 January 2014 Publication History

Abstract

Term suggestions recommend query terms to a user based on his initial query. Suggesting adequate terms is a challenging issue. Most existing commercial search engines suggest search terms based on the frequency of prior used terms that match the leading alphabets the user types. In this article, we present a novel mechanism to construct semantic term-relation graphs to suggest relevant search terms in the semantic level. We built term-relation graphs based on multipartite networks of existing social media, especially from Wikipedia. The multipartite linkage networks of contributor-term, term-category, and term-term are extracted from Wikipedia to eventually form term relation graphs. For fusing these multipartite linkage networks, we propose to incorporate the contributor-category networks to model the expertise of the contributors. Based on our experiments, this step has demonstrated clear enhancement on the accuracy of the inferred relatedness of the term-semantic graphs. Experiments on keyword-expanded search based on 200 TREC-5 ad-hoc topics showed obvious advantage of our algorithms over existing approaches.

References

[1]
Amazon Mechanical Turk https://requester.mturk.com/.
[2]
Baeza-Yates, R. and Tiberi, A. 2007. Extracting semantic relations from query logs. In Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'07). 76--85.
[3]
Beeferman, D. and Berger, A. 2000. Agglomerative clustering of a search engine query log. In Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD'00). 407--416.
[4]
Becerra-Fernandez, I. 2001. Searching for experts with expertise-locator knowledge management systems. In Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics (ACL'01) Workshop Human Language Technology and Knowledge Management. 9--16.
[5]
sBenz, D., Hotho, A., R. Jäschke, R., Krause, B., Mitzlaff, F., Schmitz, C., and Stumme, G. 2010. The social bookmark and publication management system bibsonomy. VLDB J. 19, 6, 849--875.
[6]
Budanitsky, A. and Hirst, G. 2006. Evaluating WordNet-based measures of lexical semantic relatedness. Comput. Linguistics 32, 13--47.
[7]
Bull, S., Greer, J., McCalla, G., Lori, K., and Bowes, J. 2001. User modeling in I-Help: What, why, when and how. In Proceedings of the 8th International Conference on User Modeling. 117--126.
[8]
Carpineto, C., De Mori, R., Romano, G., and Bigi, B. 2000. An information-theoretic approach to automatic query expansion. ACM Trans. Inf. Syst. 19, 1, 1--27.
[9]
Chung, Y. M. and Lee, J. Y. 2001. A corpus-based approach to comparative evaluation of statistical term association measures. J. Am. Soc. Inf. Sci. Tech. 52, 4, 283--296.
[10]
Dmitriev, P., Serdyukov, P., and Chernov, S. 2010. Enterprise and desktop search. In Proceedings of the 19th International World Wide Web Conference (WWW'10). 1345--1346.
[11]
Fan, J., Wu, H., Li, G., and Zhou, L. 2010. Suggesting topic-based query terms as you type. In Proceedings of the 12th International Asia Pacific Web Conference (APWEB'10). 61--67.
[12]
Finkelstein, L., Gabrilovich, E., Matias, Y., Rivlin, E., Solan, Z., Wolfman, G., and Ruppin, E. 2002. Placing search in context: The concept revisited. ACM Trans. Inf. Syst. 20, 1, 116--131.
[13]
Gabrilovich, E. and Markovitch, S. 2005. Feature generation for text categorization using world knowledge. In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI'05). 1048--1053.
[14]
Gabrilovich, E. and Markovitch, S. 2006. Overcoming the brittleness bottleneck using Wikipedia: Enhancing text categorization with encyclopedic knowledge, In Proceedings of the 21th Conference of the American Association for Artificial Intelligence (AAAI'06). 1301--1306.
[15]
Gligorov, R., Aleksovski, Z., Kate, W., and Harmelen, F. 2007. Using Google Distance to weight approximate ontology matches. In Proceedings of the 16th international Conference on World Wide Web (WWW'07). 767--776.
[16]
Gracia, J. and Mena, E. 2008. Web-Based measure of semantic relatedness. In Proceedings of the 9th International Conference on Web Information Systems Engineering (WISE'08). 136--150.
[17]
Harvey, M., Baillie, M., Ruthven, I., and Carman, M. 2010. Tripartite hidden topic models for personalized tag suggestion. In Proceedings of the 32nd European Conference on IR Research. 432--443.
[18]
Henzinger, M. 2001. Hyperlink analysis for the Web. IEEE Internet Comput. 5, 1, 45--50.
[19]
Jin, E., Girvan, M., and Newman, M. 2001. The structure of growing social networks. Phys. Rev. E, 64, 4, 046132.
[20]
Kim, M. and Choi, K. A. 1999. Comparison of collocation-based similarity measures in query expansion. Inf. Process. Manage. 35, 19--30.
[21]
Kleinberg, J. 1998. Authoritative sources in a hyperlinked environment. In Proceedings of the 9th ACM SIAM Symposium on Discrete Algorithms. 668--677.
[22]
Lappas, T., Liu, K., and Terzi, E. 2009. Finding a team of experts in social networks. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'09). 467--476.
[23]
Ma, H., Yang, H., King, I., and Lyu, M. 2008. Learning latent semantic relations from click through data for query suggestion. In Proceedings of the 17th ACM Conference on Information and Knowledge Management (CIKM'08). 709--718.
[24]
Milne, D., Witten, I. H., and Nichols, D. 2007. A knowledge-based search engine powered by Wikipedia. In Proceedings of the 16th ACM Conference on Information and Knowledge Management (CIKM'07). 445--454.
[25]
Milne, D. and Witten, I. H. 2008. An effective, low-cost measure of semantic relatedness obtained from Wikipedia links. In Proceedings of the 1st AAAI Workshop on Wikipedia and Artificial Intelligence (WIKIAI'08).
[26]
ODM. 1998. http://www.dmoz.org/.
[27]
Page, L., Brin, S., Motwani, R., and Winograd, T. 1999. The PageRank citation ranking: Bringing order to the Web. Tech. Rep., SIDL-WP-1999-0120, Stanford University.
[28]
Salton, G. and Buckley, C. 1988. Term-weighting approaches in automatic text retrieval. Information processing & management. Elsevier Inf. Process. Manage. 24, 5, 513--523.
[29]
Sarwar, B., Karypis, G., Konstan, J., and Riedl, J. 2001. Item-based collaborative filtering recommendation algorithms. In Proceedings of the 10th International World Wide Web Conference (WWW'01). 285--295.
[30]
Schonhofen, P. 2006. Identify document topics using the Wikipedia category network. In Proceedings of the International Conference on Web Intelligent Conference (WI'06). 456--462.
[31]
Shieh, J. R., Hsieh, Y. H., Yeh, Y. T., Su, T. C., Lin, C. Y., and Wu, J. L. 2009. Building term suggestion relational graphs from collective intelligence. In Proceedings of the 18th International World Wide Web Conference (WWW'09). 713--721.
[32]
Silverstein, C., Henzinger M., Marais, H., and Moricz, M. 1998. Analysis of a very large AltaVista query log. Tech. rep. 1998014; Digital Systems Research Center.
[33]
Song, X., Chi, Y., Hino, K., and Tseng, B. 2007. Identifying opinion leaders in blogosphere In Proceedings of the 16th ACM Conference on Information and Knowledge Management (CIKM'07). 971--974.
[34]
Staab, S. 2001. Human language technologies for knowledge management. IEEE Intell. Syst. 16, 6, 84--94.
[35]
Symonds, M., Bruza, D. P., Sitbon, L., and Turber, L. 2011. Tensor Query Expansion: A cognitively motivated relevance model. In Proceeding of the 16th Australasian Document Computing Symposium (ADCS'11).
[36]
Trec-5. The Fifth Text REtrieval Conference (TREC-5) NIST.
[37]
Vectomova, O. and Wang, Y. 2006. A study of the effect of term proximity on query expansion. J. Inf. Sci. 32, 4, 324--333.
[38]
Wang, P. and Domeniconi, C. 2008. Building semantic kernels for text classification using Wikipedia. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'08). 713--721.
[39]
Wang, P., Domeniconi, C., and Hu, J. 2008. Using Wikipedia for co-clustering based cross-domain text classification. In Proceedings of the 2008 International Conference on Data Mining (ICDM'08). 1085--1090.
[40]
Wasserman, S. and Faust, K. 1995. Social Network Analysis: Theory and Methods. Cambridge University Press.
[41]
Watts, D. 1999. Small Worlds: The Dynamics of Networks between Order and Randomness. Princeton University Press.
[42]
Wei, X. and Croft, W. B. 2006. LDA-based document models for ad-hoc retrieval. In Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'06). 178--185.
[43]
White, S. and Smyth, P. 2003. Algorithms for estimating relative importance in networks. In Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'03). 266--275.
[44]
WordNet. 1998. An Electronic Lexical Database. The MIT Press.
[45]
Xiang, W., Cao, B., Hu, H., and Yang, Q. 2010. Bridging domains using world wide knowledge for transfer learning. IEEE Trans. Knowl. Data Eng. 22, 6, 770--783.
[46]
Yeh, E., Ramage, D., Manning, C. D., Agirre, E., and Soroa, A. 2009. WikiWalk: random walks on Wikipedia for semantic relatedness. In Proceedings of the Workshop on Graph-Based Methods for Natural Language Processing. Association for Computational Linguistics, Stroudsburg, PA, 41--49.
[47]
Xu, J. 1996. Query expansion using local and global document analysis. In Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'96). 4--11.
[48]
Yu, K., Yu, S., and Tresp, V. 2005. Soft clustering on graphs. In Advances in Neural Information Processing Systems (NIPS'05), MIT Press.
[49]
Zhang, J., Tang, J., and Li, J. 2007. Expert finding in a social network. In Advances in Databases: Concepts, Systems and Applications, Lecture Notes in Computer Science, vol. 4443, 1066--1069.
[50]
Zhang, Y., Callan, J., and Minka, T. 2002. Novelty and redundancy detection in adaptive filtering, In Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'02). 81--88.
[51]
Zhao, S., Wang, H., and Liu, T. 2010. Paraphrasing with search engine query logs. In Proceedings of the 23rd International Conference on Computational Linguistics (COLING'10). 1317--1325

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Intelligent Systems and Technology
ACM Transactions on Intelligent Systems and Technology  Volume 5, Issue 1
Special Section on Intelligent Mobile Knowledge Discovery and Management Systems and Special Issue on Social Web Mining
December 2013
520 pages
ISSN:2157-6904
EISSN:2157-6912
DOI:10.1145/2542182
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 January 2014
Accepted: 01 April 2012
Revised: 01 February 2011
Received: 01 August 2010
Published in TIST Volume 5, Issue 1

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Social network
  2. keyword expansion reranking

Qualifiers

  • Research-article
  • Research
  • Refereed

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 237
    Total Downloads
  • Downloads (Last 12 months)6
  • Downloads (Last 6 weeks)1
Reflects downloads up to 23 Jan 2025

Other Metrics

Citations

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media