Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1183614.1183682acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

Automatic computation of semantic proximity using taxonomic knowledge

Published: 06 November 2006 Publication History

Abstract

Taxonomic measures of semantic proximity allow us to compute the relatedness of two concepts. These metrics are versatile instruments required for diverse applications, e.g., the Semantic Web, linguistics, and also text mining. However, most approaches are only geared towards hand-crafted taxonomic dictionaries such as WordNet, which only feature a limited fraction of real-world concepts. More specific concepts, and particularly instances of concepts, i.e., names of artists, locations, brand names, etc., are not covered.The contributions of this paper are two fold. First, we introduce a framework based on Google and the Open Directory Project (ODP), enabling us to derive the semantic proximity between arbitrary concepts and instances. Second, we introduce a new taxonomy-driven proximity metric tailored for our framework. Studies with human subjects corroborate our hypothesis that our new metric outperforms benchmark semantic proximity metrics and comes close to human judgement.

References

[1]
Baeza-Yates, R., and Ribeiro-Neto, B. Modern Information Retrieval. Addison-Wesley, Reading, MA, USA, May 1999.
[2]
Breese, J., Heckerman, D., and Kadie, C. Empirical analysis of predictive algorithms for collaborative filtering. In Proceedings of the Fourteenth Annual Conference on Uncertainty in Artificial Intelligence (Madison, WI, USA, July 1998), Morgan Kaufmann, pp. 43--52.
[3]
Budanitsky, A., and Hirst, G. Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures. In Proceedings of the Workshop on WordNet and Other Lexical Resources (Pittsburgh, PA, USA, June 2000).
[4]
Chien, S., and Immorlica, N. Semantic similarity between search engine queries using temporal correlation. In Proceedings of the 14th International World Wide Web Conference (Chiba, Japan, May 2005), ACM Press.
[5]
Chirita, P.-A., Nejdl, W., Paiu, R., and Kohlschütter, C. Using odp metadata to personlize search. In Proceedings of the 28th International ACM SIGIR Conference on Research and Development in Information Retrieval (Salvador, Brazil, August 2005), ACM Press.
[6]
Cimiano, P., Handschuh, S., and Staab, S. Towards the self-annotating web. In Proceedings of the 13th International World Wide Web Conference (New York, NY, USA, 2004), ACM Press, pp. 462--471.
[7]
Cimiano, P., Ladwig, G., and Staab, S. Gimme' the context: context-driven automatic semantic annotation with c-pankow. In Proceedings of the 14th International World Wide Web Conference (Chiba, Japan, 2005), ACM Press, pp. 332--341.
[8]
Ganesan, P., Garcia-Molina, H., and Widom, J. Exploiting hierarchical domain structure to compute similarity. ACM Transactions on Information Systems 21, 1 (2003), 64--93.
[9]
Jiang, J., and Conrath, D. Semantic similarity based on corpus statistics and lexical taxonomy. In Proceedings of the International Conference on Research in Computational Linguistics (Taiwan, 1997).
[10]
Li, Y., Bandar, Z., and McLean, D. An approach for measuring semantic similarity between words using multiple information sources. IEEE Transactions on Knowledge and Data Engineering 15, 4 (2003), 871--882.
[11]
Lin, D. An information-theoretic definition of similarity. In Proceedings of the Fifteenth International Conference on Machine Learning (San Francisco, CA, USA, 1998), Morgan Kaufmann, pp. 296--304.
[12]
Maguitman, A., Menczer, F., Roinestad, H., and Vespignani, A. Algorithmic detection of semantic similarity. In Proceedings of the 14th International World Wide Web Conference (Chiba, Japan, 2005), ACM Press, pp. 107--116.
[13]
Miller, G. Wordnet: A lexical database for english. Communications of the ACM 38, 11 (1995), 39--41.
[14]
Miller, G., and Charles, W. Contextual correlates of semantic similarity. Language and Cognitive Processes 6, 1 (February 1991), 1--28.
[15]
Resnik, P. Using information content to evaluate semantic similarity in a taxonomy. In Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence (Montreal, Canada, 1995), pp. 448--453.
[16]
Resnik, P. Semantic similarity in a taxonomy: An information-based measure and its application to problems of ambiguity in natural language. Journal of Artificial Intelligence Research 11 (1999), 95--130.
[17]
Rubenstein, H., and Goodenough, J. Contextual correlates of synonymy. Communications of the ACM 8, 10 (1965), 627--633.
[18]
Sarwar, B., Karypis, G., Konstan, J., and Riedl, J. Item-based collaborative filtering recommendation algorithms. In Proceedings of the Tenth International World Wide Web Conference (Hong Kong, China, May 2001).
[19]
van Rijsbergen, K. Information Retrieval. Butterworths, London, UK, 1975.
[20]
Vlachos, M., Meek, C., Vagena, Z., and Gunopulos, D. Identifying similarities, periodicities and bursts for online search queries. In Proceedings of the 2004 ACM SIGMOD International Conference on Management of Data (Paris, France, 2004), ACM Press, pp. 131--142.
[21]
Wen, J.-R., Nie, J.-Y., and Zhang, H.-J. Clustering user queries of a search engine. In Proceedings of the 10th International World Wide Web Conference (Hong Kong, China, 2001), ACM Press, pp. 162--168.
[22]
Ziegler, C.-N., Lausen, G., and Schmidt-Thieme, L. Taxonomy-driven computation of product recommendations. In Proceedings of the 2004 ACM CIKM Conference on Information and Knowledge Management (Washington, D.C., USA, November 2004), ACM Press, pp. 406--415.
[23]
Ziegler, C.-N., McNee, S., Konstan, J., and Lausen, G. Improving recommendation lists through topic diversification. In Proceedings of the 14th International World Wide Web Conference/(Chiba, Japan, May 2005), ACM Press.

Cited By

View all
  • (2023)Semantic relatedness in DBpedia: A comparative and experimental assessmentInformation Sciences10.1016/j.ins.2022.11.025621(474-505)Online publication date: Apr-2023
  • (2023)A Game with a Purpose for Building Crowdsourced Semantic Relations Datasets for Named EntitiesIntelligent Computing10.1007/978-3-031-37963-5_30(422-439)Online publication date: 20-Aug-2023
  • (2019)A survey of semantic relatedness evaluation datasets and proceduresArtificial Intelligence Review10.1007/s10462-019-09796-3Online publication date: 23-Dec-2019
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '06: Proceedings of the 15th ACM international conference on Information and knowledge management
November 2006
916 pages
ISBN:1595934332
DOI:10.1145/1183614
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 06 November 2006

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. accuracy
  2. data extraction
  3. metrics
  4. semantic similarity
  5. taxonomy

Qualifiers

  • Article

Conference

CIKM06
CIKM06: Conference on Information and Knowledge Management
November 6 - 11, 2006
Virginia, Arlington, USA

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)1
Reflects downloads up to 13 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Semantic relatedness in DBpedia: A comparative and experimental assessmentInformation Sciences10.1016/j.ins.2022.11.025621(474-505)Online publication date: Apr-2023
  • (2023)A Game with a Purpose for Building Crowdsourced Semantic Relations Datasets for Named EntitiesIntelligent Computing10.1007/978-3-031-37963-5_30(422-439)Online publication date: 20-Aug-2023
  • (2019)A survey of semantic relatedness evaluation datasets and proceduresArtificial Intelligence Review10.1007/s10462-019-09796-3Online publication date: 23-Dec-2019
  • (2017)Job to Major (J2M): An Open Source Based Application2017 International Conference on Computer Network, Electronic and Automation (ICCNEA)10.1109/ICCNEA.2017.58(101-107)Online publication date: Sep-2017
  • (2016)LWCRNeurocomputing10.1016/j.neucom.2016.08.045216:C(816-843)Online publication date: 5-Dec-2016
  • (2016)Derivation of "is a" taxonomy from Wikipedia Category GraphEngineering Applications of Artificial Intelligence10.1016/j.engappai.2016.01.03350:C(265-286)Online publication date: 1-Apr-2016
  • (2013)Enabling smart cities through a cognitive management framework for the internet of thingsIEEE Communications Magazine10.1109/MCOM.2013.652560251:6(102-111)Online publication date: Jun-2013
  • (2012)Semantic Learning Service PersonalizedInternational Journal of Computational Intelligence Systems10.1080/18756891.2012.6705285:1(163-172)Online publication date: Feb-2012
  • (2012)Recent advances in methods of lexical semantic relatedness – a surveyNatural Language Engineering10.1017/S135132491200012519:04(411-479)Online publication date: 4-May-2012
  • (2012)Linked data-based concept recommendationProceedings of the 9th international conference on The Semantic Web: research and applications10.1007/978-3-642-30284-8_9(24-38)Online publication date: 27-May-2012
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media