Abstract
This paper presents some initial explorations into how to compute term similarity across different domains, or in the present case, scientific disciplines. In particular we explore the concepts of polysemy across disciplines, where the same term can have different meaning across different discipline. This can lead to confusion and/or erroneous query expansion, if the domain is not properly identified. Typical bag-of-words systems are not equipped to highlight such differences as terms would have a single representation. Identifying the synonymy of terms across different domains is also a difficult problem for typical bag-of-words systems, as they use surrounding words that will usually also be different across domains. Yet discovering such similarities across domains can support tasks such as literature discovery. We propose an approach that integrates knowledge based distances into a distributional semantics framework and demonstrate its efficiency on a hand-crafted dataset.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Blitzer, J., Mcdonald, R., Pereira, F.: Domain adaptation with structural correspondence learning. In: Proceedings of EMNLP, pp. 120–128 (2006)
Bullinaria, J.A., Levy, J.P.: Extracting semantic representations from word co-occurrence statistics: stop-lists, stemming, and SVD. Behav. Res. Methods 44, 890–907 (2012)
Gopalan, R., Li, R., Chellappa, R.: Domain adaptation for object recognition: an unsupervised approach. In: Proceedings of ICCV, pp. 999–1006 (2011)
Hu, X., Cai, Z., Graesser, A.C., Ventura, M.: Similarity between semantic spaces. In: Proceedings of CogSci 2005, pp. 995–1000 (2005)
Kamps, J., Marx, M., Mokken, R.J., De Rijke, M.: Using wordnet to measure semantic orientations of adjectives. In: Proceedings of LREC, pp. 1115–1118 (2004)
Kim, S.N., Cavedon, L.: Classifying domain-specific terms using a dictionary. In: Proceedings of ALTA, p. 57 (2011)
Bentivogli, L., Forner, P., Magnini, B., Pianta, E.: Revising the wordnet domains hierarchy: semantics, coverage and balancing. In: Proceedings of the Workshop on Multilingual Linguistic Ressources, pp. 101–108. ACL (2004)
Mikolov, T., Zweig, G.: Context dependent recurrent neural network language model. In: Proceedings of SLT, pp. 234–239 (2012)
Pan, S.J., Kwok, J.T., Yang, Q.: Transfer learning via dimensionality reduction. In: Proceedings of AAAI, pp. 677–682 (2008)
Turney, P.D., Pantel, P.: From frequency to meaning: vector space models of semantics. J. Artif. Intell. Res. 37(1), 141–188 (2010)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Sheikhadbolkarim, H., Sitbon, L. (2015). Explorations of Cross-Disciplinary Term Similarity. In: Zuccon, G., Geva, S., Joho, H., Scholer, F., Sun, A., Zhang, P. (eds) Information Retrieval Technology. AIRS 2015. Lecture Notes in Computer Science(), vol 9460. Springer, Cham. https://doi.org/10.1007/978-3-319-28940-3_34
Download citation
DOI: https://doi.org/10.1007/978-3-319-28940-3_34
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-28939-7
Online ISBN: 978-3-319-28940-3
eBook Packages: Computer ScienceComputer Science (R0)