Abstract
This research suggests a method for query expansion on Arabic Information Retrieval using Expectation Maximization (EM). We employ the EM algorithm in the process of selecting relevant terms for expanding the query and weeding out the non-related terms. We tested our algorithm on INFILE test collection of CLLEF2009, and the experiments show that query expansion that considers similarity of terms both improves precision and retrieves more relevant documents. The main finding of this research is that we can increase the recall while keeping the precision at the same level by this method.
Chapter PDF
Similar content being viewed by others
References
Staff, C., Muscat, R.: Expanding Query Terms in Context. In: Proceedings of Computer Science Annual Workshop (CSAW 2004), pp. 106–108. University of Malta (2004)
Bacchin, M., Melucci, M.: Expanding Queries using Stems and Symbols. In: Proceedings of the 13th Text REtrieval Conference (TREC 2004), Genomics Track, Gaithersburg, MD, USA (November 2004)
MartÃnez-Fernández, J.L., GarcÃa-Serrano, A.M., Villena Román, J., MartÃnez, P.: Expanding Queries Through Word Sense Disambiguation. In: Peters, C., Clough, P., Gey, F.C., Karlgren, J., Magnini, B., Oard, D.W., de Rijke, M., Stempfhuber, M. (eds.) CLEF 2006. LNCS, vol. 4730, pp. 613–616. Springer, Heidelberg (2007)
Manning, C.D., Raghavan, P., Schtze, H.: Relevance feedback and query expansion. In: Introduction to Information Retrieval. Cambridge University Press, New York (2008)
Crestani, F.: Comparing neural and probabilistic relevance feedback in an interactive Information Retrieval system. In: Proceedings of the IEEE International Conference on Neural Networks, Orlando, Florida, USA, pp. 3426–2430 (June 1994)
http://trec.nist.gov (visited on September 2010)
Magennis, M., van Rijsbcrgen, C.: The potential and actual effectiveness of interactive query expansion. In: Proceedings of ACM Special Interest Group in Information Retrieval Conference (SIGIR 1997), pp. 324–332 (1997)
Klink, S., Hust, A., Junker, M., Dengel, A.: Improving Document Retrieval by Automatic Query Expansion Using Collaborative Learning of Term-Based Concepts. Document Analysis Systems, 376–387 (2002)
Khafajeh, H., Kanaan, G., Yaseen, M., Al-Sarayreh, B.: Automatic Query Expansion for Arabic Text Retrieval Based on Association and Similarity Thesaurus. In: Proceedings he European, Mediterranean & Middle Eastern Conference on Information Systems (EMCIS), Abu Dhabi, UAE (2010)
Rachidi, T., Bouzoubaa, M., ElMortaji, L., Boussouab, B., Bensaid, A.: Arabic user search Query correction and expansion. In: Proceedings of COPSTIC 2003, Rabat, December 11-13 (2003)
Bilotti, M.: Query expansion techniques for question answering. Master’s thesis, Massachusetts Institute of Technology (2004)
Al-Shalabi, R., Kanaan, G., Yaseen, M., Al-Sarayreh, B., Al-Naji, N.: Arabic Query Expansion Using Interactive Word Sense Disambiguation. In: 2nd International Conference on Arabic Language Resources & Tools, MEDAR, Cairo, Egypt, pp. 156–158 (April 2009)
Zitouni, A., Damankesh, A., Barakati, F., Atari, M., Watfa, M., Oroumchian, F.: Corpus-Based Arabic Stemming Using N-Grams. In: Cheng, P.-J., Kan, M.-Y., Lam, W., Nakov, P. (eds.) AIRS 2010. LNCS, vol. 6458, pp. 280–289. Springer, Heidelberg (2010)
Attia, M.: Arabic tokenization system. In: Proceedings of the Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources, pp. 65–72 (2007)
Farghaly, A., Shaalan, K.: Arabic Natural Language Processing: Challenges and Solutions. ACM Transactions on Asian Language Information Processing (TALIP) 8(4), 1–22 (2009)
Habash, N.Y.: Introduction to Arabic Natural Language Processing (Synthesis lectures on human language technologies). Morgan & Claypool (2010)
Besançon, R., Chaudiron, S., Mostefa, D., Timimi, I., Choukri, K.: The INFILE Project: a Cross-lingual Filtering Systems Evaluation Campaign. In: Proceedings of the Sixth International Language Resources and Evaluation (LREC 2008), Marrakech, Morocco (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 IFIP International Federation for Information Processing
About this paper
Cite this paper
Shaalan, K., Al-Sheikh, S., Oroumchian, F. (2012). Query Expansion Based-on Similarity of Terms for Improving Arabic Information Retrieval. In: Shi, Z., Leake, D., Vadera, S. (eds) Intelligent Information Processing VI. IIP 2012. IFIP Advances in Information and Communication Technology, vol 385. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32891-6_22
Download citation
DOI: https://doi.org/10.1007/978-3-642-32891-6_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32890-9
Online ISBN: 978-3-642-32891-6
eBook Packages: Computer ScienceComputer Science (R0)