Abstract
This paper presents the 2005 Miracle’s team approach to the Ad-Hoc Information Retrieval tasks. The goal for the experiments this year was twofold: to continue testing the effect of combination approaches on information retrieval tasks, and improving our basic processing and indexing tools, adapting them to new languages with strange encoding schemes. The starting point was a set of basic components: stemming, transforming, filtering, proper nouns extraction, paragraph extraction, and pseudo-relevance feedback. Some of these basic components were used in different combinations and order of application for document indexing and for query processing. Second-order combinations were also tested, by averaging or selective combination of the documents retrieved by different approaches for a particular query. In the multilingual track, we concentrated our work on the merging process of the results of monolingual runs to get the overall multilingual result, relying on available translations. In both cross-lingual tracks, we have used available translation resources, and in some cases we have used a combination approach.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Aoe, J.-I., Morimoto, K., Sato, T.: An Efficient Implementation of Trie Structures. Software Practice and Experience 22(9), 695–721 (1992)
CLEF 2005 Multilingual Information Retrieval resources page [Visited 11/08/2005], http://www.computing.dcu.ie/~gjones/CLEF2005/Multi-8/
González, J.C., Goñi-Menoyo, J.M., Villena-Román, J.: MIRACLE’s 2005 Approach to Cross-lingual Information Retrieval. In: Working Notes for the CLEF 2005 Workshop. Vienna, Austria (2005) [Visited 05/11/2005], Online http://clef.isti.cnr.it/2005/working_notes/workingnotes2005/gon-zalez05.pdf
Goñi-Menoyo, J.M., González-Cristóbal, J.C., Fombella-Mourelle, J.: An optimised trie index for natural language processing lexicons. MIRACLE Technical Report. Universidad Politécnica de Madrid (2004)
Goñi-Menoyo, J. M., González, J. C., Villena-Román, J.: MIRACLE’s 2005 Approach to Monolingual Information Retrieval. In: Working Notes for the CLEF 2005 Workshop. Vienna, Austria (2005)[Visited 05/11/2005] On line http://clef.isti.cnr.it/2005/working_notes/workingnotes2005/menoyo05.pdf
Di Nunzio, G.M., Ferro, N., Jones, G.J.F.: CLEF 2005: Ad Hoc Track Overview. In: Peters, C., Gey, F.C., Gonzalo, J., Müller, H., Jones, G.J.F., Kluck, M., Magnini, B., de Rijke, M., Giampiccolo, D. (eds.) CLEF 2005. LNCS, vol. 4022, pp. 11–36. Springer, Heidelberg (2006)
Peters, C.: What happened in CLEF 2005. In: Proceedings of the Cross Language Evaluation Forum 2005. Lecture Notes in Computer science, vol. 4022, Springer, Heidelberg (2006)
Porter, M.: Snowball stemmers and resources page [Visited 13/07/2005], Online http://www.snowball.tartarus.org
Robertson, S.E., et al.: Okapi at TREC-3. In: Harman, D.K. (ed.) Overview of the Third Text REtrieval Conference (TREC-3), NIST, Gaithersburg, MD (1995)
Savoy, J.: Report on CLEF-2003 Multilingual Tracks. In: Peters, C., Gonzalo, J., Braschler, M., Kluck, M. (eds.) CLEF 2003. LNCS, vol. 3237, pp. 64–73. Springer, Heidelberg (2004)
University of Neuchatel. Page of resources for CLEF (Stopwords, transliteration, stemmers..) [Visited 13/07/2005], On line http://www.unine.ch/info/clef
Xapian: an Open Source Probabilistic Information Retrieval library [Visited 13/07/2005], On line http://www.xapian.org
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Goñi-Menoyo, J.M., González-Cristóbal, J.C., Villena-Román, J. (2006). MIRACLE at Ad-Hoc CLEF 2005: Merging and Combining Without Using a Single Approach. In: Peters, C., et al. Accessing Multilingual Information Repositories. CLEF 2005. Lecture Notes in Computer Science, vol 4022. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11878773_4
Download citation
DOI: https://doi.org/10.1007/11878773_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45697-1
Online ISBN: 978-3-540-45700-8
eBook Packages: Computer ScienceComputer Science (R0)