Abstract
Structured document retrieval requires the ranking of document elements. Previous approaches either aggregate term weights or retrieval status values, or propose alternatives to idf, for example, ief (inverse element frequency). We propose and investigate in this paper a new approach: Context-specific idf, which is, in contrast to aggregation-based ranking functions, parameter-free.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Callan, J.P.: Passage-level evidence in document retrieval. In: Proceedings of the Seventeenth Annual International ACM SIGIR, pp. 302–310 (1994)
Callan, J.P., Lu, Z., Croft, W.B.: Searching distributed collections with inference networks. In: Proceedings of the 18th Annual International ACM SIGIR, pp. 21–29 (1995)
Church, K., Gale, W.: Inverse document frequency (idf): A measure of deviation from poisson. In: Proceedings of the Third Workshop on Very Large Corpora, pp. 121–130 (1995)
Fuhr, N., Grossjohann, K.: XIRQL: A query language for information retrieval in XML documents. In: Proceedings of the 24th Annual International ACM SIGIR. ACM, New York (2001)
Grabs, T., Schek, H.J.: Generating vector spaces on-the-fly for flexible xml retrieval. In: Proceedings of the ACM SIGIR Workshop on XML and Information Retrieval, Tampere, Finland, pp. 4–13 (2002)
Mass, Y., Mandelbrod, M.: Retrieving the most relevant xml component. In: Proceedings of the Second Workshop of INEX, Germany, pp. 53–58 (2003)
Ogilvie, P., Callan, J.: Language models and structured document retrieval (2003)
Roelleke, T., Lalmas, M., et al.: The accessibility dimension for structured document retrieval. In: Proceedings of the BCS-IRSG European ECIR (2002)
Schlieder, T., Meuss, H.: Querying and ranking xml documents. J. Am. Soc. Inf. Sci. Technol. 53(6), 489–503 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, J., Roelleke, T. (2006). Context-Specific Frequencies and Discriminativeness for the Retrieval of Structured Documents. In: Lalmas, M., MacFarlane, A., Rüger, S., Tombros, A., Tsikrika, T., Yavlinsky, A. (eds) Advances in Information Retrieval. ECIR 2006. Lecture Notes in Computer Science, vol 3936. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11735106_69
Download citation
DOI: https://doi.org/10.1007/11735106_69
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33347-0
Online ISBN: 978-3-540-33348-7
eBook Packages: Computer ScienceComputer Science (R0)