Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

A Universal Model for XML Information Retrieval

  • Conference paper
Advances in XML Information Retrieval (INEX 2004)

Abstract

This paper presents an approach for extending the vector space model (VSM) to perform XML retrieval. The model is extended to support important aspects of XML structural and semantic information such as element nesting level, matching tag names in the query and the collection and the relation between tag names and content of an element. Potential use of the model for heterogeneous as well as for the unstructured collection is also shown. We compared our model with the standard vector space model and obtained a gain for unstructured and structured queries. For unstructured collections the vector space model effectiveness is preserved.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Abiteboul, S., Buneman, P., Suciu, D.: Data on the Web – From Relations to Semistructured Data in XML, pp. 27–50. Mogan Kaufmann Publishers, San Francisco (2000)

    Google Scholar 

  2. Abolhassani, M., Grobjohann, K., Fuhr, N.: Content-oriented XML Retrieval with HyREX. In: INEX 2002 Workshop Proceedings, Duisburg, pp. 26–32 (2002)

    Google Scholar 

  3. Bray, T., Paoli, J., Sperberg-McQueen, C.M., Maler, E.: Extensible Markup Language (XML) 1.0, October 2000. W3C Recommendation, 2nd edn., October 6 (2000), http://www.w3.org/TR/REC-xml

  4. Fuhr, N., Lalmas, M.: INEX document Collection, Duisburg (2004), http://inex.is.informatik.uni-duisburg.de:2004/internal/

  5. Kazai, G., Lalmas, M., Malik, S.: INEX 2003 Guidelines for Topic Development. In: INEX 2003 Workshop Proceedings, Duisburg, pp. 153–154 (2003)

    Google Scholar 

  6. Mandelbrod, M., Mass, Y.: Retrieving the most relevant XML Components. In: INEX 2003 Workshop Proceedings, Duisburg, pp. 58–64 (2003)

    Google Scholar 

  7. Ribeiro-Neto, B., Baeza-Yates, R.: Modern Information Retrieval, pp. 27–30. Addison Wesley, Reading (1999)

    Google Scholar 

  8. Salton, G., Lesk, M.E.: Computer evaluation of indexing and text processing. Journal of the ACM 15(1), 8–36 (1968)

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Azevedo, M.I.M., Amorim, L.P., Ziviani, N. (2005). A Universal Model for XML Information Retrieval. In: Fuhr, N., Lalmas, M., Malik, S., SzlĂ¡vik, Z. (eds) Advances in XML Information Retrieval. INEX 2004. Lecture Notes in Computer Science, vol 3493. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11424550_25

Download citation

  • DOI: https://doi.org/10.1007/11424550_25

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-26166-7

  • Online ISBN: 978-3-540-32053-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics