Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Prior Art Retrieval Using the Claims Section as a Bag of Words

  • Conference paper
Multilingual Information Access Evaluation I. Text Retrieval Experiments (CLEF 2009)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6241))

Included in the following conference series:

Abstract

In this paper we describe our participation in the 2009 CLEF-IP task, which was targeted at prior-art search for topic patent documents. We opted for a baseline approach to get a feeling for the specifics of the task and the documents used. Our system retrieved patent documents based on a standard bag-of-words approach for both the Main Task and the English Task. In both runs, we extracted the claim sections from all English patents in the corpus and saved them in the Lemur index format with the patent IDs as DOCIDs. These claims were then indexed using Lemur’s BuildIndex function. In the topic documents we also focused exclusively on the claims sections. These were extracted and converted to queries by removing stopwords and punctuation. We did not perform any term selection or query expansion. We retrieved 100 patents per topic using Lemur’s RetEval function, retrieval model TF-IDF. Compared to the other runs submitted to the track, we obtained good results in terms of nDCG (0.46) and moderate results in terms of MAP (0.054).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Roda, G., Tait, J., Piroi, F., Zenz, V.: CLEF-IP 2009: retrieval experiments in the Intellectual Property domain. In: CLEF working notes 2009 (2009)

    Google Scholar 

  2. Piroi, F., Roda, G., Zenz, V.: CLEF-IP 2009 Track Guidelines. Technical report, Information Retrieval Facility (2009)

    Google Scholar 

  3. Graf, E., Azzopardi, L.: A methodology for building a test collection for prior art search. In: Proceedings of the 2nd International Workshop on Evaluating Information Access (EVIA), pp. 60–71 (2008)

    Google Scholar 

  4. Shinmori, A., Okumura, M., Marukawa, Y., Iwayama, M.: Patent claim processing for readability: structure analysis and term explanation. In: Proceedings of the ACL-2003 workshop on Patent corpus processing, pp. 56–65. Association for Computational Linguistics, Morristown (2003)

    Chapter  Google Scholar 

  5. Iwayama, M., Fujii, A., Kando, N., Marukawa, Y.: Evaluating patent retrieval in the third NTCIR workshop. Information Processing Management 42(1), 207–221 (2006)

    Article  Google Scholar 

  6. Graf, E., Azzopardi, L., Van Rijsbergen, K.: Automatically Generating Queries for Prior Art Search. In: CLEF working notes 2009 (2009)

    Google Scholar 

  7. Gobeill, J., Theodoro, D., Ruch, P.: Exploring a wide Range of simple Pre and Post Processing Strategies for Patent Searching in CLEF IP 2009. In: CLEF working notes 2009 (2009)

    Google Scholar 

  8. Piroi, F., Roda, G., Zenz, V.: CLEF-IP 2009 Evaluation Summary. Technical report, Information Retrieval Facility (2009)

    Google Scholar 

  9. Tseng, Y., Wu, Y.: A study of search tactics for patentability search: a case study on patent engineers. In: Proceeding of the 1st ACM workshop on Patent information retrieval, pp. 33–36 (2008)

    Google Scholar 

  10. D’hondt, E., Verberne, S., Oostdijk, N., Boves, L.: Re-ranking based on Syntactic Dependencies in Prior-Art Retrieval. In: Proceedings of the Dutch-Belgium Information Retrieval Workshop (to appear, 2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Verberne, S., D’hondt, E. (2010). Prior Art Retrieval Using the Claims Section as a Bag of Words. In: Peters, C., et al. Multilingual Information Access Evaluation I. Text Retrieval Experiments. CLEF 2009. Lecture Notes in Computer Science, vol 6241. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15754-7_60

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-15754-7_60

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-15753-0

  • Online ISBN: 978-3-642-15754-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics