Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Patent Retrieval

Published: 20 February 2013 Publication History

Abstract

Intellectual property and the patent system in particular have been extremely present in research and discussion, even in the public media, in the last few years. Without going into any controversial issues regarding the patent system, we approach a very real and growing problem: searching for innovation. The target collection for this task does not consist of patent documents only, but it is in these documents that the main difference is found compared to web or news information retrieval. In addition, the issue of patent search implies a particular user model and search process model. This review is concerned with how research and technology in the field of Information Retrieval assists or even changes the processes of patent search. It is a survey of work done on patent data in relation to Information Retrieval in the last 20–25 years. It explains the sources of difficulty and the existing document processing and retrieval methods of the domain, and provides a motivation for further research in the area.

References

[1]
"The clueweb dataset," http://lemurproject.org/clueweb09.php/index.php.
[2]
"Manual of patent examination procedure, section 608.01(m)," Revision July 2010, http://www.uspto.gov/web/offices/pac/mpep/index.htm.
[3]
"Understanding intellectual property," http://www.wipo.int/about-ip/en/.
[4]
PATENT '03: Proceedings of the ACL-2003 Workshop on Patent Corpus Processing. Association for Computational Linguistics, 2003.
[5]
PaIR '11: Proceedings of the Workshop on Patent Information Retrieval. ACM, 2011.
[6]
S. Adams, "Comparing the IPC and the US classification systems for the patent searcher," World Patent Information, vol. 23, pp. 15-23, 2001.
[7]
S. Adams, "Electronic non-text material in patent applications-some questions for patent offices, applicants and searchers," World Patent Information, vol. 27, pp. 99-103, 2005.
[8]
S. Adams, "New methodologies for patent searching; what do we need?," in Proceedings of the Global Symposium of Intellectual Property Authorities, 2009.
[9]
S. Adams, "The text, the full text and nothing but the text: Part 1 -- standards for creating textual information in patent documents and general search implications," World Patent Information, vol. 32, pp. 22-29, 2010.
[10]
S. Adams, "The text, the full text and nothing but the text: Part 2 -- standards for creating textual information in patent documents and general search implications," World Patent Information, vol. 32, pp. 120-128, 2010.
[11]
S. Adams, Information Sources in Patents. G. Saur, 3rd ed., 2011.
[12]
M. Agatonovic, N. Aswani, K. Bontcheva, H. Cunningham, T. Heitz, Y. Li, I. Roberts, and V. Tablan, "Large-scale, parallel automatic patent annotation," in Proceedings of Workshop on Patent Information Retrieval, pp. 1-8, 2008.
[13]
F. Aiolli, "A preference model for structured supervised learning tasks," in Proceedings of IEEE International Conference on Data Mining, pp. 557-560, 2005.
[14]
F. Aiolli, R. Cardin, F. Sebastiani, and A. Sperduti, "Preferential text classification: Learning algorithms and evaluation measures," Information Retrieval, vol. 12, pp. 559-580, 2009.
[15]
F. Aiollo and A. Sperduti, "Learning preferences for multiclass problems," in Advances in Neural Information Processing Systems, pp. 17-24, 2005.
[16]
D. Alberts, C. B. Yang, D. Fobare-DePonio, K. Koubek, S. Robins, M. Rodgers, E. Simmons, and D. DeMarco, "Introduction to patent searching -- practical experience and requirements for searching the patent space," in Current Challenges in Patent Information Retrieval, (M. Lupu, K. Mayer, J. Tait, and A. J. Trippe, eds.), Springer, pp. 3-43, 2011.
[17]
L. Andersson, "A vector space analysis of Swedish patent claims with different linguistic indices," in Proceedings of Workshop on Patent Information Retrieval, ACM, pp. 47-56, 2010.
[18]
M. Annies, "Full-text prior art and chemical structure searching in e-journals and on the internet -- a patent information professional's perspective," World Patent Information, vol. 31, pp. 278-284, 2009.
[19]
K. H. Atkinson, "Towards a more rational patent search paradigm," in Proceedings of Workshop on Patent Information Retrieval, 2008.
[20]
L. Azzopardi, W. Vanderbauwhede, and H. Joho, "Search system requirements of patent analysts," in Proceedings of International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 775-776, 2010.
[21]
L. Azzopardi and V. Vinay, "Accessibility in information retrieval," Advances in Information Retrieval, pp. 482-489, 2008.
[22]
R. H. Baayen, R. Piepenbrock, and L. Gulikers, "The CELEX Lexical database (release 2)," 1995.
[23]
R. Bache and L. Azzopardi, "Improving access to large patent corpora," in Transactions on Large-Scale Data- and Knowledge-Centered Systems II, Springer, pp. 103-121, 2010.
[24]
R. Baeza-Yates and B. Ribeiro-Neto, Modern Information Retrieval. Addison Wesley, 2nd ed., 2010.
[25]
J. M. Barnard and G. M. Downs, "Use of Markush structure techniques to avoid enumeration in diversity analysis of large combinatorial libraries," http://www.daylight.com/meetings/mug97/Barnard/970227JB.html, last checked, March 2012.
[26]
J. M. Barnard and P. M. Wright, "Towards in-house searching of Markush structures from patents," World Patent Information, vol. 31, pp. 97-103, 2009.
[27]
S. Bashir and A. Rauber, "On the relationship between query characteristics and IR functions retrieval bias," Journal of the American Society for Information Science and Technology, vol. 62, pp. 1515-1532, 2011.
[28]
D. Becks, T. Mandl, and C. Womser-Hacker, "Phrases or terms? the impact of different query types," in Cross-Language Evaluation Forum (Notebook Papers/Labs/Workshop), 2010.
[29]
K. Benzineb and J. Guyot, "Automated patent classification," in Current Challenges in Patent Information Retrieval, (M. Lupu, K. Mayer, J. Tait, and A. J. Trippe, eds.), Springer, pp. 239-261, 2011.
[30]
K. Beuls, B. Pflugfelder, and A. Hanbury, "Comparative analysis of balanced winnow and svm in large scale patent categorization," in Proceedings of Dutch-Belgian Information Retrieval Workshop (DIR), pp. 8-15, 2010.
[31]
N. Bhatti and A. Hanbury, "Image search in patents: A review," International Journal on Document Analysis and Recognition (IJDAR), 2012.
[32]
M. Blackman, R. Honeywood, and K. Milne, "Searching organic chemical structures: A comparison of online access to chemical abstracts (CAS) and the corresponding United Kingdom Patent Office search system (c2c)," World Patent Information, vol. 8, pp. 20-28, 1986.
[33]
A. Blanchard, "Understanding and customizing stopword lists for enhanced patent mapping," World Patent Information, vol. 29, pp. 308-316, 2007.
[34]
G. H. Blosser, N. Arshadi, and S. Agrawal, "A critical assessment of the USPTO policies toward small entity patent applications," Technology and Information, vol. 13, 2011.
[35]
D. Bonino, A. Ciaramella, and F. Corno, "Review of the state-of-the-art in patent information and forthcoming evolutions in intelligent patent informatics," World Patent Information, vol. 32, pp. 30-38, 2010.
[36]
N. Bouayad-Agha, G. Casamayor, G. Ferraro, S. Mille, V. Vidal, and L. Wanner, "Improving the comprehension of legal documentation: The case of patent claims," in Proceedings of International Conference on Artificial Intelligence and Law, pp. 78-87, 2009.
[37]
A. Browne, A. McCray, and S. Srinivasan, "The specialist lexicon," Technical Report, National Library of Medicine, 2000.
[38]
A. Buchanan, N. H. Packard, and M. A. Bedau, "Measuring the evolution of the drivers of technological innovation in the patent record," Artificial Life, vol. 17, pp. 109-122, 2011.
[39]
L. Cai and T. Hofmann, "Hierarchical document categorization with support vector machines," in Proceedings of International Conference on Information and Knowledge Management, pp. 78-87, 2004.
[40]
L. Cai and T. Hofmann, "Exploiting known taxonomies in learning overlapping concepts," in Proceedings of International Joint Conference on Artifical intelligence, pp. 714-719, 2007.
[41]
A. Ceausu, J. Tinsley, A. Way, J. Zhang, and P. Sheridan, "Experiments on domain adaptation for patent machine translation in the pluto project," in Proceedings of Conference of the European Association for Machine Translation, 2011.
[42]
A. Chakrabarti, I. Dror, and N. Eakabuse, "Interorganizational transfer of knowledge: An analysis of patent citations of a defense firm," IEEE Transactions on Engineering Management, vol. 40, pp. 91-94, 1993.
[43]
L. Chen, N. Tokuda, and H. Adachi, "A patent document retrieval system addressing both semantic and syntactic properties," in Proceedings ACL Workshop on Patent Corpus Processing, 2003.
[44]
A. Chu, S. Sakurai, and A. F. Cardenas, "Automatic detection of treatment relationships for patent retrieval," in Proceedings of Workshop on Patent Information Retrieval, ACM, pp. 9-14, 2008.
[45]
G. Csurka, J.-M. Renders, and G. Jacquet, "XRCE's participation at patent image classification and image-based patent retrieval tasks of the CLEF-IP 2011," in Cross-Language Evaluation Forum (Notebook papers/ Labs/Workshop), 2011.
[46]
H. Cunningham, V. Tablan, I. Roberts, M. Greenwood, and N. Aswani, "Information extraction and semantic annotation for multi-paradigm information management," in Current Challenges in Patent Information Retrieval, (M. Lupu, K. Mayer, J. Tait, and A. J. Trippe, eds.), Springer, pp. 307-327, 2011.
[47]
M. Das, R. Manmatha, and E. M. Riseman, "Indexing flower patent images using domain knowledge," IEEE Intelligent Systems, vol. 14, pp. 24-33, 1999.
[48]
R. Datta, D. Joshi, J. Li, and J. Z. Wang, "Image retrieval: Ideas, influences, and trends of the new age," ACM Computing Surveys, vol. 40, pp. 5:1-5:60, 2008.
[49]
M.-C. de Marneffe, B. MacCartney, and C. D. Manning, "Generating typed dependency parses from phrase structure parses," in Proceedings of International Conference on Language Resources and Evaluation (LREC), 2006.
[50]
E. D'hondt and S. Verbene, "CLEF-IP 2010: Prior art retrieval using the different sections in patent documents," in Cross-Language Evaluation Forum (Notebook Papers/LABs/Workshops), 2010.
[51]
E. D'hondt, S. Verberne, W. Alink, and R. Cornacchia, "Combining document representations for prior-art retrieval," in Cross-Language Evaluation Forum (Notebook Papers/Labs/Workshop), 2011.
[52]
C. Emmerich, "Comparing first level patent data with value-added patent information: A case study in the pharmaceutical field," World Patent Information, vol. 31, pp. 117-122, 2009.
[53]
C. J. Fall, A. Törcsvári, K. Benzineb, and G. Karetka, "Automated categorization in the international patent classification," SIGIR Forum, vol. 37, pp. 10-25, 2003.
[54]
I. V. Filippov and M. C. Nicklaus, "Optical structure recognition software to recover chemical information: Osra, an open source solution," Journal of Chemical Information and Modeling, vol. 49, pp. 740-743, 2009.
[55]
A. Fujii, "Enhancing patent retrieval by citation analysis," in Proceedings International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 793-794, 2007.
[56]
A. Fujii and T. Ishikawa, "Patent retrieval experiments at ULIS," in Proceedings of NII Test Collection for IR Systems-3, 2002.
[57]
A. Fujii, M. Iwayama, and N. Kando, "Overview of patent retrieval task at NTCIR-4," in Proceedings of NII Test Collection for IR Systems-4, 2004.
[58]
A. Fujii, M. Iwayama, and N. Kando, "Test collections for patent-to-patent retrieval and patent map generation in NTCIR-4 workshop," in Proceedings of International Conference on Language Resources and Evaluation (LREC), 2004.
[59]
A. Fujii, M. Iwayama, and N. Kando, "Introduction to the special issue on patent processing," Information Processing and Management, vol. 43, pp. 1149-1153, 2007.
[60]
A. Fujii, M. Iwayama, and N. Kando, "Overview of the patent retrieval task at the NTCIR-6 workshop," in Proceedings of NII Test Collection for IR Systems- 6, 2007.
[61]
S. Fujita, "Revisiting document length hypotheses: A comparative study of Japanese newspaper and patent retrieval," ACM Transactions on Asian Language Information Processing, vol. 4, pp. 207-235, June 2005.
[62]
G. W. Furnas, S. T. Dumais, T. K. Landauer, R. A. Harshman, L. A. Streeter, and K. E. Lochbaum, "Information retrieval using singular value decomposition model of latent semantic structure," in Proceedings of SIGIR, 1988.
[63]
D. Ganguly, J. Leveling, and G. Jones, "United we fall, divided we stand: A study of query segmentation and PRF for patent prior art search," in Proceedings of PaIR, 2011.
[64]
A. Gibbs, "Boolean patent search: Comparative patent search quality/cost evaluation super Boolean vs. legacy Boolean search engines," Technical Report, http://patentcafe.com, 2006.
[65]
M. Giereth, S. Brügmann, A. Stäbler, M. Rotard, and T. Ertl, "Application of semantic technologies for representing patent metadata," in International Workshop on Applications of Semantic Technologies, 2006.
[66]
M. Giereth, S. Koch, Y. Kompatsiaris, S. Papadopoulos, E. Pianta, L. Serafini, and L. Wanner, "A modular framework for ontology-based representation of patent information," in Proceedings of the Conference on Legal Knowledge and Information Systems, 2007.
[67]
M. Giereth, S. Koch, M. Rotard, and T. Ertl, "Web based visual exploration of patent information," in International Conference on Information Visualization, 2007 (IV '07), pp. 150-155, July 2007.
[68]
J. Gobeill, E. Pasche, D. Teodoro, and P. Ruch, "Simple pre and post processing strategies for patent searching in CLEF intellectual property track 2009," in Proceedings of the Cross-language Evaluation Forum Conference on Multilingual Information Access Evaluation: Text Retrieval Experiments, Springer, pp. 444-451, 2009.
[69]
E. Graf, I. Frommholz, M. Lalmas, and K. van Rijsbergen, "Knowledge modeling in prior art search," in Advances in Multidisciplinary Retrieval, Springer, pp. 31-46, 2010.
[70]
T. Grego, P. Pezik, F. M. Couto, and D. Rebholz-Schuhmann, Identification of Chemical Entities in Patent Documents, Vol. 5518 of LNCS. pp. 942-949, Springer, 2009.
[71]
H. Gurulingappa, B. Mueller, M. Hofmann-Apitius, and J. Fluck, "Information retrieval framework for technology survey in biomedical and chemistry literature," in Proceedings of Text Retrieval Conference, 2011.
[72]
H. Gurulingappa, B. Mueller, M. Hofmann-Apitius, R. Klinger, H.-T. Mevissen, C. M. Friedrich, and J. Fluck, "Prior art search in chemistry patents based on semantic concepts and co-citation analysis," in Proceedings of Text Retrieval Conference, 2010.
[73]
H. Gurulingappa, B. Müller, R. Klinger, H.-T. Mevissen, M. Hofmann-Apitius, J. Fluck, and C. Friedrich, "Patent retrieval in chemistry based on semantically tagged named entities," in Proceedings of Text Retrieval Conference, 2009.
[74]
J. Guyot, G. Falquet, and K. Benzineb, "UniGE experiments on prior art search in the field of patents," in Multilingual Information Access Evaluation I. Text Retrieval Experiments, Springer, pp. 502-507, 2010.
[75]
A. Hanbury, N. Bhatti, M. Lupu, and R. Mörzinger, "Patent image retrieval: A survey," in Proceedings of Workshop on Patent Information Retrieval, ACM, pp. 3-8, 2011.
[76]
A. Hanbury, V. Zenz, and H. Berger, "1st international workshop on advances in patent information retrieval (AsPIRe'10)," SIGIR Forum, vol. 44, pp. 19-22, 2010.
[77]
P. Hansen, "The information seeking and retrieval process at the Swedish patent office: Moving from lab-based to real life work-task environment," in Proceedings of Workshop on Patent Retrieval, 2000.
[78]
D. Harman, Information Retrieval Evaluation. Morgan & Claypool Publishers, 2011.
[79]
C. G. Harris, R. Arens, and P. Srinivasan, "Comparison of IPC and USPC classification systems in patent prior art searches," in Proceedings of Workshop on Patent Information Retrieval, ACM, pp. 27-32, 2010.
[80]
C. G. Harris, R. Arens, and P. Srinivasan, "Using classification code hierarchies for patent prior art searches," in Current Challenges in Patent Information Retrieval, (M. Lupu, K. Mayer, J. Tait, and A. J. Trippe, eds.), Springer, pp. 287-304, 2011.
[81]
C. G. Harris, S. Foster, R. Arens, and P. Srinivasan, "On the role of classification in patent invalidity searches," in Proceedings of Workshop on Patent Information Retrieval, pp. 29-32, 2009.
[82]
Z.-L. He and M. Deng, "The evidence of systematic noise in non-patent references: A study of New Zealand companies' patents," Scientometrics, vol. 72, pp. 149-166, 2007.
[83]
B. Herbert, G. Szarvas, and I. Gurevych, "Prior art search using international patent classification codes and all-claims-queries," in Multilingual Information Access Evaluation I. Text Retrieval Experiments, Springer, pp. 452-459, 2010.
[84]
J. D. Holliday and P. Willet, "Representation and searching of chemical-structure information in patents," in Current Challenges in Patent Information Retrieval, (M. Lupu, K. Mayer, J. Tait, and A. J. Trippe, eds.), Springer, 2011.
[85]
V. Hristidis, E. Ruiz, A. Hernández, F. Farfán, and R. Varadarajan, "Patentssearcher: A novel portal to search and explore patents," in Proceedings of Workshop on Patent Information Retrieval, ACM, pp. 33-38, 2010.
[86]
B. Huet, G. Guarascio, N. J. Kern, and B. Mérialdo, "Relational skeletons for retrieval in patent drawings," in Proceedings of International Conference on Image Processing, pp. 737-740, 2001.
[87]
D. Hull, S. Aït-Mokhtar, M. Chuat, A. Eisele, E. Gaussier, G. Grefenstette, P. Isabelle, C. Samuelsson, and F. Segond, "Language technologies and patent search and classification," World Patent Information, vol. 23, pp. 265-268, 2001.
[88]
N. Ide and K. Suderman, "Integrating linguistic resources: The american national corpus model," in Proceedings of International Conference on Language Resources and Evaluation (LREC), 2006.
[89]
H. Itoh, H. Mano, and Y. Ogawa, "Term distillation in patent retrieval," in Proceedings of ACL Workshop on Patent Corpus Processing, 2003.
[90]
M. Iwayama, A. Fuji, and N. Kando, "Overview of classification subtask at NTCIR-6 patent retrieval task," in Proceedings of NII Test Collection for IR Systems-6, 2007.
[91]
M. Iwayama, A. Fujii, and N. Kando, "Overview of classification subtask at NTCIR-5 patent retrieval task," in Proceedings of NII Test Collection for IR Systems-5, Tokyo, Japan, 2005.
[92]
M. Iwayama, A. Fujii, N. Kando, and Y. Marukawa, "An empirical study on retrieval models for different document genres: patents and newspaper articles," in Proceedings of International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 251-258, 2003.
[93]
M. Iwayama, A. Fujii, N. Kando, and A. Takano, "Overview of patent retrieval task at NTCIR-3," in Proceedings of ACL Workshop on Patent Corpus Processing, 2003.
[94]
A. Järvelin, G. Eriksson, P. Hansen, T. Tsikrika, A. G. S. de Herrera, M. Lupu, M. Gäde, V. Petras, S. Rietberger, M. Braschler, and R. Berendsen, "Deliverable 2.2 revised specification of the evaluation tasks," Technical Report, PROMISE Network of Excellence, 2012.
[95]
C. Jochim, C. Lioma, and H. Schütze, "Expanding queries with term and phrase translations in patent retrieval," in Multidisciplinary Information Retrieval, Springer, pp. 16-29, 2011.
[96]
H. Joho and M. Sanderson, "Document frequency and term specificity," in RIAO: Large Scale Semantic Access to Content (Text, Image, Video, and Sound), 2007.
[97]
N. Kando and M.-K. Leong, "Workshop on patent retrieval (SIGIR 2000 workshop report)," SIGIR Forum, vol. 34, pp. 28-30, 2000.
[98]
I.-S. Kang, S.-H. Na, J. Kim, and J.-H. Lee, "Cluster-based patent retrieval," Information Processing and Management, vol. 43, pp. 1173-1182, September 2007.
[99]
N. Kemp and M. Lynch, "Extraction of information from text of chemical patents. 1. identification of specific chemical names," Journal of Chemical Information and Computer Sciences, vol. 38, 1998.
[100]
Y. Kim, J. Seo, and W. B. Croft, "Automatic Boolean query suggestion for professional search," in Proceedings of International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 825-834, 2011.
[101]
K. Kishida, "Pseudo relevance feedback method based on Taylor expansion of retrieval function in NTCIR-3 patent retrieval task," in Proceedings of the ACL Workshop on Patent Corpus Processing, 2003.
[102]
K. Kishida, K.-H. Chen, S. Lee, H.-H. Chen, N. Kando, K. Kuriyama, S. H. Myaeng, and K. Eguchi, "Cross-lingual information retrieval (CLIR) task at the NTCIR workshop 3," SIGIR Forum, vol. 38, pp. 17-20, July 2004.
[103]
I. Klampanos, H. Azzam, and T. Roelleke, "A case for probabilistic logic for scalable patent retrieval," in Proceedings of Workshop on Patent Information Retrieval, 2009.
[104]
R. Klinger, C. Kolá¿ik, J. Fluck, M. Hofmann-Apitius, and C. M. Friedrich, "Detection of iupac and iupac-like chemical names," in Proceedings of International Conference on Intelligent Systems for Molecular Biology (ISMB), 2008.
[105]
S. Koch, H. Bosch, M. Giereth, and T. Ertl, "Iterative integration of visual insights during scalable patent search and analysis," Transactions on Visualization and Computer Graphics, vol. 17, 2011.
[106]
C. H. A. Koster, "Text mining for intellectual property," in Proceedings of Dutch-Belgian Information Retrieval Workshop (DIR), 2010.
[107]
C. H. A. Koster, M. Seutter, and J. Beney, "Multi-classification of patent applications with Winnow," in Perspectives of System Informatics, Springer, pp. 546-555, 2004.
[108]
A. Krishnan, A. F. Cardenas, and D. Springer, "Search for patents using treatment and causal relationships," in Proceedings of Workshop on Patent Information Retrieval, New York, NY, USA, pp. pp. 1-10, 2010.
[109]
J.-C. Lamirel, S. A. Shehabi, M. Hoffmann, and C. Francois, "Intelligent patent analysis through the use of a neural network: Experiment of multi-viewpoint analysis with the multisom model," in Proceedings of the ACL Workshop on Patent Corpus Processing, 2003.
[110]
L. S. Larkey, "A patent search and classification system," in Proceedings of ACM Conference on Digital Libraries, pp. 179-187, 1999.
[111]
A. Leach and V. Gillet, An Introduction to Chemoinformatics. Springer, 2007.
[112]
G. Leech, "100 million words of english: The british national corpus," Language Research, vol. 28, 1992.
[113]
J. Leveling, W. Magdy, and G. J. Jones, "An investigation of decompounding for cross-language patent search," in Proceedings of International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1169-1170, 2011.
[114]
L. Li and C. L. Tan, "Associating figures with descriptions for patent documents," in Proceedings of the IAPR International Workshop on Document Analysis Systems, pp. 385-392, 2010.
[115]
Y. Li and J. Shawe-Taylor, "Advanced learning algorithms for cross-language patent retrieval and classification," Information Processing and Management, vol. 43, pp. 1183-1199, 2007.
[116]
Y.-R. Li, L.-H. Wang, and C.-F. Hong, "Extracting the significant-rare keywords for patent analysis," Expert Systems with Applications, vol. 36, 2009.
[117]
W. A. Lise, "An investigation of terminology and syntax in Japanese and US patents and the implications for the patent translator," http://www.lise.jp/patsur.html, 2011. Last visited: September, 4, 2012.
[118]
J. List, "How drawings could enhance retrieval in mechanical and device patent searching," World Patent Information, vol. 29, pp. 210-218, 2007.
[119]
P. Lopez, "Automatic extraction and resolution of bibliographical references in patent documents," in Advances in Multidisciplinary Retrieval, Springer Berlin/Heidelberg, pp. 120-135, 2010.
[120]
P. Lopez and L. Romary, "Experiments with citation mining and key-term extraction for prior art search," in Cross-Language Evaluation Forum (Notebook Papers/Labs/Workshop), 2010.
[121]
P. Lopez and L. Romary, "Patatras: Retrieval model combination and regression models for prior art search," in Multilingual Information Access Evaluation I. Text Retrieval Experiments, Springer, 2010.
[122]
B. Lu, B. K. Tsou, T. Jiang, O. Y. Kwong, and J. Zhu, "Mining large-scale parallel corpora from multilingual patents: An English-Chinese example and its application to SMT," in Proceedings of the CIPS-SIGHAN Joint Conference on Chinese Language Processing, pp. 79-86, 2010.
[123]
X. Lu, S. Kataria, W. J. Brouwer, J. Z. Wang, P. Mitra, and C. L. Giles, "Automated analysis of images in documents for intelligent document search," International Journal on Document Analysis and Recognition, vol. 12, pp. 65-81, 2009.
[124]
M. Lupu, "The status of retrieval evaluation in the patent domain," in Proceedings of Workshop on Patent Information Retrieval, 2011.
[125]
M. Lupu, "Patolympics -- an infrastructure for interactive evaluation of patent retrieval tools," in Proceedings of CIKM Workshop on Data infrastructures for Supporting Information Retrieval Evaluation (DESIRE), 2011.
[126]
M. Lupu, Z. Jiashu, J. Huang, H. Gurulingappa, I. Filipov, and J. Tait, "Overview of the trec 2011 chemical ir track," in Proceedings of Text Retrieval Conference, 2011.
[127]
M. Lupu, K. Mayer, J. Tait, and A. J. Trippe, eds., Current Challenges in Patent Information Retrieval. Information Retrieval Series. Springer, 2011.
[128]
M. Lupu, F. Piroi, and A. Hanbury, "Aspects and analysis of patent test collections," in Proceedings of Workshop on Patent Information Retrieval, 2010.
[129]
M. Lupu, F. Piroi, J. Huang, J. Zhu, and J. Tait, "Overview of the trec chemical ir track," in Proceedings of Text Retrieval Conference, 2009.
[130]
M. Lupu, R. Schuster, R. Mörzinger, F. Piroi, T. Schleser, and A. Hanbury, "Patent images -- a glass-encased tool: Opening the case," in Proceedings of International Conference on Knowledge Management and Knowledge Technologies, pp. 16:1-16:8, 2012.
[131]
W. Magdy and G. Jones, "A study on query expansion methods for patent retrieval," in Proceedings of Workshop on Patent Information Retrieval, 2011.
[132]
W. Magdy and G. J. Jones, "Pres: A score metric for evaluating recall-oriented information retrieval applications," in Proceedings of International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 611-618, 2010.
[133]
W. Magdy and G. J. F. Jones, "Examining the robustness of evaluation metrics for patent retrieval with incomplete relevance judgements," in Proceedings of the 2010 International Conference on Multilingual and Multimodal Information Access Evaluation: Cross-language Evaluation Forum, Berlin, Heidelberg, pp. 82-93, 2010.
[134]
W. Magdy, J. Leveling, and G. J. F. Jones, "Exploring structured documents and query formulation techniques for patent retrieval," in Proceedings of the Cross-language Evaluation Forum Conference on Multilingual Information Access Evaluation: Text Retrieval Experiments, Springer-Verlag, pp. 410-417, 2009.
[135]
P. Mahdabi, L. Andersson, A. Hanbury, and F. Crestani, "Report on the CLEF-IP 2011 experiments: Exploring patent summarization," in Cross-Language Evaluation Forum (Notebook Papers/LABs/Workshops), 2011.
[136]
P. Mahdabi, M. Keikha, S. Gerani, M. Landoni, and F. Crestani, "Building queries for prior-art search," in Multidisciplinary Information Retrieval, Springer-Verlag, pp. 3-15, 2011.
[137]
F. Mahmoudi, J. Shanbehzadeh, A.-M. Eftekhari-Moghadam, and H. Soltanian-Zadeh, "Image retrieval based on shape similarity by edge orientation autocorrelogram," Pattern Recognition, vol. 36, pp. 1725-1736, 2003.
[138]
L. Molà, The Silk Industry of Renaissance Venice. JHU Press, 2000.
[139]
A. Moldovan, R. I. Bot, and G. Wanka, "Latent semantic indexing for patent documents," International Journal of Applied Mathematics and Computer Science, vol. 15, 2005.
[140]
G. Moradei and P. C. Contessini, "An evaluation of some semantic tools for simple patent searching," in Proceedings of Patent Information Conference, 2011.
[141]
N. Morey, "Global business solutions for patent prosecution," in Proceedings of the Symposium of Intellectual Property Authorities, WIPO, 2011.
[142]
R. Mörzinger, A. Horti, G. Thallinger, N. Bhatti, and A. Hanbury, "Classifying patent images," in Cross-Language Evaluation Forum (Notebook papers/ Labs/Workshop), 2011.
[143]
S. Mukherjea and B. Bamba, "Biopatentminer: An information retrieval system for biomedical patents," in VLDB '04: Proceedings of International Conference on Very Large Data Bases, VLDB Endowment, pp. 1066-1077, 2004.
[144]
B. Müller, R. Klinger, H. Gurulingappa, H.-T. Mevissen, M. Hofmann-Apitius, J. Fluck, and C. Friedrich, "Abstracts versus full texts and patents: A quantitative analysis of biomedical entities," in Advances in Multidisciplinary Retrieval, Springer, pp. 152-165, 2010.
[145]
H. Nanba, A. Fujii, M. Iwayama, and T. Hashimoto, "Overview of the patent mining task at the NTCIR-7 workshop," in Proceedings of NII Test Collection for IR Systems-7, 2008.
[146]
H. Nanba, A. Fujii, M. Iwayama, and T. Hashimoto, "Overview of the patent retrieval task at the NTCIR-8 workshop," in Proceedings of NII Test Collection for IR Systems-8, 2010.
[147]
H. Nanba, H. Kamaya, T. Takezawa, M. Okumura, A. Shinmori, and H. Tanigawa, "Automatic translation of scholarly terms into patent terms," in Current Challenges in Patent Information Retrieval, (M. Lupu, K. Mayer, J. Tait, and A. J. Trippe, eds.), Springer, pp. 373-388, 2011.
[148]
H. Nanba, S. Mayumi, and T. Takezawa, "Automatic construction of a bilingual thesaurus using citation analysis," in Proceedings of Workshop on Patent Information Retrieval, ACM, pp. 25-30, 2011.
[149]
J.-Y. Nie, Cross-Language Information Retrieval. Morgan & Claypool Publishers, 2010.
[150]
N. Oostdijk, E. D'hondt, H. van Halteren, and S. Verberne, "Genre and domain in patent texts," in Proceedings of Workshop on Patent Information Retrieval, ACM, pp. 39-46, 2010.
[151]
N. Oostdijk, S. Verberne, and C. Koster, "Constructing a broad-coverage lexicon for text mining in the patent domain," in Proceedings of International Conference on Language Resources and Evaluation (LREC), 2010.
[152]
M. Osborn, T. Strzalkowski, and M. Marinescu, "Evaluating document retrieval in patent database: A preliminary report," in Proceedings of Conference on Information and Knowledge Management, pp. 216-221, 1997.
[153]
P. Parapatics and M. Dittenbach, "Patent claim decomposition for improved information extraction," in Proceedings of Workshop on Patent Information Retrieval, ACM, pp. 33-36, 2009.
[154]
J. Perez-Iglesias, A. Rodrigo, and V. Fresno, "Using bm25f and kld for patent retrieval," in Cross-Language Evaluation Forum (Notebook Papers/ LABs/Workshops), 2010.
[155]
F. Perronnin and C. Dance, "Fisher kernels on visual vocabularies for image categorization," in IEEE Conference on Computer Vision and Pattern Recognition, 2007 (CVPR '07), pp. 1-8, 2007.
[156]
F. Piroi, M. Lupu, A. Hanbury, A. Sexton, W. Magdy, and I. Filippov, "Clef-ip 2012: Retrieval experiments in the intellectual property domain," in Cross-Language Evaluation Forum (Notebook papers/Labs/Workshop), 2012.
[157]
F. Piroi, M. Lupu, A. Hanbury, and V. Zenz, "Clef-ip 2011: Retrieval in the intellectual property domain," in Cross-Language Evaluation Forum (Notebook Papers/Labs/Workshop), 2011.
[158]
F. Piroi and J. Tait, "Clef-ip 2010: Retrieval experiments in the intellectual property domain," in Cross-Language Evaluation Forum (Notebook papers/Labs/Workshop), 2010.
[159]
S. Robertson and H. Zaragoza, "The probabilistic relevance framework: BM25 and beyond," Foundations and Trends in Information Retrieval, vol. 3, 2009.
[160]
S. E. Robertson, "The probability ranking principle in ir," Journal of Documentation, vol. 33, 1977.
[161]
G. Roda, J. Tait, F. Piroi, and V. Zenz, "Clef-ip 2009: Retrieval experiments in the intellectual property domain," in Multilingual Information Access Evaluation I. Text Retrieval Experiments, Springer, pp. 385-409, 2010.
[162]
J. Rousu, C. Saunders, S. Szedmák, and J. Shawe-Taylor, "Kernel-based learning of hierarchical multilabel classification models," Journal of Machine Learning Research, vol. 7, pp. 1601-1626, 2006.
[163]
J. Ryley, "Latent semantic indexing for patent information," http://cogprints. org/5710/1/ryley.html, 2007. Last accessed: September, 4, 2012.
[164]
M. Sahlgren, P. Hansen, and J. Karlgren, "English-japanese cross-lingual query expansion using random indexing of aligned bilingual text data," in Proceedings of NII Test Collection for IR Systems, 2002.
[165]
G. Salton and C. Buckley, "Term-weighting approaches in automatic text retrieval," Information Processing and Management, vol. 24, pp. 513-523, 1988.
[166]
M. Sanderson, "Test collection based evaluation of information retrieval systems," Foundations and Trends in Information Retrieval, vol. 4, 2010.
[167]
M. Sanderson, M. L. Paramita, P. Clough, and E. Kanoulas, "Do user preferences and evaluation measures line up?," in Proceedings of International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010.
[168]
I. Schellner, "Japanese File Index classification and f-terms," World Patent Information, vol. 24, pp. 197-201, 2002.
[169]
M. W. Seeger, "Cross-validation optimization for large scale hierarchical classification kernel methods," in Proceedings of Neural Information Processing Systems (NIPS), pp. 1233-1240, 2007.
[170]
S. Sheremetyeva, "Natural language analysis of patent claims," in Proceedings of ACL Workshop on Patent Corpus Processing, 2003.
[171]
S. Sheremetyeva and S. Nirenburg, "Knowledge elecitation for authoring patent claims," IEEE Computer, vol. 29, pp. 57-63, 1996.
[172]
S. Sheremetyeva, S. Nirenburg, and I. Nirenburg, "Generating patent claims from interactive input," in Proceedings of International Workshop on Natural Language Generation (INLG'96), pp. 61-70, 1996.
[173]
A. Shinmori, M. Okumura, Y. Marukawa, and M. Iwayama, "Patent claim processing for readability -- structure analysis and term explanation," in Proceedings of The ACL Workshop on Patent Corpus Processing, 2003.
[174]
P. Sidiropoulos, S. Vrochidis, and I. Kompatsiaris, "Content-based binary image retrieval using the adaptive hierarchical density histogram," Pattern Recognition, vol. 44, pp. 739-750, 2011.
[175]
A. Singh, S. Hallihosur, and L. Rangan, "Changing landscape in biotechnology patenting," World Patent Information, vol. 31, pp. 219-225, 2009.
[176]
A. Smeaton, "Nlp and ir," in European Summer School on Information Retrieval, 1995.
[177]
A. W. M. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain, "Content-based image retrieval at the end of the early years," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 22, pp. 1349-1380, 2000.
[178]
H. Smith, "Automation of patent classification," World Patent Information, vol. 24, pp. 269-271, 2002.
[179]
S. Taduri, G. T. Lau, K. H. Law, H. Yu, and J. P. Kesan, "Developing an ontology for the U.S. patent system," in Proceedings of Annual International Digital Government Research Conference: Digital Government Innovation in Challenging Times, pp. 157-166, 2011.
[180]
S. Taduri, G. T. Lau, K. H. Law, H. Yu, and J. P. Kesan, "An ontology-based interactive tool to search documents in the u.s. patent system," in Proceedings of the Annual International Digital Government Research Conference: Digital Government Innovation in Challenging Times, New York, NY, USA, pp. 329-330, 2011.
[181]
S. Taduri, H. Yu, G. Lau, K. Law, and J. Kesan, "Developing a comprehensive patent related information retrieval tool," Journal of Theoretical and Applied Electronic Commerce Research, vol. 6, pp. 1-16, August 2011.
[182]
T. Takaki, A. Fuji, and T. Ishikawa, "Associative document retrieval by query subtopic analysis and its application to invalidity patent search," in Proceedings of Conference on Information and Knowledge Management, 2004.
[183]
T. Tang, N. Craswell, D. Hawking, K. Griffiths, and H. Christensen, "Quality and relevance of domain-specific search: A case study in mental health," Information Retrieval, vol. 9, pp. 207-225, 2006.
[184]
J. Tangelder and R. Veltkamp, "A survey of content based 3D shape retrieval methods," Multimedia Tools and Applications, vol. 39, pp. 441-471, 2008.
[185]
W. Thielemann, "Ocr errors in patent full-text documents," in Proceedings of the IRF Symposium, 2007.
[186]
D. Tikk, G. Biró, and A. Törcsvári, "A hierarchical online classifier for patent categorization," in Emerging Technologies of Text Mining: Techniques and Applications, (H. A. do Prado and E. Ferneda, eds.), IGI Global, pp. 244-267, 2007.
[187]
A. Tiwari and V. Bansal, "Patseek: Content based image retrieval system for patent database," in Proceedings of International Conference on Electronic Business (ICEB), pp. 1167-1171, 2004.
[188]
K. Tombre, "Analysis of engineering drawings: State of the art and challenges," in Graphics Recognition -- Algorithms and Systems, Springer, pp. 257-264, 1998.
[189]
A. Trippe and I. Ruthven, "Evaluating real patent retrieval effectiveness," in Current Challenges in Patent Information Retrieval, (M. Lupu, K. Mayer, J. Tait, and A. J. Trippe, eds.), Springer, 2011.
[190]
Y.-H. Tseng, C.-J. Lin, and Y.-I. Lin, "Text mining techniques for patent analysis," Information Processing and Management, vol. 43, pp. 1216-1247, September 2007.
[191]
Y.-H. Tseng and Y.-J. Wu, "A study of search tactics for patentability search: A case study on patent engineers," in Proceedings of Workshop on Patent Information Retrieval, pp. 33-36, 2008.
[192]
I. Tsochantaridis, T. Hofmann, T. Joachims, and Y. Altun, "Support vector machine learning for interdependent and structured output spaces," in Proceedings of International Conference on Machine Learning, pp. 104-111, I. 2004.
[193]
J. Urbain and O. Frieder, "Trec chemical ir track 2009: A distributed dimensional indexing model for chemical patent search," in Proceedings of Text Retrieval Conference, 2009.
[194]
USPTO, "Acceptance, processing, use and dissemination of chemical and three-dimensional biological structural data in electronic format," Biotechnology Law Report, vol. 24, pp. 638-644, 2005.
[195]
A. T. Valko and A. P. Johnson, "Clide pro: The latest generation of CLiDE, a tool for optical chemical structure recognition," Journal of Chemical Information and Modeling, vol. 49, pp. 780-787, 2009.
[196]
S. Verberne and E. D'hondt, "Prior art retrieval using the claims section as a bag of words," in Proceedings of the Cross-language Evaluation Forum Conference on Multilingual Information Access Evaluation: Text Retrieval Experiments, Springer, pp. 497-501, 2009.
[197]
S. Verberne, E. D'hondt, N. Oostdijk, and C. Koster, "Quantifying the challenges in parsing patent claims," in Proceedings of Workshop on Advances in Patent Information Retrieval (AsPIRe), 2010.
[198]
S. Verberne, M. Vogel, and E. D'hondt, "Patent classification experiments with the linguistic classification system lcs," in Working Notes of the CLEF Labs, 2010.
[199]
M. Verma and V. Varma, "Applying key phrase extraction to aid invalidity search," in Proceedings of International Conference on Artificial Intelligence and Law, pp. 249-255, 2011.
[200]
S. V. Vishwanathan, N. N. Schraudolph, and A. J. Smola, "Step size adaptation in reproducing kernel Hilbert space," Journal of Machine Learning Research, vol. 7, pp. 1107-1133, 2006.
[201]
S. Vrochidis, S. Papadopoulos, A. Moumtzidou, P. Sidiropoulos, E. Pianta, and I. Kompatsiaris, "Towards content-based patent image retrieval: A framework perspective," World Patent Information, vol. 32, pp. 94-106, 2010.
[202]
M. Z. Wanagiri and M. Adriani, "Prior art retrieval using various patent document fields contents," in Cross-Language Evaluation Forum (Notebook Papers/LABs/Workshops), 2010.
[203]
X. Wang, X. Zhang, and S. Xu, "Patent co-citation networks of fortune 500 companies," Scientometrics, vol. 88, pp. 761-770, 2011.
[204]
L. Wanner, R. Baeza-Yates, S. Brügmann, J. Codina, B. Diallo, E. Escorsa, M. Giereth, Y. Kompatsiaris, S. Papadopoulos, E. Pianta, G. Piella, I. Puhlmann, G. Rao, M. Rotard, P. Schoester, L. Serafini, and V. Zervaki, "Towards content-oriented patent document processing," World Patent Information, vol. 30, pp. 21-33, 2008.
[205]
H. Wongel, "The reform of the IPC -- consequences for the users," World Patent Information, vol. 27, pp. 227-231, 2005.
[206]
World Intellectual Property Organisation, "Glossary of terms concerning industrial property information and documentation, appendix iii to part 10, of the wipo handbook on industrial property information and documentation," I. WIPO Publication No. CD208, 2003.
[207]
World Intellectual Property Organisation, "Wipo handbook on intellectual property, chapter 5," http://www.wipo.int/export/sites/www/about-ip/en/iprm/pdf/ch5.pdf, last visited August, 2012.
[208]
X. Xue and W. B. Croft, "Automatic query generation for patent search," in Proceedings of Conference on Information and Knowledge Management, pp. 2037-2040, 2009.
[209]
X. Xue and W. B. Croft, "Transforming patents into prior-art queries," in Proceedings of International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 808-809, 2009.
[210]
M. Yang, G. Qiu, J. Huang, and D. Elliman, "Near-duplicate image recognition and content-based image retrieval using adaptive hierarchical geometric centroids," in International Conference on Pattern Recognition, 2006 (ICPR 2006), pp. 958-961, 2006.
[211]
S.-Y. Yang and V.-W. Soo, "Comparing the conceptual graphs extracted from patent claims," in IEEE International Conference on Sensor Networks, Ubiquitous and Trustworthy Computing (SUTC), pp. 394-399, 2008.
[212]
Y. Yang, L. Akers, T. Klose, and C. Barcelonyang, "Text mining and visualization tools -- impressions of emerging capabilities," World Patent Information, vol. 30, pp. 280-293, December 2008.
[213]
F. Zaccá and M. Krier, "Automatic categorisation applications at the European Patent Office," World Patent Information, vol. 24, pp. 187-196, 2002.
[214]
V. Zenz, S. Wurzer, M. Dittenbach, and E. Ambrosi, "On the effects of indexing and retrieval models in patent search and the potential of result set merging," in Proceedings of Workshop on Advances in Patent Information Retrieval (AsPIRe), 2010.
[215]
L. Zhao and J. Callan, "Formulating simple structured queries using temporal and distributional cues in patents," in Proceedings of Text Retrieval Conference, 2009.
[216]
L. Zhao and J. Callan, "How to make manual conjunctive normal form queries work in patents search," in Proceedings of Text Retrieval Conference, 2011.
[217]
Z. Zhiyuan, Z. Juan, and X. Bin, "An outward-appearance patent-image retrieval approach based on the contour-description matrix," in Proceedings of Japan-China Joint Workshop on Frontier of Computer Science and Technology (FCST), IEEE Computer Society, pp. 86-89, 2007.
[218]
Y. Zhou and C. L. Tan, "Chart analysis and recognition in document images," in Proceedings of International Conference on Document Analysis and Recognition (ICDAR), pp. 1055-1058, 2001.
[219]
G. Zhu, X. Yu, Y. Li, and D. Doermann, "Learning visual shape lexicon for document image content recognition," in Proceedings of European Conference on Computer Vision (ECCV), Springer, pp. 745-758, 2008.

Cited By

View all
  • (2024)Ranked List Truncation for Large Language Model-based Re-RankingProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657864(141-151)Online publication date: 10-Jul-2024
  • (2022)MtCutProceedings of the Fifteenth ACM International Conference on Web Search and Data Mining10.1145/3488560.3498466(1054-1062)Online publication date: 11-Feb-2022
  • (2022)Summarization, simplification, and generationExpert Systems with Applications: An International Journal10.1016/j.eswa.2022.117627205:COnline publication date: 1-Nov-2022
  • Show More Cited By

Index Terms

  1. Patent Retrieval
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Foundations and Trends in Information Retrieval
    Foundations and Trends in Information Retrieval  Volume 7, Issue 1
    February 2013
    99 pages
    ISSN:1554-0669
    EISSN:1554-0677
    Issue’s Table of Contents

    Publisher

    Now Publishers Inc.

    Hanover, MA, United States

    Publication History

    Published: 20 February 2013

    Author Tags

    1. Databases/Information Retrieval
    2. Machine Learning
    3. Ra
    4. Rd

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 13 Sep 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Ranked List Truncation for Large Language Model-based Re-RankingProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657864(141-151)Online publication date: 10-Jul-2024
    • (2022)MtCutProceedings of the Fifteenth ACM International Conference on Web Search and Data Mining10.1145/3488560.3498466(1054-1062)Online publication date: 11-Feb-2022
    • (2022)Summarization, simplification, and generationExpert Systems with Applications: An International Journal10.1016/j.eswa.2022.117627205:COnline publication date: 1-Nov-2022
    • (2022)PatentNet: multi-label classification of patent documents using deep learning based language understandingScientometrics10.1007/s11192-021-04179-4127:1(207-231)Online publication date: 1-Jan-2022
    • (2022)End to End Neural Retrieval for Patent Prior Art SearchAdvances in Information Retrieval10.1007/978-3-030-99739-7_66(537-544)Online publication date: 10-Apr-2022
    • (2019)Enriching Word Embeddings for Patent Retrieval with Global ContextAdvances in Information Retrieval10.1007/978-3-030-15712-8_57(810-818)Online publication date: 14-Apr-2019
    • (2018)User Interactions with Search SystemsProceedings of the 2018 ACM SIGIR International Conference on Theory of Information Retrieval10.1145/3234944.3234981(3-4)Online publication date: 10-Sep-2018
    • (2018)PatSearchKnowledge and Information Systems10.1007/s10115-017-1127-057:1(135-158)Online publication date: 1-Oct-2018
    • (2017)Information-analytical support of priority fields in science and technologyScientific and Technical Information Processing10.3103/S014768821702008344:2(119-124)Online publication date: 1-Apr-2017
    • (2017)Reply WithProceedings of the 2017 ACM on Conference on Information and Knowledge Management10.1145/3132847.3132979(327-336)Online publication date: 6-Nov-2017
    • Show More Cited By

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media