Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
Skip header Section
Lucene in Action (In Action series)December 2004
Publisher:
  • Manning Publications Co.
  • 3 Lewis Street Greenwich, CT
  • United States
ISBN:978-1-932394-28-3
Published:01 December 2004
Skip Bibliometrics Section
Reflects downloads up to 24 Jan 2025Bibliometrics
Abstract

No abstract available.

Cited By

  1. Rodrigues E, Paiva D and Júnior Á (2021). Recipe analysis for knowledge discovery of gastronomic dishes, Knowledge and Information Systems, 63:8, (2075-2108), Online publication date: 1-Aug-2021.
  2. Cooper N, Bernal-Cárdenas C, Chaparro O, Moran K and Poshyvanyk D It Takes Two to Tango Proceedings of the 43rd International Conference on Software Engineering, (957-969)
  3. Khalife S, Liberti L and Vazirgiannis M Geometry and Analogies: A Study and Propagation Method for Word Representations Statistical Language and Speech Processing, (100-111)
  4. Chaparro O, Florez J and Marcus A (2019). Using bug descriptions to reformulate queries during text-retrieval-based bug localization, Empirical Software Engineering, 24:5, (2947-3007), Online publication date: 1-Oct-2019.
  5. Agarwal P, Ramanath M and Shroff G Retrieving Relationships from a Knowledge Graph for Question Answering Advances in Information Retrieval, (35-50)
  6. Wimmer H and Yoon V (2017). Counterfeit product detection, Decision Support Systems, 104:C, (1-12), Online publication date: 1-Dec-2017.
  7. Borg M, Wnuk K, Regnell B and Runeson P (2017). Supporting Change Impact Analysis Using a Recommendation System: An Industrial Case Study in a Safety-Critical Context, IEEE Transactions on Software Engineering, 43:7, (675-700), Online publication date: 1-Jul-2017.
  8. Sanchez-Pi N, Martí L and Bicharra Garcia A (2016). Improving ontology-based text classification, Journal of Applied Logic, 17:C, (48-58), Online publication date: 1-Sep-2016.
  9. ACM
    Venkataraman G, Lad A, Ha-Thuc V and Arya D Instant Search Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, (1211-1214)
  10. ACM
    Chaparro O and Marcus A On the reduction of verbose queries in text retrieval based software maintenance Proceedings of the 38th International Conference on Software Engineering Companion, (716-718)
  11. Roberts K, Simpson M, Demner-Fushman D, Voorhees E and Hersh W (2016). State-of-the-art in biomedical literature retrieval for clinical cases: a survey of the TREC 2014 CDS track, Information Retrieval, 19:1-2, (113-148), Online publication date: 1-Apr-2016.
  12. Wang Y, Li Y, Pi N and Lu J Crawling Ranked Deep Web Data Sources Proceedings, Part I, of the 16th International Conference on Web Information Systems Engineering --- WISE 2015 - Volume 9418, (384-398)
  13. ACM
    Jiang L, Yu S, Meng D, Yang Y, Mitamura T and Hauptmann A Fast and Accurate Content-based Semantic Search in 100M Internet Videos Proceedings of the 23rd ACM international conference on Multimedia, (49-58)
  14. ACM
    Wang H, Liu A, Wang J, Ziebart B, Yu C and Shen W Context Retrieval for Web Tables Proceedings of the 2015 International Conference on The Theory of Information Retrieval, (251-260)
  15. ACM
    Yun J, He Y, Elnikety S and Ren S Optimal Aggregation Policy for Reducing Tail Latency of Web Search Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, (63-72)
  16. ACM
    Wimmer H and Yoon V Leveraging Technology to Improve Intent to Purchase Proceedings of the 17th International Conference on Electronic Commerce 2015, (1-5)
  17. Chuanxue Wen and Junfei Zhang (2015). Design of a Microlecture Mobile Learning System Based on Smartphone and Web Platforms, IEEE Transactions on Education, 58:3, (203-207), Online publication date: 1-Aug-2015.
  18. Petz G, Karpowicz M, Fürschuíß H, Auinger A, Stříteský V and Holzinger A (2015). Reprint of, Information Processing and Management: an International Journal, 51:4, (510-519), Online publication date: 1-Jul-2015.
  19. Petz G, Karpowicz M, Fürschuß H, Auinger A, Stříteský V and Holzinger A (2014). Computational approaches for mining user's opinions on the Web 2.0, Information Processing and Management: an International Journal, 50:6, (899-908), Online publication date: 1-Nov-2014.
  20. ACM
    Sajnani H and Lopes C Probabilistic component identification Proceedings of the 7th India Software Engineering Conference, (1-10)
  21. ACM
    Sarwar S, Abedin M, Ullah A and Al Mamun A Personalized Query Expansion for Web Search Using Social Keywords Proceedings of International Conference on Information Integration and Web-based Applications & Services, (610-614)
  22. ACM
    Roy S and Zeng W (2014). Cognitive canonicalization of natural language queries using semantic strata, ACM Transactions on Speech and Language Processing , 10:4, (1-30), Online publication date: 1-Dec-2013.
  23. Tian L, Zhang W, Bikakis A, Wang H, Yu Y, Ni Y and Cao F MeDetect Proceedings of the 2013 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Volume 01, (233-240)
  24. ACM
    Lux M LIRE Proceedings of the 21st ACM international conference on Multimedia, (843-846)
  25. ACM
    Saaya Z, Rafter R, Schaal M and Smyth B The curated web Proceedings of the 7th ACM conference on Recommender systems, (101-104)
  26. ACM
    Gollub T, Hagen M, Michel M and Stein B From keywords to keyqueries Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval, (981-984)
  27. ACM
    Gomes D, Costa M, Cruz D, Miranda J and Fontes S Creating a billion-scale searchable web archive Proceedings of the 22nd International Conference on World Wide Web, (1059-1066)
  28. ACM
    Costa M, Gomes D, Couto F and Silva M A survey of web archive search architectures Proceedings of the 22nd International Conference on World Wide Web, (1045-1050)
  29. ACM
    Bayati M, Gleich D, Saberi A and Wang Y (2013). Message-Passing Algorithms for Sparse Network Alignment, ACM Transactions on Knowledge Discovery from Data, 7:1, (1-31), Online publication date: 1-Mar-2013.
  30. Liu S and Chen Y (2012). MalPEFinder: fast and retrospective assessment of data breaches in malware attacks, Security and Communication Networks, 5:8, (899-915), Online publication date: 1-Aug-2012.
  31. ACM
    Hefeeda M, Gao F and Abd-Almageed W Distributed approximate spectral clustering for large-scale datasets Proceedings of the 21st international symposium on High-Performance Parallel and Distributed Computing, (223-234)
  32. Moraleda J (2012). Large scalability in document image matching using text retrieval, Pattern Recognition Letters, 33:7, (863-871), Online publication date: 1-May-2012.
  33. Turchi M, De Bie T and Cristianini N (2012). An intelligent Web agent that autonomously learns how to translate, Web Intelligence and Agent Systems, 10:2, (165-178), Online publication date: 1-Apr-2012.
  34. Costa E, Ferreira R, Brito P, Bittencourt I, Holanda O, Machado A and Marinho T (2012). A framework for building web mining applications in the world of blogs, Expert Systems with Applications: An International Journal, 39:5, (4813-4834), Online publication date: 1-Apr-2012.
  35. ACM
    Ferreira R, Lima R, Melo J, Costa E, Freitas F and Pacca H RetriBlog Proceedings of the 27th Annual ACM Symposium on Applied Computing, (696-701)
  36. Bangalore S Thinking outside the box for natural language processing Proceedings of the 13th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I, (1-16)
  37. Wang Y, Lu J, Liang J, Chen J and Liu J (2012). Selecting queries from sample to crawl deep web data sources, Web Intelligence and Agent Systems, 10:1, (75-88), Online publication date: 1-Jan-2012.
  38. ACM
    Agrawal R, Gollapudi S, Kannan A and Kenthapadi K Enriching textbooks with images Proceedings of the 20th ACM international conference on Information and knowledge management, (1847-1856)
  39. ACM
    Prifti T, Banerjee S and Cukic B Detecting bug duplicate reports through local references Proceedings of the 7th International Conference on Predictive Models in Software Engineering, (1-9)
  40. Abbott A and Watson I Ontology-Aided product classification Proceedings of the 19th international conference on Case-Based Reasoning Research and Development, (348-362)
  41. Saaya Z, Smyth B, Coyle M and Briggs P Recommending case bases Proceedings of the 19th international conference on Case-Based Reasoning Research and Development, (274-288)
  42. Saaya Z, Smyth B, Coyle M and Briggs P Recognising and recommending context in social web search Proceedings of the 19th international conference on User modeling, adaption, and personalization, (293-304)
  43. Mishra T and Bangalore S (2011). Finite-state models for speech-based search on mobile devices, Natural Language Engineering, 17:2, (243-264), Online publication date: 1-Apr-2011.
  44. Roy S, Mak M and Wan K Wikipedia based news video topic modeling for information extraction Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part II, (411-420)
  45. Ramampiaro H and Li C Supporting biomedical information retrieval Transactions on large-scale data- and knowledge-centered systems IV, (73-94)
  46. ACM
    Di Buccio E, Montecchio N and Orio N FALCON Proceedings of the 18th ACM international conference on Multimedia, (1477-1480)
  47. ACM
    Di Buccio E, Montecchio N and Orio N A scalable cover identification engine Proceedings of the 18th ACM international conference on Multimedia, (1143-1146)
  48. Ramampiaro H Biomedical information retrieval Proceedings of the First international conference on Information technology in bio- and medical informatics, (143-157)
  49. Mishra T and Bangalore S Speech-driven access to the deep web on mobile devices Proceedings of the ACL 2010 System Demonstrations, (60-65)
  50. Cadenhead T, Kantarcioglu M and Thuraisingham B Scalable and efficient reasoning for enforcing role-based access control Proceedings of the 24th annual IFIP WG 11.3 working conference on Data and applications security and privacy, (209-224)
  51. Mishra T and Bangalore S Qme! Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, (55-63)
  52. Müller B, Klinger R, Gurulingappa H, Mevissen H, Hofmann-Apitius M, Fluck J and Friedrich C Abstracts versus full texts and patents Proceedings of the First international Information Retrieval Facility conference on Adbances in Multidisciplinary Retrieval, (152-165)
  53. Jabeur L, Tamine L and Boughanem M A social model for literature access Adaptivity, Personalization and Fusion of Heterogeneous Information, (32-39)
  54. Grappy A and Grau B Answer type validation in question answering systems Adaptivity, Personalization and Fusion of Heterogeneous Information, (9-15)
  55. ACM
    Liang J, Dhillon N and Koperski K A large-scale system for annotating and querying quotations in news feeds Proceedings of the 3rd International Semantic Search Workshop, (1-5)
  56. ACM
    Lin H, Rushing J, Berendes T, Stein C and Graves S Visualizations for the spyglass ontology-based information analysis and retrieval system Proceedings of the 48th annual ACM Southeast Conference, (1-6)
  57. Hung B, Otsubo M, Hijikata Y and Nishida S (2010). HITS algorithm improvement using semantic text portion, Web Intelligence and Agent Systems, 8:2, (149-164), Online publication date: 1-Apr-2010.
  58. ACM
    Fautsch C and Savoy J Adapting the tf idf vector-space model to domain specific information retrieval Proceedings of the 2010 ACM Symposium on Applied Computing, (1708-1712)
  59. Hassouna A and Tahvildari L (2010). An effort prediction framework for software defect correction, Information and Software Technology, 52:2, (197-209), Online publication date: 1-Feb-2010.
  60. Bjelland J, Burgess M, Canright G and Engø-Monsen K (2010). Eigenvectors of directed graphs and importance scores, Data Mining and Knowledge Discovery, 20:1, (98-151), Online publication date: 1-Jan-2010.
  61. ACM
    Xu W, Huang L, Fox A, Patterson D and Jordan M Detecting large-scale system problems by mining console logs Proceedings of the ACM SIGOPS 22nd symposium on Operating systems principles, (117-132)
  62. Raaijmakers S, Versloot C and De Wit J A cocktail approach to the VideoCLEF'09 linking task Proceedings of the 10th international conference on Cross-language evaluation forum: multimedia experiments, (401-408)
  63. Turchi M, Bie T and Cristianini N An Intelligent Agent That Autonomously Learns How to Translate Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 02, (12-19)
  64. Mishra T and Bangalore S Tightly coupling speech recognition and search Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers, (281-284)
  65. Roman N, Ferreira C, Meira L, Rezende R, Digiampietri L and Filho J Attribute-value specification in customs fraud detection Proceedings of the 10th Annual International Conference on Digital Government Research: Social Networks: Making Connections between Citizens, Data and Government, (264-271)
  66. ACM
    Martinez-Romo J and Araujo L Web spam identification through language model analysis Proceedings of the 5th International Workshop on Adversarial Information Retrieval on the Web, (21-28)
  67. Silva J (2009). Information Filtering and Information Retrieval with the Web Filtering Toolbar, Electronic Notes in Theoretical Computer Science (ENTCS), 235, (125-136), Online publication date: 1-Apr-2009.
  68. Feng J and Bangalore S Effects of word confusion networks on voice search Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics, (238-245)
  69. ACM
    Hijikata Y, Hung B, Otsubo M and Nishida S HITS algorithm improvement using anchor-related text extracted by DOM structure analysis Proceedings of the 2009 ACM symposium on Applied Computing, (1691-1698)
  70. Lu J, Wang Y, Liang J, Chen J and Liu J An Approach to Deep Web Crawling by Sampling Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01, (718-724)
  71. ACM
    Forman G and Kirshenbaum E Extremely fast text feature extraction for classification and indexing Proceedings of the 17th ACM conference on Information and knowledge management, (1221-1230)
  72. Santos D, Cardoso N, Carvalho P, Dornescu I, Hartrumpf S, Leveling J and Skalban Y GikiP at GeoCLEF 2008 Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access, (894-905)
  73. El Demerdash O, Kosseim L and Bergler S Image retrieval by inter-media fusion and pseudo-relevance feedback Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access, (605-611)
  74. ACM
    Forman G and Rajaram S Scaling up text classification for large file systems Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, (239-246)
  75. Libbrecht P, Desmoulins C, Mercat C, Laborde C, Dietrich M and Hendriks M Cross-Curriculum Search for Intergeo Proceedings of the 9th AISC international conference, the 15th Calculemas symposium, and the 7th international MKM conference on Intelligent Computer Mathematics, (520-535)
  76. Kaisser M The QuALiM question answering demo Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Demo Session, (32-35)
  77. Thai V, Handschuh S and Decker S IVEA Proceedings of the 5th European semantic web conference on The semantic web: research and applications, (139-153)
  78. ACM
    Dolog P, Simon B, Nejdl W and Klobučar T (2008). Personalizing access to learning networks, ACM Transactions on Internet Technology, 8:2, (1-21), Online publication date: 1-Feb-2008.
  79. Kim Y, Jung Y and Myaeng S An opinion analysis system using domain-specific lexical knowledge Proceedings of the 4th Asia information retrieval conference on Information retrieval technology, (466-471)
  80. Antunes B, Seco N and Gomes P Using ontologies for software development knowledge reuse Proceedings of the aritficial intelligence 13th Portuguese conference on Progress in artificial intelligence, (357-368)
  81. ACM
    Antunes B, Seco N and Gomes P Knowledge management using semantic web technologies: an application in software development Proceedings of the 4th international conference on Knowledge capture, (187-188)
  82. Ochoa X and Duval E Relevance ranking metrics for learning objects Proceedings of the Second European conference on Technology Enhanced Learning: creating new learning experiences on a global scale, (262-276)
  83. Cha B, Kim K and Lee D Study of digital license search for intellectual property rights of S/W source code Proceedings of the 2007 international conference on Computational science and its applications - Volume Part III, (201-212)
  84. ACM
    Janssens F, Glänzel W and De Moor B Dynamic hybrid clustering of bioinformatics by incorporating text mining and citation analysis Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, (360-369)
  85. Celino I, Valle E, Cerizza D and Turati A Squiggle Proceedings of the 7th international conference on Web engineering, (485-490)
  86. Qin Z, Thint M and Sufyan Beg M Deduction Engine Design for PNL-Based Question Answering System Proceedings of the 12th international Fuzzy Systems Association world congress on Foundations of Fuzzy Logic and Soft Computing, (253-262)
  87. ACM
    Moreira J, Michael M, Da Silva D, Shiloach D, Dube P and Zhang L Scalability of the Nutch search engine Proceedings of the 21st annual international conference on Supercomputing, (3-12)
  88. ACM
    Ammons G, Appavoo J, Butrico M, Da Silva D, Grove D, Kawachiya K, Krieger O, Rosenburg B, Van Hensbergen E and Wisniewski R Libra Proceedings of the 3rd international conference on Virtual execution environments, (44-54)
  89. ACM
    Nandi A and Jagadish H Assisted querying using instant-response interfaces Proceedings of the 2007 ACM SIGMOD international conference on Management of data, (1156-1158)
  90. ACM
    Jagadish H, Chapman A, Elkiss A, Jayapandian M, Li Y, Nandi A and Yu C Making database systems usable Proceedings of the 2007 ACM SIGMOD international conference on Management of data, (13-24)
  91. Weiss C, Premraj R, Zimmermann T and Zeller A How Long Will It Take to Fix This Bug? Proceedings of the Fourth International Workshop on Mining Software Repositories
  92. ACM
    Vanderlei T, Durão F, Martins A, Garcia V, Almeida E and de L. Meira S A cooperative classification mechanism for search and retrieval software components Proceedings of the 2007 ACM symposium on Applied computing, (866-871)
  93. Parapar J, Casanova J and Barreiro Á NowOnWeb Proceedings of the 11th international conference on Computer aided systems theory, (225-232)
  94. Kouylekov M, Negri M, Magnini B and Coppola B Towards entailment-based question answering Proceedings of the 7th international conference on Cross-Language Evaluation Forum: evaluation of multilingual and multi-modal information retrieval, (526-536)
  95. Zanker M, Gordea S, Jessenitschnig M and Schnabl M A hybrid similarity concept for browsing semi-structured product items Proceedings of the 7th international conference on E-Commerce and Web Technologies, (21-30)
  96. Garcia V, Lucrédio D, Durão F, Santos E, de Almeida E, de Mattos Fortes R and de Lemos Meira S From specification to experimentation Proceedings of the 9th international conference on Component-Based Software Engineering, (82-97)
  97. Yang F, Feng J and Di Fabbrizio G A data driven approach to relevancy recognition for contextual question answering Proceedings of the Interactive Question Answering Workshop at HLT-NAACL 2006, (33-40)
  98. Mathiak B, Kupfer A, Münch R, Täubner C and Eckstein S Improving literature preselection by searching for images Proceedings of the 2006 international conference on Knowledge Discovery in Life Science Literature, (18-28)
  99. Aunimo L and Kuuskoski R Question answering experiments for finnish and french Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories, (477-487)
  100. Leidner J Experiments with geo-filtering predicates for IR Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories, (987-996)
  101. Kornai A Evaluating geographic information retrieval Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories, (928-938)
Contributors

Reviews

Arthur Gittleman

Search is at the heart of computing today. Google provides new features regularly, and has many competitors. Lucene is an open-source search engine that allows you to incorporate sophisticated search into your applications. This book, as its title implies, seeks to quickly get you up to speed in using Lucene. It succeeds admirably in elucidating the application programming interface (API), with many code examples and cogent explanations, opening the door to a fine tool. The first six chapters form the "Core Lucene" unit, while the last four make up "Applied Lucene." The core starts with an introductory chapter containing simple indexing and search examples. The next two chapters cover indexing and search more thoroughly. Chapter 4, on analyzers, discusses the process of converting the input text into terms that will be indexed. Lucene provides several analyzers, or one may create a custom analyzer. The last two core chapters address advanced search techniques, and extending search beyond Lucene's built-in capabilities. The core chapters assume text input, whereas the first applied chapter considers the parsing of Extensible Markup Language (XML), portable document format (PDF), Hypertext Markup Language (HTML), Microsoft Word, and rich text format (RTF) documents. Chapter 8 introduces third-party Lucene tools. Lucene is a Java application, but chapter 9 covers ports to other languages, including C++, C#, Perl, and Python. The last chapter presents interesting case studies, including Nutch, an open-source search engine. Other studies delineate Lucene's use at jGuru, Michaels.com, and TheServerSide. For those who want to look inside, an appendix discusses the Lucene index format. Hatcher has written on Ant and JUnit, and the authors incorporate Ant and JUnit in the Lucene examples. The introduction includes a few pages about JUnit, but not enough to convey its spirit to those who have not used it before. A JUnit appendix would be a good addition, or a Web site supplement. The book's Web site does contain the code for downloading, and an errata list. The examples do work using Ant, as the authors suggest, but they often just output that the test succeeded, without showing any details of the search, or of the indices created. The BaseIndexingTestCase in the download file has two tests that are not shown in the printed text on page 32. These have the effect of recreating the original index after the test completes, so one cannot see the index created during the test. Removing these extra tests, and using the Luke tool described in chapter 8, will allow closer inspection of the indices created by the tests, and make the study of the examples more concrete. Search powers the information age. This book is a gateway to this invaluable resource. Online Computing Reviews Service

Access critical reviews of Computing literature here

Become a reviewer for Computing Reviews.

Recommendations