Abstract
Traditional search tools that employ keyword and phrase matching between the query and search index alone tend to offer high recall and low precision. The search users are faced with too many irrelevant results. In order to solve this problem, we propose a novel search technique that effectively searches the target documents by the search query whose definition is based on the document type, search terms and the semantic relationship between the search terms and the target documents. We present a technique that collects search terms and their semantic relationship from office documents and generates XML-based search indices. The search system implementation and query response time evaluation are also discussed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Adobe Systems Incorporated, Adobe LiveCycle Designer ES (2009), http://www.adobe.com/products/livecycle/designer/
Apple, “Spotlight” (2009), http://www.apple.com/macosx/features/300.html , #spotlight
Ding, L., Finin, T., Joshi, A., Pan, R., Cost, R.S., Peng, Y., Reddivari, P., Doshi, V.C., Sachs, J.: Swoogle: A search and metadata engine for the semantic web. In: Proc. of CIKM 2004, pp. 652–659 (2004)
Dittrich, J.-P., Duda, C., Jarisch, B., Kossmann, D., Vaz, M.A.: Salles ETH Zurich. Bringing Precision to Desktop Search: A Predicate-based Desktop Search Architecture. In: Proc. of ICDE 2007, pp. 1461–1465 (2007)
Dublin Core Metadata Initiative, DCMI Metadata Terms (2006), http://dublincore.org/documents/dcmi-terms/
Dumais, S., Cutrell, E., Cadiz, J., Jancke, G., Sarin, R., Robbins, D.C.: Stuff i’ve seen: A system for personal information retrieval and re-use. In: Proc. of SIGIR 2003, pp. 72–79 (2003)
Google, Google Desktop Search (2009), http://desktop.google.com/en/GB/features.html
Jenkins, C., Inman, D.: Server-Side Automatic Metadata Generation using Qualified Dublin Core and RDF. In: Proc. of Int. Conf. on Digital Libraries, pp. 245–253 (2000)
Korol, J.: Excel 2003 VBA programming with XML and ASP. Wordware Publishing (2006)
Liberty, J., Hurwitz, D.: Programming Asp. Net, Oreilly & Associates Inc. (2003)
Microsoft, Technical Overview of Internet Information Services (IIS) 6.0 (2002), http://www.microsoft.com/windowsserver2003/techinfo/overview/iis.mspx
Microsoft, Microsoft Office Suites (2009), http://office.microsoft.com/en-us/suites/HA101757031033.aspx
Rinaldi, A.M.: A content-based approach for document representation and retrieval. In: DocEng 2008, pp. 106–109. ACM Press, New York (2008)
W3C, XML Path Language (XPath) Version 1.0 (1999), http://www.w3.org/TR/1999/REC-xpath-19991116
W3C, XML Schema (2001), http://www.w3c.org/XML/Schema
W3C, Resource Description Framework, RDF (2004), http://www.w3.org/RDF/
W3C, Extensible Markup Language (XML) 1.0 (4th edn.) (2006), http://www.w3.org/TR/2006/REC-xml-20060816/
X1 Technologies, X1 Professional Client (2009), http://www.x1.com/products/xds.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chatvichienchai, S., Tanaka, K. (2009). Bringing Precision to Office Document Search by Semantic Relationship Approach. In: Papasratorn, B., Chutimaskul, W., Porkaew, K., Vanijja, V. (eds) Advances in Information Technology. IAIT 2009. Communications in Computer and Information Science, vol 55. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10392-6_6
Download citation
DOI: https://doi.org/10.1007/978-3-642-10392-6_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-10391-9
Online ISBN: 978-3-642-10392-6
eBook Packages: Computer ScienceComputer Science (R0)