Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/544220.544246acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
Article

Comparison of two approaches to building a vertical search tool: a case study in the nanotechnology domain

Published: 14 July 2002 Publication History

Abstract

As the Web has been growing exponentially, it has become increasingly difficult to search for desired information. In recent years, many domain-specific (vertical) search tools have been developed to serve the information needs of specific fields. This paper describes two approaches to building a domain-specific search tool. We report our experience in building two different tools in the nanotechnology domain -- (1) a server-side search engine, and (2) a client-side search agent. The designs of the two search systems are presented and discussed, and their strengths and weaknesses are compared. Some future research directions are also discussed.

References

[1]
Bowman, C. M., Danzig, P. B., Manber, U., and Schwartz F. Scalable Internet Resource Discovery: Research Problems and Approaches, Communications of the ACM, 37(8) (1994), 98--107
[2]
Brin, S. and Page, L. The Anatomy of a Large-Scale Hypertextual Web Search Engine. In Proceedings of the 7th International World Wide Web Conference (WWW7), Brisbane, Australia, Apr 1998
[3]
Chakrabarti, S., van den Berg, M., and Dom B. Focused Crawling: A New Approach to Topic-Specific Web Resource Discovery. In Proceedings of the 8th International World Wide Web Conference (WWW8), Toronto, Canada, May 1999
[4]
Chau, M., Chen, H., Qin, J., Zhou, Y., Sung, W. K., Chen, Y., Qin, Y., McDonald, D., Lally, A., and Landon, M. NanoPort: A Web Portal for Nanoscale Science and Technology. In Proceedings of the 2nd ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL'02), Portland, OR, USA, July 2002
[5]
Chau, M., Zeng, D., and Chen, H. Personalized Spiders for Web Search and Analysis. In Proceedings of the First ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL'01), Roanoke, VA, USA, June 2001
[6]
Chen, H. "Collaborative Systems: Solving the Vocabulary Problem," IEEE Computer, Special Issue on Computer-Supported Cooperative Work (CSCW), 27(5) (1994), 58--66
[7]
Chen, H., Fan, H., Chau, M., and Zeng, D. MetaSpider: Meta-Searching and Categorization on the Web, Journal of the American Society for Information Science and Technology, 52(13), 1134--1147 (2001)
[8]
Chen, H., Schufels, C., and Orwig, R. Internet Categorization and Search: A Self-Organizing Approach, Journal of Visual Communication and Image Representation, 7(1), 88--102 (1996)
[9]
Courteau, J. "Genome Databases," Science, 254, (1991), 201--207
[10]
DeBra, P. and Post, R. Information retrieval in the World-Wide Web: Making Client-based Searching Feasible. In Proceedings of the First International World Wide Web Conference, Geneva, Switzerland, 1994
[11]
Fox, E., Hix, D., Nowell, L. T., Brueni, D. J., Wake, W. C., Lenwood, S. H., and Rao, D. Users, User Interfaces, and Objects: Envision, A Digital Library. Journal of the American Society for Information Science, 44(8) (1993), 480--491
[12]
Furnas, G. W., Landauer, T. K., Gomez, L. M., and Dumais, S. T. "The Vocabulary Problem in Human-System Communication" Communications of the ACM, 30(11), (1987), 964--971
[13]
Hearst, M. A. TextTiling: Segmenting Text into Multi-paragraph Subtopics Passages. Computational Linguistics, 23(1) (1997), 33--64
[14]
Hearst, M. A. and Pedersen, J. Reexamining the Cluster Hypothesis: Scatter/Gather on Retrieval Results, in Proceedings of the 19th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'96), 76--84 (1996)
[15]
Hovy, E. and Lin, C. Y. Automated Text Summarization in SUMMARIST. Advances in Automatic Text Summarization, 81--94, MIT Press 1999
[16]
Kohonen, T. Self-Organizing Maps. Springer-Verlag, Berlin, 1995
[17]
Lawrence, S. and Giles, C. L., Inquirus, the NECI Meta Search Engine. In Proceedings of the 7th International World Wide Web Conference, Brisbane, Australia, Apr 1998
[18]
Lawrence, S. and Giles, C. L. Accessibility of Information on the Web, Nature, 400 (1999), 107--109
[19]
Lin, C., Chen, H., and Nunamaker, J. Verifying the Proximity and Size Hypothesis for Self-Organizing Maps. Journal of Management Information Systems, 16(3) (1999-2000), 61--73
[20]
Lin, X., Soergel, D., and Marchionini, G. A Self-organizing Semantic Map for Information Retrieval, in Proceedings of the 14th International ACM SIGIR Conference on Research and Development in Information Retrieval (1991), 262--269
[21]
Luhn, H. P. The Automatic Creation of Literature Abstracts. IBM Journal of Research and Development 2 (2), 159--165 (1959)
[22]
Mani, I. and Maybury, M. T. Advances in Automatic Text Summarization. MIT Press, 1999, ix-xv
[23]
Mauldin, M. L. Lycos: Design Choices in an Internet Search Service. IEEE Expert, 12(1) (1997), 8--11
[24]
McBryan, O. A. GENVL and WWWW: Tools for Taming the Web. In Proceedings of the 1st International World Wide Web Conference, Geneva, Switzerland, 1994
[25]
Pinkerton, B. Finding What People Want: Experiences with the WebCrawler. In Proceedings of the 2nd International World Wide Web Conference, Chicago, IL, USA, 1994
[26]
Shneiderman, B., Feldman, D., Rose, A. and Grau, X. F. Visualizing Digital Library Search Results with Categorical and Hierarchical Axes, in Proceedings of 5th ACM Conference on ACM 2000 Digital Libraries, San Antonio, TX, USA, 2000
[27]
Stix, G. (ed.). Nanotechnology. Scientific America, September 2001 (entire issue)
[28]
Tolle, K. M. and Chen, H. Comparing Noun Phrasing Techniques for Use with Medical Digital Library Tools. Journal of the American Society for Information Science, 51(4) (2000), 352--370
[29]
Veerasamy, A. and Belkin, N. J., Evaluation of a Tool for Visualization of Information Retrieval Results. In Proceedings of the 19th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'96), 85--92, 1996

Cited By

View all
  • (2016)Insights into the Search Behavior of Non-Medical Professionals Based on Task Difficulty and an Evaluation against New Generation Medical Information Retrieval StrategiesBusiness Intelligence10.4018/978-1-4666-9562-7.ch065(1314-1339)Online publication date: 2016
  • (2015)A Search Engine Development Utilizing Unsupervised Learning ApproachIntelligence in the Era of Big Data10.1007/978-3-662-46742-8_21(223-233)Online publication date: 2015
  • (2014)Insights into the Search Behavior of Non-Medical Professionals Based on Task Difficulty and an Evaluation against New Generation Medical Information Retrieval StrategiesAdvancing Medical Practice through Technology10.4018/978-1-4666-4619-3.ch001(1-26)Online publication date: 2014
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
JCDL '02: Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
July 2002
448 pages
ISBN:1581135130
DOI:10.1145/544220
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 July 2002

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. indexing
  2. information retrieval
  3. internet searching and browsing
  4. internet spider
  5. noun-phrasing
  6. personalization
  7. post-retrieval analysis
  8. self-organizing map
  9. summarization
  10. vertical search engine
  11. web search engine

Qualifiers

  • Article

Conference

JCDL02
Sponsor:
JCDL02: Joint Conference on Digital Libraries 2002
July 14 - 18, 2002
Oregon, Portland, USA

Acceptance Rates

JCDL '02 Paper Acceptance Rate 69 of 240 submissions, 29%;
Overall Acceptance Rate 415 of 1,482 submissions, 28%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 18 Aug 2024

Other Metrics

Citations

Cited By

View all
  • (2016)Insights into the Search Behavior of Non-Medical Professionals Based on Task Difficulty and an Evaluation against New Generation Medical Information Retrieval StrategiesBusiness Intelligence10.4018/978-1-4666-9562-7.ch065(1314-1339)Online publication date: 2016
  • (2015)A Search Engine Development Utilizing Unsupervised Learning ApproachIntelligence in the Era of Big Data10.1007/978-3-662-46742-8_21(223-233)Online publication date: 2015
  • (2014)Insights into the Search Behavior of Non-Medical Professionals Based on Task Difficulty and an Evaluation against New Generation Medical Information Retrieval StrategiesAdvancing Medical Practice through Technology10.4018/978-1-4666-4619-3.ch001(1-26)Online publication date: 2014
  • (2014)Domain Specific SearchProfessional Search in the Modern World10.1007/978-3-319-12511-4_6(96-117)Online publication date: 2014
  • (2013)Toward a model of domain-specific searchProceedings of the 10th Conference on Open Research Areas in Information Retrieval10.5555/2491748.2491757(33-36)Online publication date: 15-May-2013
  • (2008)Identification of time-varying objects on the webProceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries10.1145/1378889.1378939(285-294)Online publication date: 16-Jun-2008
  • (2008)Patterns of Information Search and Access on the World Wide Web: Democratizing Expertise or Creating New Hierarchies?Journal of Computer-Mediated Communication10.1111/j.1083-6101.2008.00419.x13:4(769-793)Online publication date: Jul-2008
  • (2007)ReCQProceedings of the 6th international and interdisciplinary conference on Modeling and using context10.5555/1770806.1770825(248-262)Online publication date: 20-Aug-2007
  • (2007)Extracting domain-specific terms from unlabeled web documents by bootstrapping and term classifiers2007 IEEE International Conference on Systems, Man and Cybernetics10.1109/ICSMC.2007.4413834(3875-3880)Online publication date: Oct-2007
  • (2007)ReCQ: Real-World Context-Aware QueryingModeling and Using Context10.1007/978-3-540-74255-5_19(248-262)Online publication date: 2007
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media