Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/188490.188556acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
Article
Free access

Properties of extended Boolean models in information retrieval

Published: 01 August 1994 Publication History
  • Get Citation Alerts
  • First page of PDF

    References

    [1]
    Buell DA. A general model of query processing in information retrieval system, inforination Processing & Management 1981; 17(5):249-262
    [2]
    Radecki T. Fuzzy set theoretical approach to document retrieval. Information Processing & Management 1979; 15(5):247-259
    [3]
    Sachs WM. An approach to associative retrieval through the theory of fuzzy sets. Journal of the American Society for Information Science 1976; 27:85-87
    [4]
    Bookstein A. Fuzzy requests: an approach to weighted boolean searches. Journal of the American Society for information Science 1980; 31 (4):240-247
    [5]
    Lee JH, Kim MH, Lee YJ. Ranking documents in thesanrus-based boolean retrieval systems. Information Processing & Management 1994:30(1):79-91
    [6]
    Waller WG, Kraft DH. A mathematical model of a weighted boolean retrieval system. Information Processing & Manageinent 1979:15:235-245
    [7]
    Paice CP. Soft evaluation of boolean search queries in information retrieval systems. Information Technology: Research and Development 1984; 3(1):33-42
    [8]
    Salton G, Fox EA, Wu H. Extended boolean information retrieval. Communications of the ACM 1983; 26(11): 1022-1036
    [9]
    Smith ME. Aspects of the p-norm model of reformation retrieval: syntactic query generation, efficiency, and theoretical properties. Phi) thesis, Cornell University, 1990.
    [10]
    Zimmennann HJ. Fuzzy set theory and its applications, 2rid edition. Kluwer Academic Publishers, 1991
    [11]
    Lee JH, Kim MH, Lee Yj. Enhancing the fuzzy set model for high quality document rankings. In: Proceedings of the 19th Euromicro Conference. 1992, pp. 337-344
    [12]
    Kim MH, Lee JH, Lee YJ. Analysis of fuzzy operators for high quality information retrieval. Information Processing Letters 1993; 46(5):251-256
    [13]
    Lee JH, Kim WY, Kim MH, Lee YJ. On the evaluation of boolean operators in the extended boolean retrieval framework. In: Proceedings of the 16th Annual International ACM SIGIR Conference on Research and Development in Information Rea-icval. 1993, pp. 291-297
    [14]
    Fox EA, Betrabet S, Koushik M, Lee W. Extended boolean models. In: Frakes WB, Yates RB (ed) Information Retrieval Data Structures & Algorithms. Prentice Hall, 1992, pp. 393418
    [15]
    Zimmermann HJ. Fuzzy sets, decision making, and expert systems. Kluwer Academic Publishers, 1987

    Cited By

    View all
    • (2021)Unsupervised Topical Organization of Documents using Corpus-based Text AnalysisProceedings of the 13th International Conference on Management of Digital EcoSystems10.1145/3444757.3485078(87-94)Online publication date: 1-Nov-2021
    • (2018)A many-sorted theory proposal for information retrievalKnowledge and Information Systems10.1007/s10115-017-1074-955:1(113-139)Online publication date: 1-Apr-2018
    • (2012)Multiagent systems and information retrieval our experience with X.MASExpert Systems with Applications: An International Journal10.1016/j.eswa.2011.08.10339:3(2509-2523)Online publication date: 1-Feb-2012
    • Show More Cited By

    Recommendations

    Reviews

    Danny B. Lange

    Ranked document retrieval outputs in decreasing order of query-document similarities are obviously useful to control the size of a retrieved document set. Conventional Boolean retrieval systems, however, do not provide such ranked output because they cannot compute similarity coefficients between queries and documents. The author of this paper analyzes several extended Boolean models in order to determine which one is the most suitable for achieving high retrieval effectiveness. Although extended Boolean models use document term weights to calculate query-document similarities, ranking is often not satisfactory. The author demonstrates with clear examples that some models (fuzzy set models) can generate incorrectly ranked output that does not agree with human behavior. Positively compensatory operators and binary soft Boolean operators in other models (Waller-Kraft, Paice, P-Norm [1], and Infinite-One) are shown to overcome this problem. The author continues to demonstrate with a new set of clear examples that these models (except for P-Norm) still violate the usual assumption that all the terms given in a query are equally important. Lee concludes that, since P-Norm is the only model that solves both the deficiency of fuzzy models and the unequal importance problem, it is more effective than any of the other extended Boolean models. The concluding section is devoted to an analysis of the meaning of query weights. The analysis concludes that P-Norm is superior, since it uses relative query weights, found to be easier for users to write than absolute query weights. The author provides clear examples and presents the analysis in a very readable and convincing form supported by well-written mathematical proofs.

    Access critical reviews of Computing literature here

    Become a reviewer for Computing Reviews.

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGIR '94: Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
    August 1994
    363 pages
    ISBN:038719889X

    Sponsors

    Publisher

    Springer-Verlag

    Berlin, Heidelberg

    Publication History

    Published: 01 August 1994

    Check for updates

    Qualifiers

    • Article

    Conference

    SIGIR94
    Sponsor:
    • AICA
    • Irish Comp Soc
    • SIGIR
    • BCS-IRSG
    • BCS-IRSB
    • Dublin City University

    Acceptance Rates

    Overall Acceptance Rate 792 of 3,983 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)66
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 10 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2021)Unsupervised Topical Organization of Documents using Corpus-based Text AnalysisProceedings of the 13th International Conference on Management of Digital EcoSystems10.1145/3444757.3485078(87-94)Online publication date: 1-Nov-2021
    • (2018)A many-sorted theory proposal for information retrievalKnowledge and Information Systems10.1007/s10115-017-1074-955:1(113-139)Online publication date: 1-Apr-2018
    • (2012)Multiagent systems and information retrieval our experience with X.MASExpert Systems with Applications: An International Journal10.1016/j.eswa.2011.08.10339:3(2509-2523)Online publication date: 1-Feb-2012
    • (2010)Extended Boolean retrieval for systematic biomedical reviewsProceedings of the Thirty-Third Australasian Conferenc on Computer Science - Volume 10210.5555/1862199.1862212(117-126)Online publication date: 1-Jan-2010
    • (2010)Quantum logic based MPEG query format algebraProceedings of the 8th international conference on Adaptive Multimedia Retrieval: context, exploration, and fusion10.1007/978-3-642-27169-4_15(204-219)Online publication date: 17-Aug-2010
    • (2010)QSQLProceedings of the 15th international conference on Database Systems for Advanced Applications - Volume Part I10.1007/978-3-642-12026-8_33(429-443)Online publication date: 1-Apr-2010
    • (2009)The challenge of high recall in biomedical systematic searchProceedings of the third international workshop on Data and text mining in bioinformatics10.1145/1651318.1651338(89-92)Online publication date: 6-Nov-2009
    • (2009)Rich document representation and classificationKnowledge-Based Systems10.1016/j.knosys.2008.06.00222:1(67-71)Online publication date: 1-Jan-2009
    • (2009)SurveyComputer Science Review10.1016/j.cosrev.2009.03.0013:3(151-173)Online publication date: 1-Aug-2009
    • (2009)Experiments with Automatic Query Formulation in the Extended Boolean ModelProceedings of the 12th International Conference on Text, Speech and Dialogue10.1007/978-3-642-04208-9_51(371-378)Online publication date: 25-Aug-2009
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media