Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article
Free access

A vector space model for automatic indexing

Published: 01 November 1975 Publication History
  • Get Citation Alerts
  • Abstract

    In a document retrieval, or other pattern matching environment where stored entities (documents) are compared with each other or with incoming patterns (search requests), it appears that the best indexing (property) space is one where each entity lies as far away from the others as possible; in these circumstances the value of an indexing system may be expressible as a function of the density of the object space; in particular, retrieval performance may correlate inversely with space density. An approach based on space density computations is used to choose an optimum indexing vocabulary for a collection of documents. Typical evaluation results are shown, demonstating the usefulness of the model.

    References

    [1]
    Salton, G. Automatic btformation Organiza;ion and Retrieval. McGraw-Hill, New York, 1968, Ch. 4.
    [2]
    Salton, G., and Yang, C.S. On the specification of term values in automatic indexing. J. Documen. 29, 4 (Dec. 1973), 351-372.
    [3]
    Sparck Jones, K. A statistical interpretation of term specificity and its application to retrieval. J. Documen. 28, 1 (March 1972), 11-20.
    [4]
    Williamson, R.E. Real-time document retrieval. Ph.D. Th., Computer Sci. Dep., Cornell U., June 1974.
    [5]
    Wong, A. An investigation of the effects of different indexing methods on the document space configuration. Sci. Rep. ISR-22, Computer Sci. Dep., Cornell U., Section II, Nov. 1974.
    [6]
    Salton, G. A theory of indexing. Regional Conference Series in Applied Mathematics No. 18, SIAM, Philadelphia, Pa., 1975.
    [7]
    Salton, G., Yang, C.S., and Yu, C.T. Contribution to the theory of indexing. Proc. IFIP Congress 74, Stockholm, August 1974. American Elsevier, New York, 1974.

    Cited By

    View all
    • (2024)Keyword Pool Generation for Web Text Collecting: A Framework Integrating Sample and Semantic InformationMathematics10.3390/math1203040512:3(405)Online publication date: 26-Jan-2024
    • (2024)Identifying Land Use Functions in Five New First-Tier Cities Based on Multi-Source Big DataLand10.3390/land1303027113:3(271)Online publication date: 21-Feb-2024
    • (2024)Development of Context-Based Sentiment Classification for Intelligent Stock Market PredictionBig Data and Cognitive Computing10.3390/bdcc80600518:6(51)Online publication date: 22-May-2024
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Communications of the ACM
    Communications of the ACM  Volume 18, Issue 11
    Nov. 1975
    54 pages
    ISSN:0001-0782
    EISSN:1557-7317
    DOI:10.1145/361219
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 November 1975
    Published in CACM Volume 18, Issue 11

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. automatic indexing
    2. automatic information retrieval
    3. content analysis
    4. document space

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)2,312
    • Downloads (Last 6 weeks)266

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Keyword Pool Generation for Web Text Collecting: A Framework Integrating Sample and Semantic InformationMathematics10.3390/math1203040512:3(405)Online publication date: 26-Jan-2024
    • (2024)Identifying Land Use Functions in Five New First-Tier Cities Based on Multi-Source Big DataLand10.3390/land1303027113:3(271)Online publication date: 21-Feb-2024
    • (2024)Development of Context-Based Sentiment Classification for Intelligent Stock Market PredictionBig Data and Cognitive Computing10.3390/bdcc80600518:6(51)Online publication date: 22-May-2024
    • (2024)Document Retrieval System for Biomedical Question AnsweringApplied Sciences10.3390/app1406261314:6(2613)Online publication date: 20-Mar-2024
    • (2024)Decision Support System Driven by Thermo-Complexity: Scenario Analysis and Data VisualizationApplied Sciences10.3390/app1406238714:6(2387)Online publication date: 12-Mar-2024
    • (2024)A Review of Current Trends, Techniques, and Challenges in Large Language Models (LLMs)Applied Sciences10.3390/app1405207414:5(2074)Online publication date: 1-Mar-2024
    • (2024)Exploring the Cognitive Neural Basis of Factuality in Abstractive Text Summarization Models: Interpretable Insights from EEG SignalsApplied Sciences10.3390/app1402087514:2(875)Online publication date: 19-Jan-2024
    • (2024)Is the expression of different discrete emotions related to time? Evidence from online Chinese reviews using sentiment analysis and human behavior dynamicsFrontiers in Psychology10.3389/fpsyg.2024.132158215Online publication date: 6-Mar-2024
    • (2024)A survey on knowledge-aware news recommender systemsSemantic Web10.3233/SW-22299115:1(21-82)Online publication date: 12-Jan-2024
    • (2024)Siamese capsule network with position correlation and integrating articles of law for Chinese similar case matchingJournal of Intelligent & Fuzzy Systems10.3233/JIFS-23218546:2(4713-4731)Online publication date: 14-Feb-2024
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Get Access

    Login options

    Full Access

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media