Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/312624.312647acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
Article
Free access

A re-examination of text categorization methods

Published: 01 August 1999 Publication History
  • Get Citation Alerts
  • First page of PDF

    References

    [1]
    C. Apte, N. Damerau, and S. Weiss. Towards language independent automated learning of text categorization models. In Proceedings of the 17th Annual A CM/SIGIR conference, 1994.
    [2]
    C. Apte, F. Damerau, and S. Weiss. Text mining with decision rules and decision trees. In Proceedings of the Conference on Automated Learning and Discorery, Workshop 6: Learning from Text and the Web, 1998.
    [3]
    L. Douglas Baker and Andrew K. Mccallum. Distributional clustering of words for text categorization. In Proceedings of the 21th Ann Int A CM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'98), pages 96-103, 1998.
    [4]
    D. Berry and B.W. Lindgren. Statistics: Theory and Methods. Brooks/Cole, Pacific Grove, California, 1990.
    [5]
    William W. Cohen. Text categorization and relational learning. In The Twelfth International Conference on Machine Learning (ICML'95). Morgan Kaufmann, 1995.
    [6]
    William W. Cohen and Yoram Singer. Context-sensitive learning methods for text categorization. In SIGIR '96: Proceedings of the 19th Annual International A CM SIGIR Conference on Research and Development in Information Retrieval, 1996. 307-315.
    [7]
    C. Cortes and V. Vapnik. Support vector networks. Machine Learning, 20:273-297, 1995.
    [8]
    Belur V. Dasarathy. Nearest Neighbor (NN) Norms: NN Pattern Classification Techniques. McGraw-Hill Computer Science Series. IEEE Computer Society Press, Las Alamitos, California, 1991.
    [9]
    N. Fuhr, S. Hartmanna, G. Lustig, M. Schwantner, and K. Tzeras. Air/x - a rule-based multistage indexing systems for large subject fields. In 606-623, editor, Proceedings of RIAO'91, 1991.
    [10]
    P.J. Hayes and S. P. Weinstein. Construe/tis: a system for content-based indexing of a database of new stories. In Second Annual Conference on Innovative Applications of ArtificiaI Intelligence, 1990.
    [11]
    Makato Iwayama and Takenobu Tokunaga. Cluster-based text categorization: a comparison of category search strategies. In Proceedings of the 18th Ann Int A CM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'95), pages 273-281, 1995.
    [12]
    Thorsten Joachims. Text Categorization with Support Vector Machines: Learning with Many Relevant Features. In European Conference on Machine Learning (ECML), 1998.
    [13]
    D. Koller and M. Sahami. Hierarchically classifying documents using very few words. In The Fourteenth International Conference on Machine Learning (ICML'97), pages 170-178, 1997.
    [14]
    W. Lam and C.Y. Ho. Using a generalized instance set for automatic text categorization. In Proceedings of the 21th Ann Int A CM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'98), pages 81-89, 1998.
    [15]
    David D. Lewis, Robert E. Schapire, James P. Callan, and Ron Papka. Training algorithms for linear text classifiers. In SIGIR '96: Proceedings of the 19th Annual International A CM SIGIR Conference on Research and Development in Information Retrieval, 1996. 298-306.
    [16]
    D.D. Lewis and M. Ringuette. Comparison of two learning algorithms for text categorization. In Proceedings of the Third Annual Symposium on Document Analysis and Information Retrieval (SDAIR'94), 1994.
    [17]
    B. Masand, G. Linoff, and D. Waltz. Classifying news stories using memory based reasoning. In 15th Ann Int A CM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'92), pages 59-64, 1992.
    [18]
    A. McCallum and K. Nigam. A comparison of event models for naive bayes text classification. In AAAI-98 Workshop on Learning for Text Categorization, 1998.
    [19]
    Tom Mitchell. Machine Learning. McGraw Hill, 1996.
    [20]
    I. Moulinier. Is learning bias an issue on the text categorization problem? In Technical report, LAFORIA-LIP6, Universite Paris VI, 1997.
    [21]
    I. Moulinier, G. Raskinis, and J. Ganascia. Text categorization: a symbolic approach. In Proceedings of the Fifth Annual Symposium on Document Analysis and Information Retrieval, 1996.
    [22]
    H.T. Ng, W.B. Goh, and K.L. Low. Feature selection, perceptron learning, and a usability case study for text categorization. In 20th Ann Int A CM SIGIR Conference on Research and Development in Information Retrieval (SI- GIR'97), pages 67-73, 1997.
    [23]
    Osuna, R. Freund, and F. Girosi. Support vector machines: Training and applications. In A.L Memo. MIT A.I. Lab, 1996.
    [24]
    J. Platt. Sequetial minimal optimization: A fast algorithm for training support vector machines. In Technical Report MST-TR-98-14. Microsoft Research, 1998.
    [25]
    K. Tzeras and S. Hartman. Automatic indexing based on bayesian inference networks. In Proc 16th Ann Int A CM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'93), pages 22-34, 1993.
    [26]
    C.J. van Rijsbergen. Information Retrieval. Butterworths, London, 1979.
    [27]
    V. Vapnic. The Nature of Statistical Learning Theory. Springer, New York, 1995.
    [28]
    E. Wiener, J.O. Pedersen, and A.S. Weigend. A neural network approach to topic spotting. In Proceedings of the Fourth Annual Symposium on Document Analysis and Information Retrieval (SDAIR'95), 1995.
    [29]
    Y. Yang. Expert network: Effective and efficient learning from human decisions in text categorization and retrieval. In 17th Ann Int A CM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'94), pages 13- 22, 1994.
    [30]
    Y. Yang. Sampling strategies and learning efficiency in text categorization. In AAAI Spring Symposium on Machine Learning in Information Access, pages 88-95, 1996.
    [31]
    Y. Yang. An evaluation of statistical approaches to text categorization. Journal of Information Retrieval (to appear), 1999.
    [32]
    Y. Yang and C.G. Chute. An example-based mapping method for text categorization and retrieval. A CM Transaction on Information Systems (TOIS), 12(3):252-277, 1994.
    [33]
    Y. Yang and J.P. Pedersen. Feature selection in statistical learning of text categorization. In The Fourteenth International Conference on Machine Learning, pages 412-420, 1997.

    Cited By

    View all
    • (2024)Utilizing Artificial Intelligence for Text Classification in Communication SciencesDesign and Development of Emerging Chatbot Technology10.4018/979-8-3693-1830-0.ch013(218-235)Online publication date: 15-Mar-2024
    • (2024)Research on Aspect-Level Sentiment Analysis Based on Adversarial Training and Dependency ParsingElectronics10.3390/electronics1310199313:10(1993)Online publication date: 20-May-2024
    • (2024)Joint Representation Learning for Retrieval and Annotation of Genomic Interval SetsBioengineering10.3390/bioengineering1103026311:3(263)Online publication date: 8-Mar-2024
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGIR '99: Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
    August 1999
    339 pages
    ISBN:1581130961
    DOI:10.1145/312624
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 August 1999

    Permissions

    Request permissions for this article.

    Check for updates

    Qualifiers

    • Article

    Conference

    SIGIR99
    Sponsor:

    Acceptance Rates

    SIGIR '99 Paper Acceptance Rate 33 of 135 submissions, 24%;
    Overall Acceptance Rate 792 of 3,983 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)611
    • Downloads (Last 6 weeks)57
    Reflects downloads up to 27 Jul 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Utilizing Artificial Intelligence for Text Classification in Communication SciencesDesign and Development of Emerging Chatbot Technology10.4018/979-8-3693-1830-0.ch013(218-235)Online publication date: 15-Mar-2024
    • (2024)Research on Aspect-Level Sentiment Analysis Based on Adversarial Training and Dependency ParsingElectronics10.3390/electronics1310199313:10(1993)Online publication date: 20-May-2024
    • (2024)Joint Representation Learning for Retrieval and Annotation of Genomic Interval SetsBioengineering10.3390/bioengineering1103026311:3(263)Online publication date: 8-Mar-2024
    • (2024)Three-Branch BERT-Based Text Classification Network for Gastroscopy Diagnosis TextInternational Journal of Crowd Science10.26599/IJCS.2023.91000318:1(56-63)Online publication date: Feb-2024
    • (2024)A Comparative Analysis of Feature Selection Algorithms in Cross Domain Sentiment ClassificationRecent Advances in Computer Science and Communications10.2174/012666255827688924012506285717:3Online publication date: May-2024
    • (2024)Classifying User Roles in Online News Forums: A Model for User Interaction and Behavior AnalysisAdjunct Proceedings of the 32nd ACM Conference on User Modeling, Adaptation and Personalization10.1145/3631700.3665187(240-249)Online publication date: 27-Jun-2024
    • (2024)Deep Efficient Continuous Manifold Learning for Time Series ModelingIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2023.332012546:1(171-184)Online publication date: Jan-2024
    • (2024)TagRec++: Hierarchical Label Aware Attention Network for Question CategorizationIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.3354504(1-12)Online publication date: 2024
    • (2024)Variational Continuous Label Distribution Learning for Multi-Label Text ClassificationIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.3323401(1-15)Online publication date: 2024
    • (2024)Reconstructive Classification for Age and Gender Identification in Social NetworksIEEE Transactions on Computational Social Systems10.1109/TCSS.2023.326776611:2(2291-2301)Online publication date: Apr-2024
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media