Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1102351.1102445acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlConference Proceedingsconference-collections
Article

Learning hierarchical multi-category text classification models

Published: 07 August 2005 Publication History

Abstract

We present a kernel-based algorithm for hierarchical text classification where the documents are allowed to belong to more than one category at a time. The classification model is a variant of the Maximum Margin Markov Network framework, where the classification hierarchy is represented as a Markov tree equipped with an exponential family defined on the edges. We present an efficient optimization algorithm based on incremental conditional gradient ascent in single-example subspaces spanned by the marginal dual variables. Experiments show that the algorithm can feasibly optimize training sets of thousands of examples and classification hierarchies consisting of hundreds of nodes. The algorithm's predictive accuracy is competitive with other recently introduced hierarchical multi-category or multilabel classification learning algorithms.

References

[1]
Altun, Y., Tsochantaridis, I., & Hofmann, T. (2003). Hidden markov support vector machines. ICML'03 (pp. 3--10).
[2]
Bertsekas, D. (1999). Nonlinear programming. Athena Scientific.
[3]
Cai, L., & Hofmann, T. (2004). Hierarchical document categorization with support vector machines. 13 ACM CIKM.
[4]
Cesa-Bianchi, N., Gentile, C., Tironi, A., & Zaniboni, L. (2004). Incremental algorithms for hierarchical classification. Neural Information Processing Systems.
[5]
Dekel, O., Keshet, J., & Singer, Y. (2004). Large margin hierarchical classification. ICML'04 (pp. 209--216).
[6]
Dumais, S. T., & Chen, H. (2000). Hierarchical classification of web content. SIGIR'00 (pp. 256--263).
[7]
Hofmann, T., Cai., L., & Ciaramita, M. (2003). Learning with taxonomies: Classifying documents and words. NIPS Workshop on Syntax, Semantics, and Statistics.
[8]
Koller, D., & Sahami, M. (1997). Hierarchically classifying documents using very few words. ICML'97 (pp. 170--178).
[9]
Lewis, D. D., Yang, Y., Rose, T. G., & Li, F. (2004). Rev1: A new benchmark collection for text categorization research. JMLR, 5, 361--397.
[10]
McCallum, A., Rosenfeld, R., Mitchell, T., & Ng, A. Y. (1998). Improving text classification by shrinkage in a hierarchy of classes. ICML'98 (pp. 359--367).
[11]
Taskar, B., Guestrin, C., & Koller, D. (2003). Maxmargin markov networks. Neural Information Processing Systems.
[12]
Tsochantaridis, I., Hofmann, T., Joachims, T., & Altun, Y. (2004). Support vector machine learning for interdependent and structured output spaces. ICML'04 (pp. 823--830).
[13]
Wainwright, M., & Jordan, M. (2003). Graphical models, exponential families, and variational inference (Technical Report 649). Department of Statistics, University of California, Berkeley.
[14]
WIPO (2001). World intellectual property organization. http://www.wipo.int/classifications/en.

Cited By

View all
  • (2023)An end-to-end neural framework using coarse-to-fine-grained attention for overlapping relational triple extractionNatural Language Engineering10.1017/S1351324923000050(1-24)Online publication date: 21-Feb-2023
  • (2022)Hierarchical Multilabel Ship Classification in Remote Sensing Images Using Label Relation GraphsIEEE Transactions on Geoscience and Remote Sensing10.1109/TGRS.2021.311111760(1-13)Online publication date: 2022
  • (2020)A survey on text classification and its applicationsWeb Intelligence10.3233/WEB-200442(1-12)Online publication date: 6-Aug-2020
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICML '05: Proceedings of the 22nd international conference on Machine learning
August 2005
1113 pages
ISBN:1595931805
DOI:10.1145/1102351
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 August 2005

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 140 of 548 submissions, 26%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)20
  • Downloads (Last 6 weeks)4
Reflects downloads up to 15 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2023)An end-to-end neural framework using coarse-to-fine-grained attention for overlapping relational triple extractionNatural Language Engineering10.1017/S1351324923000050(1-24)Online publication date: 21-Feb-2023
  • (2022)Hierarchical Multilabel Ship Classification in Remote Sensing Images Using Label Relation GraphsIEEE Transactions on Geoscience and Remote Sensing10.1109/TGRS.2021.311111760(1-13)Online publication date: 2022
  • (2020)A survey on text classification and its applicationsWeb Intelligence10.3233/WEB-200442(1-12)Online publication date: 6-Aug-2020
  • (2019)Hierarchical Multi-label Text ClassificationProceedings of the 28th ACM International Conference on Information and Knowledge Management10.1145/3357384.3357885(1051-1060)Online publication date: 3-Nov-2019
  • (2019)A Survey of Hierarchical Classification Algorithms with Big-Bang Approach2019 5th International Conference on Science and Technology (ICST)10.1109/ICST47872.2019.9166313(1-6)Online publication date: Jul-2019
  • (2018)Dual set multi-label learningProceedings of the Thirty-Second AAAI Conference on Artificial Intelligence and Thirtieth Innovative Applications of Artificial Intelligence Conference and Eighth AAAI Symposium on Educational Advances in Artificial Intelligence10.5555/3504035.3504480(3635-3642)Online publication date: 2-Feb-2018
  • (2018)Multi-label classification of documents using fine-grained weights and modified co-trainingIntelligent Data Analysis10.3233/IDA-16326422:1(103-115)Online publication date: 22-Feb-2018
  • (2018)Learning with Latent Label Hierarchy from Incomplete Multi-Label Data2018 24th International Conference on Pattern Recognition (ICPR)10.1109/ICPR.2018.8545329(2075-2080)Online publication date: Aug-2018
  • (2018)Multi-label Learning with Missing Labels Using Mixed Dependency GraphsInternational Journal of Computer Vision10.1007/s11263-018-1085-3126:8(875-896)Online publication date: 1-Aug-2018
  • (2017)Soft estimation by hierarchical classification and regressionNeurocomputing10.1016/j.neucom.2016.12.037234:C(27-37)Online publication date: 19-Apr-2017
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media