Abstract
Going beyond the traditional text classification, involving a few tens of classes, there has been a surge of interest in automatic document categorization in large taxonomies where the number of classes range from hundreds of thousands to millions. Due to the complex nature of the learning problem posed in such scenarios, one needs to adapt the conventional classification schemes to suit this domain. This paper presents a novel approach for classifier selection in large hierarchies, which is based on exploiting training data heterogeneity across the hierarchy. We also present a meta-learning framework for further flexibility in classifier selection. The experimental results demonstrate the applicability of our approach, which achieves accuracy comparable to the state-of-the-art and is also significantly faster for prediction.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Babbar, R., Partalas, I., Gaussier, E., Amblard, C.: On empirical tradeoffs in large scale hierarchical classification. In: ACM CIKM (2012)
Bennett, N.P., Nguyen, N.: Refined experts: improving classification in large taxonomies. In: Int. ACM SIGIR Conference, pp. 11–18 (2009)
Cai, L., Hofmann, T.: Hierarchical document categorization with support vector machines. In: CIKM, pp. 78–87 (2004)
Fan, E.R., Chang, W.K., Hsieh, J.C., Wang, R.X., Lin, J.C.P.: LIBLINEAR: A library for large linear classification. JMLR 9, 1871–1874 (2008)
Liu, Y.T., Yang, Y., Wan, H., Zeng, J.H., Chen, Z., Ma, Y.W.: Support vector machines classification with a very large-scale taxonomy. SIGKDD Explor. Newsl., 36–43 (2005)
Ng, Y.A., Jordan, I.M.: On discriminative vs. generative classifiers: A comparison of logistic regression and naive bayes. In: NIPS, pp. 841–848 (2001)
Schaul, T., Schmidhuber, J.: Metalearning. Scholarpedia 5, 4650 (2010)
Secker, A., Davies, N.M., Freitas, A.A., Clark, B.E., Timmis, J., Flower, R.D.: Hierarchical classification of g-protein-coupled receptors with data-driven selection of attributes and classifiers. Int. J. Data Min. Bioinformatics, 91–210 (2010)
Xue, R.G., Xing, D., Yang, Q., Yu, Y.: Deep classification in large-scale text hierarchies. In: Int. ACM SIGIR Conference, pp. 619–626 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Partalas, I., Babbar, R., Gaussier, E., Amblard, C. (2012). Adaptive Classifier Selection in Large-Scale Hierarchical Classification. In: Huang, T., Zeng, Z., Li, C., Leung, C.S. (eds) Neural Information Processing. ICONIP 2012. Lecture Notes in Computer Science, vol 7665. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34487-9_74
Download citation
DOI: https://doi.org/10.1007/978-3-642-34487-9_74
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34486-2
Online ISBN: 978-3-642-34487-9
eBook Packages: Computer ScienceComputer Science (R0)