Margin-Based Active Learning for Structured Output Spaces

Roth, Dan; Small, Kevin

doi:10.1007/11871842_40

Dan Roth²¹ &
Kevin Small²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4212))

Included in the following conference series:

European Conference on Machine Learning

6333 Accesses
43 Citations

Abstract

In many complex machine learning applications there is a need to learn multiple interdependent output variables, where knowledge of these interdependencies can be exploited to improve the global performance. Typically, these structured output scenarios are also characterized by a high cost associated with obtaining supervised training data, motivating the study of active learning for these situations. Starting with active learning approaches for multiclass classification, we first design querying functions for selecting entire structured instances, exploring the tradeoff between selecting instances based on a global margin or a combination of the margin of local classifiers. We then look at the setting where subcomponents of the structured instance can be queried independently and examine the benefit of incorporating structural information in such scenarios. Empirical results on both synthetic data and the semantic role labeling task demonstrate a significant reduction in the need for supervised training data when using the proposed methods.

Download to read the full chapter text

Chapter PDF

Hierarchical Active Learning with Proportion Feedback on Regions

Active learning for hierarchical multi-label classification

Article 17 July 2020

Active Learning and Crowdsourcing: A Survey of Optimization Methods for Data Labeling

Article 01 November 2018

References

Carreras, X., Màrquez, L.: Introduction to the CoNLL-2004 shared tasks: Semantic role labeling. In: Proc. of the Conference on Computational Natural Language Learning (CoNLL) (2004)
Google Scholar
Collins, M.: Discriminative training methods for hidden markov models: Theory and experiments with perceptron algorithms. In: Proc. of the Conference on Empirical Methods in Natural Language Processing (EMNLP) (2002)
Google Scholar
Punyakanok, V., Roth, D., Yih, W., Zimak, D.: Learning and inference over constrained output. In: Proc. of the International Joint Conference on Artificial Intelligence (IJCAI), pp. 1124–1129 (2005)
Google Scholar
Tong, S., Koller, D.: Support vector machine active learning with applications to text classification. Journal of Machine Learning Research 2, 45–66 (2001)
Article Google Scholar
Yan, R., Yang, J., Hauptmann, A.: Automatically labeling video data using multi-class active learning. In: Proc. of the International Conference on Computer Vision (ICCV), pp. 516–523 (2003)
Google Scholar
Daumé III, H., Marcu, D.: Learning as search optimization: Approximate large margin methods for structured prediction. In: Proc. of the International Conference on Machine Learning (ICML) (2005)
Google Scholar
Har-Peled, S., Roth, D., Zimak, D.: Constraint classification for multiclass classification and ranking. In: The Conference on Advances in Neural Information Processing Systems (NIPS), pp. 785–792 (2003)
Google Scholar
Tsochantaridis, I., Hofmann, T., Joachims, T., Altun, Y.: Support vector machine learning for interdependent and structured output spaces. In: Proc. of the International Conference on Machine Learning (ICML), pp. 823–830 (2004)
Google Scholar
Punyakanok, V., Roth, D., Yih, W., Zimak, D.: Semantic role labeling via integer linear programming inference. In: Proc. the International Conference on Computational Linguistics (COLING) (2004)
Google Scholar
Thompson, C.A., Califf, M.E., Mooney, R.J.: Active learning for natural language parsing and information extraction. In: Proc. of the International Conference on Machine Learning (ICML), pp. 406–414 (1999)
Google Scholar
Hwa, R.: Sample selection for statistical grammar induction. In: Proc. of the Conference on Empirical Methods in Natural Language Processing (EMNLP) (2000)
Google Scholar
Baldridge, J., Osbourne, M.: Active learning for HPSG parse selection. In: Proc. of the Annual Meeting of the North American Association of Computational Linguistics (NAACL), pp. 17–24 (2003)
Google Scholar
Scheffer, T., Wrobel, S.: Active learning of partially hidden markov models. In: Hoffmann, F., Adams, N., Fisher, D., Guimarães, G., Hand, D.J. (eds.) IDA 2001. LNCS, vol. 2189, p. 309. Springer, Heidelberg (2001)
Chapter Google Scholar
Anderson, B., Moore, A.: Active learning for hidden markov models: Objective functions and algorithms. In: Proc. of the International Conference on Machine Learning (ICML) (2005)
Google Scholar
Culotta, A., McCallum, A.: Reducing labeling effort for stuctured prediction tasks. In: Proceedings of the National Conference on Artificial Intelligence (AAAI) (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Illinois at Urbana-Champaign, 201 N. Goodwin Avenue, Urbana, IL, 61801, USA
Dan Roth & Kevin Small

Authors

Dan Roth
View author publications
You can also search for this author in PubMed Google Scholar
Kevin Small
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Knowledge Engineering Group, Technische Universität Darmstadt,
Johannes Fürnkranz
Max Planck Institute for Computer Science, Saarbrücken, Germany
Tobias Scheffer
Faculty of Computer Science, Otto-von-Guericke-University Magdeburg, Germany
Myra Spiliopoulou

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Roth, D., Small, K. (2006). Margin-Based Active Learning for Structured Output Spaces. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds) Machine Learning: ECML 2006. ECML 2006. Lecture Notes in Computer Science(), vol 4212. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11871842_40

Download citation

DOI: https://doi.org/10.1007/11871842_40
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45375-8
Online ISBN: 978-3-540-46056-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Margin-Based Active Learning for Structured Output Spaces

Abstract

Chapter PDF

Similar content being viewed by others

Hierarchical Active Learning with Proportion Feedback on Regions

Active learning for hierarchical multi-label classification

Active Learning and Crowdsourcing: A Survey of Optimization Methods for Data Labeling

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Margin-Based Active Learning for Structured Output Spaces

Abstract

Chapter PDF

Similar content being viewed by others

Hierarchical Active Learning with Proportion Feedback on Regions

Active learning for hierarchical multi-label classification

Active Learning and Crowdsourcing: A Survey of Optimization Methods for Data Labeling

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation