Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/1642293.1642473guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Learning and inference over constrained output

Published: 30 July 2005 Publication History

Abstract

We study learning structured output in a discriminative framework where values of the output variables are estimated by local classifiers. In this framework, complex dependencies among the output variables are captured by constraints and dictate which global labels can be inferred. We compare two strategies, learning independent classifiers and inference based training, by observing their behaviors in different conditions. Experiments and theoretical justification lead to the conclusion that using inference based learning is superior when the local classifiers are difficult to learn but may require many examples before any discernible difference can be observed.

References

[1]
Anthony and Bartlett, 1999} M. Anthony and P. Bartlett. Neural Network Learning: Theoretical Foundations. Cambridge University Press, 1999.
[2]
{Carreras and Màrquez, 2003} X. Carreras and Lluís Màrquez. On-line learning via global feedback for phrase recognition. In Advances in Neural Information Processing Systems 15, 2003.
[3]
{Collins, 2002} M. Collins. Discriminative training methods for hidden Markov models: Theory and experiments with perceptron algorithms. In Proceedings of EMNLP, 2002.
[4]
{Har-Peled et al., 2003} S. Har-Peled, D. Roth, and D. Zimak. Constraint classification: A new approach to multiclass classification and ranking. In Advances in Neural Information Processing Systems 15, 2003.
[5]
{Kingsbury and Palmer, 2002} P. Kingsbury and M. Palmer. From Treebank to PropBank. In Proceedings of LREC, 2002.
[6]
{Lafferty et al., 2001} J. Lafferty, A. McCallum, and F. Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proc. ICML-01, pages 282-289, 2001.
[7]
{Marcus et al., 1993} M. P. Marcus, B. Santorini, and M. Marcinkiewicz. Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics, 19(2):313-330, June 1993.
[8]
{McCallum et al., 2000} A. McCallum, D. Freitag, and F. Pereira. Maximum entropy Markov models for information extraction and segmentation. In Proceedings of ICML-00, Stanford, CA, 2000.
[9]
{Punyakanok and Roth, 2001} V. Punyakanok and D. Roth. The use of classifiers in sequential inference. In Advances in Neural Information Processing Systems 13, 2001.
[10]
{Punyakanok et al., 2004} V. Punyakanok, D. Roth, W. Yih, and D. Zimak. Semantic role labeling via integer linear programming inference. In Proceedings of COLING-04, 2004.
[11]
{Ramshaw and Marcus, 1995} L. A. Ramshaw and M. P. Marcus. Text chunking using transformation-based learning. In Proceedings of the Third Annual Workshop on Very Large Corpora, 1995.
[12]
{Roth and Yih, 2004} D. Roth and W. Yih. A linear programming formulation for global inference in natural language tasks. In Proceedings of CoNLL-2004, pages 1-8, 2004.
[13]
{Taskar et al., 2004} B. Taskar, C. Guestrin, and D. Koller. Maxmargin markov networks. In Advances in Neural Information Processing Systems 16, 2004.

Cited By

View all
  • (2023)On regularization and inference with label constraintsProceedings of the 40th International Conference on Machine Learning10.5555/3618408.3619893(35740-35762)Online publication date: 23-Jul-2023
  • (2021)Long Horizon Forecasting with Temporal Point ProcessesProceedings of the 14th ACM International Conference on Web Search and Data Mining10.1145/3437963.3441740(571-579)Online publication date: 8-Mar-2021
  • (2018)Learning pipelines with limited data and domain knowledgeProceedings of the 32nd International Conference on Neural Information Processing Systems10.5555/3326943.3326957(140-151)Online publication date: 3-Dec-2018
  • Show More Cited By
  1. Learning and inference over constrained output

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Guide Proceedings
    IJCAI'05: Proceedings of the 19th international joint conference on Artificial intelligence
    July 2005
    1775 pages

    Sponsors

    • The International Joint Conferences on Artificial Intelligence, Inc.

    Publisher

    Morgan Kaufmann Publishers Inc.

    San Francisco, CA, United States

    Publication History

    Published: 30 July 2005

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 12 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)On regularization and inference with label constraintsProceedings of the 40th International Conference on Machine Learning10.5555/3618408.3619893(35740-35762)Online publication date: 23-Jul-2023
    • (2021)Long Horizon Forecasting with Temporal Point ProcessesProceedings of the 14th ACM International Conference on Web Search and Data Mining10.1145/3437963.3441740(571-579)Online publication date: 8-Mar-2021
    • (2018)Learning pipelines with limited data and domain knowledgeProceedings of the 32nd International Conference on Neural Information Processing Systems10.5555/3326943.3326957(140-151)Online publication date: 3-Dec-2018
    • (2015)SaulProceedings of the 24th International Conference on Artificial Intelligence10.5555/2832415.2832505(1844-1851)Online publication date: 25-Jul-2015
    • (2015)AD3The Journal of Machine Learning Research10.5555/2789272.278928816:1(495-545)Online publication date: 1-Jan-2015
    • (2015)Global machine learning for spatial ontology populationWeb Semantics: Science, Services and Agents on the World Wide Web10.1016/j.websem.2014.06.00130:C(3-21)Online publication date: 1-Jan-2015
    • (2012)Unsupervised learning on an approximate corpusProceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies10.5555/2382029.2382048(131-141)Online publication date: 3-Jun-2012
    • (2011)A joint model for extended semantic role labelingProceedings of the Conference on Empirical Methods in Natural Language Processing10.5555/2145432.2145447(129-139)Online publication date: 27-Jul-2011
    • (2011)PunctuationProceedings of the Fifteenth Conference on Computational Natural Language Learning10.5555/2018936.2018939(19-28)Online publication date: 23-Jun-2011
    • (2010)Knowing what to believe (when you already know something)Proceedings of the 23rd International Conference on Computational Linguistics10.5555/1873781.1873880(877-885)Online publication date: 23-Aug-2010
    • Show More Cited By

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media