Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Bounds on the Sample Complexity of Bayesian Learning Using Information Theory and the VC Dimension

Published: 01 January 1994 Publication History
  • Get Citation Alerts
  • Abstract

    In this paper we study a Bayesian or average-case model of concept learning with a twofold goal: to provide more precise characterizations of learning curve (sample complexity) behavior that depend on properties of both the prior distribution over concepts and the sequence of instances seen by the learner, and to smoothly unite in a common framework the popular statistical physics and VC dimension theories of learning curves. To achieve this, we undertake a systematic investigation and comparison of two fundamental quantities in learning and information theory: the probability of an incorrect prediction for an optimal learning algorithm, and the Shannon information gain. This study leads to a new understanding of the sample complexity of learning in several existing models.

    Cited By

    View all
    • (2017)Risk upper bounds for general ensemble methods with an application to multiclass classificationNeurocomputing10.1016/j.neucom.2016.09.016219:C(15-25)Online publication date: 5-Jan-2017
    • (2011)A general system for automatic biomedical image segmentation using intensity neighborhoodsJournal of Biomedical Imaging10.5555/1992576.20433162011(1-12)Online publication date: 1-Jan-2011
    • (2010)Stochastic refinementProceedings of the 20th international conference on Inductive logic programming10.5555/2022735.2022763(222-237)Online publication date: 27-Jun-2010
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Machine Language
    Machine Language  Volume 14, Issue 1
    Special issue on computational learning theory
    Jan. 1994
    125 pages
    ISSN:0885-6125
    Issue’s Table of Contents

    Publisher

    Kluwer Academic Publishers

    United States

    Publication History

    Published: 01 January 1994

    Author Tags

    1. Bayesian learning
    2. VC dimension
    3. average-case learning
    4. information theory
    5. learning curves
    6. statistical physics

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 26 Jul 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2017)Risk upper bounds for general ensemble methods with an application to multiclass classificationNeurocomputing10.1016/j.neucom.2016.09.016219:C(15-25)Online publication date: 5-Jan-2017
    • (2011)A general system for automatic biomedical image segmentation using intensity neighborhoodsJournal of Biomedical Imaging10.5555/1992576.20433162011(1-12)Online publication date: 1-Jan-2011
    • (2010)Stochastic refinementProceedings of the 20th international conference on Inductive logic programming10.5555/2022735.2022763(222-237)Online publication date: 27-Jun-2010
    • (2010)Bayesian active learning using arbitrary binary valued queriesProceedings of the 21st international conference on Algorithmic learning theory10.5555/1893193.1893204(50-58)Online publication date: 6-Oct-2010
    • (2010)Model Selection: Beyond the Bayesian/Frequentist DivideThe Journal of Machine Learning Research10.5555/1756006.175600911(61-87)Online publication date: 1-Mar-2010
    • (2008)QG/GAMachine Language10.1007/s10994-007-5029-370:2-3(121-133)Online publication date: 1-Mar-2008
    • (2007)Estimating the size of neural networks from the number of available training dataProceedings of the 17th international conference on Artificial neural networks10.5555/1776814.1776823(68-77)Online publication date: 9-Sep-2007
    • (2006)A hybrid bayesian optimal classifier based on neuro-fuzzy logicProceedings of the Second international conference on Advances in Natural Computation - Volume Part I10.1007/11881070_49(341-350)Online publication date: 24-Sep-2006
    • (2003)Distinctive Features of Minimization of a Risk Functional in Mass Data SetsCybernetics and Systems Analysis10.1023/B:CASA.0000003500.91054.5b39:4(501-508)Online publication date: 1-Jul-2003
    • (2002)On the complexity of computing and learning with multiplicative neural networksNeural Computation10.1162/0899766025274112114:2(241-301)Online publication date: 1-Feb-2002
    • Show More Cited By

    View Options

    View options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media