Abstract
Selecting a statistical model from a set of competing models is a central issue in the scientific task, and the Bayesian approach to model selection is based on the posterior model distribution, a quantification of the updated uncertainty on the entertained models. We present a Bayesian procedure for choosing a family between the Poisson and the geometric families and prove that the procedure is consistent with rate \(O(a^{n})\), \(a>1\), where a is a function of the parameter of the true model. An extension of this procedure to the multiple testing Poisson and negative binomial with r successes for \(r=1,\ldots ,L\) is also proved to be consistent with exponential rate. For small sample sizes, a simulation study indicates that the model selection between the above distributions is made with large uncertainty when sampling from a specific subset of distributions. This difficulty is however mitigated by the large consistency rate of the procedure.
Similar content being viewed by others
References
Atkinson AC (1970) A method for discriminating between models. J R Stat Soc Ser B 32:323–344
Bayarri MJ, Berger JO, Forte A, García-Donato G (2012) Criteria for Bayesian model choice with application to variable selection. Ann Stat 40:1550–1577
Berger JO, Pericchi LR (1996) The intrinsic Bayes factor for linear models (with discussion). In: Bernardo JM et al (eds) Bayesian statistics 5. Oxford University Press, New York, pp 25–44
Berger JO, Pericchi L (1996) The intrinsic Bayes factor for model selection and prediction. J Am Statist Assoc 91:109–122
Casella G, Girón FJ, Martínez ML, Moreno E (2009) Consistency of Bayesian procedures for variable selection. Ann Stat 37:1207–1228
Casella G, Moreno E, Girón FJ (2014) Cluster analysis, model selection, and prior distributions on models. Bayesian Anal 9(3):613–658
Cox DR (1961) Tests of separate families of hypotheses. In: Proccedings 4th Berkeley symposium in mathematical statistics and probability, University of California Press, vol 1, pp 105–123
Cox DR (2013) A return to an old paper: Tests of separate families of hypotheses (with discussion). J R Statist Soc Ser B 75:207–215
Cox DR, Hinkley DV (1974) Theoretical statistics. Chapman and Hall, London
Dauxois JY, Druihet P, Pommeret D (2006) A Bayesian choice between Poisson. Binomial and Negative Binomial models. TEST 15:423–432
Dawid AP (2011) Posterior model probabilities. In: Bandyopadhyay S, Forster M (eds) Philosophy of statistics. Elsevier, New York, pp 607–630
Dawid AP, Musio M (2017) Columbu, S, A note on Bayesian model selection for discrete data using proper scoring rules. Statist Probab Lett 129:101–106
Fouskakis D, Ntzoufras I, Draper D (2015) Power-expected-posterior priors for variable selection in gaussian linear models. Bayesian Anal 10(1):75–107
Klugman SA, Panjer HH, Willmot GE (2008) Loss models: from data to decisions, 3rd edn. Wiley, New York
Liang F, Paulo R, Molina G, Clyde M, Berger JO (2008) Mixtures of \(g\)-priors for Bayesian variable selection. J Am Statist Assoc 103:410–423
Moreno E, Bertolino F, Racugno W (1998) An Intrinsic limiting procedure for model selection and hypothesis testing. J Am Statist Assoc 93:1451–1460
Moreno E (2005) Objective Bayesian analysis for one-sided testing. Test 14:181–198
Moreno E, Girón FJ (2005) Consistency of Bayes factors for linear models. CR Acad Sci Paris Ser I 340:911–914
Moreno E, Girón FJ, Casella G (2010) Consistency of objective Bayes factors as the model dimension grows. Ann Statist 38:1937–1952
Moreno E, Girón FJ, Casella G (2015) Posterior model consistency in variable selection as the model dimension grows. Statist Sci 30(2):228–241
O’Hagan A (1994) Bayesian inference. Kendall’s Advanced Theory of Statistic, vol 2B. Edward Arnold, London
Pérez JM, Berger JO (2002) Expected-posterior prior distributions for model selection. Biometrika 89:491–512
Pradhan B, Kundu D (2016) A choice between Poisson and geometric distribution. J Indian Soc Probab Stat 17:111–123
Scott JG, Berger JO (2010) Bayes and empirical Bayes multiplicity adjustment in the variable selection problem. Ann Stat 38(5):2587–2619
Tröbliger A (1961) Mathematische untersuchungen zur Beitragsrückgewähr in der Kraft-reiseversicherung. Blatter der Deutsche Gesellschaft für Versicherungsmathematik 5:327–348
Zellner A, Siow A (1980) Posterior odds ratios for selected regression hypotheses. TEST 31(1):585–603
Zellner A (1986) On assessing prior distributions and Bayesian regression analysis with \(g\)-prior distributions. In: Goel PK, Zellner A (eds) Bayesian inference and decision techniques: essays in honor of bruno de finetti. North-Holland, Amsterdam, pp 233–243
Acknowledgements
E. Moreno and F.J. Vázquez–Polo was partially funded by Grant ECO2017–85577–P from the Spanish Ministry of Economy and Competitiveness. The authors thank the Associate Editor and two referees for their suggestions which have improved the original manuscript.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
EM and FJVP partially funded by Grant ECO2017–85577–P from the Spanish Ministry of Economy and Competitiveness (MINECO, Spain). The authors thank the Associate Editor and two referees for their suggestions which have improved the original manuscript.
Rights and permissions
About this article
Cite this article
Moreno, E., Martínez, C. & Vázquez–Polo, F. Objective Bayesian model choice for non-nested families: the case of the Poisson and the negative binomial. TEST 30, 255–273 (2021). https://doi.org/10.1007/s11749-020-00717-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11749-020-00717-z
Keywords
- Bayesian model selection
- Test for separate families
- Consistency
- Sampling behavior for small sample sizes
- Rate of convergence