Abstract
The classical chi-squared goodness of fit test assumes the number of classes is fixed, meanwhile the test statistic has a limiting chi-square distribution under the null hypothesis. It is well known that the number of classes varying with sample size in the test has attached more and more attention. However, in this situation, there is not theoretical results for the asymptotic property of such chi-squared test statistic. This paper proves the consistency of chi-squared test with varying number of classes under some conditions. Meanwhile, the authors also give a convergence rate of Kolmogorov-Simirnov distance between the test statistic and corresponding chi-square distributed random variable. In addition, a real example and simulation results validate the reasonability of theoretical result and the superiority of chi-squared test with varying number of classes.
Similar content being viewed by others
References
Pearson K, On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling, Philosophical Magazine, 1900, 50(302): 157–175.
Cochran W G, The chi-square test of goodness of fit, The Annals of Mathematical Statistics, 1952, 23(3): 315–345.
Mann H B and Wald A, On the choice of the number of class intervals in the application of the chi-square test, The Annals of Mathematical Statistics, 1942, 13(3): 306–317.
Scott D W, On optimal and data-based histograms, Biometrika Trust, 1979, 66(3): 605–610.
Williams C A, On the choice of the number and width of classes for the chi-square test of goodness of fit, Journal of the American Statistical Association, 1950, 45(249): 77–86.
Rayner J C W and Best D J, The choice of class probabilities and number of classes for the simple X 2 goodness of fit test, The Indian Journal of Statistics, 1982, 44(1): 28–38.
Hamdan M A, The number and width of classes in the chi-square test, Journal of the American Statistical Association, 1963, 58(303): 678–689.
Dahiya R C and Gurland J, How many classes in the Pearson chi-square test, Journal of the American Statistical Association, 1973, 68(343): 707–712.
Kallenberg W C M, Oosterhoff J, and Schriever B F, The number of classes in chi-squared goodness of fit tests, Journal of the American Statistical Association, 1985, 80(392): 959–968.
Götze F and Tikhomirov A N, Asymptotic distribution of quadratic forms, The Annals of Probability, 1999, 27(2): 1072–1098.
Götze F and Tikhomirov A N, Asymptotic distribution of quadratic forms and applications, Journal of Theoretical Probability, 2002, 15(2): 423–475.
Noel C and Timothy R C R, Multinomial goodness of fit tests, Journal of the Royal Statistical Society, 1984, 46(3): 440–464.
Whittle P, On the convergence to normality of quadratic forms in independent variables, Theory of Probability and Its Applications, 1964, 9: 103–108.
Bentkus V, Dependence of the Berry-Esseen estimate on the dimension, Lithuanian Mathematical Journal, 1986, 26: 205–210.
Author information
Authors and Affiliations
Corresponding author
Additional information
This research was supported by the Natural Science Foundation of China under Grant Nos. 11071022, 11028103, 11231010, 11471223, BCMIIS and the Beijing Municipal Educational Commission Foundation under Grant Nos. KZ201410028030, KM201210028005, and Jishou University Subject in 2014 (No: 14JD035).
This paper was recommended for publication by Editor SUN Liuquan.
Rights and permissions
About this article
Cite this article
Huang, R., Cui, H. Consistency of chi-squared test with varying number of classes. J Syst Sci Complex 28, 439–450 (2015). https://doi.org/10.1007/s11424-015-3051-2
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11424-015-3051-2