Density estimation by stochastic complexity

J Rissanen, TP Speed, B Yu - IEEE Transactions on …, 1992 - ieeexplore.ieee.org
J Rissanen, TP Speed, B Yu
IEEE Transactions on Information Theory, 1992ieeexplore.ieee.org
The results by P. Hall and EJ Hannan (1988) on optimization of histogram density estimators
with equal bin widths by minimization of the stochastic complexity are extended and
sharpened in two separate ways. As the first contribution, two generalized histogram
estimators are constructed. The first has unequal bin widths which, together with the number
of the bins, are determined by minimization of the stochastic complexity using dynamic
programming. The other estimator consists of a mixture of equal bin width estimators, each …
The results by P. Hall and E.J. Hannan (1988) on optimization of histogram density estimators with equal bin widths by minimization of the stochastic complexity are extended and sharpened in two separate ways. As the first contribution, two generalized histogram estimators are constructed. The first has unequal bin widths which, together with the number of the bins, are determined by minimization of the stochastic complexity using dynamic programming. The other estimator consists of a mixture of equal bin width estimators, each of which is defined by the associated stochastic complexity. As the main contribution in the present work, two theorems are proved, which together extend the universal coding theorems to a large class of data generating densities. The first gives an asymptotic upper bound for the code redundancy in the order of magnitude, achieved with a special predictive type of histogram estimator, which sharpens a related bound. The second theorem states that this bound cannot be improved upon by any code whatsoever.< >
ieeexplore.ieee.org