Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1390156.1390281acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlConference Proceedingsconference-collections
research-article

Tailoring density estimation via reproducing kernel moment matching

Published: 05 July 2008 Publication History

Abstract

Moment matching is a popular means of parametric density estimation. We extend this technique to nonparametric estimation of mixture models. Our approach works by embedding distributions into a reproducing kernel Hilbert space, and performing moment matching in that space. This allows us to tailor density estimators to a function class of interest (i.e., for which we would like to compute expectations). We show our density estimation approach is useful in applications such as message compression in graphical models, and image classification and retrieval.

References

[1]
Altun, Y., & Smola, A. (2006). Unifying divergence minimization and statistical inference via convex duality. In COLT 2006.
[2]
Barndorff-Nielsen, O. E. (1978). Information and Exponential Families in Statistical Theory.
[3]
Bartlett, P. L., & Mendelson, S. (2002). Rademacher and Gaussian complexities: Risk bounds and structural results. JMLR, 3, 463--482.
[4]
de Farias, N., & Roy, B. (2004). On constraint sampling in the linear programming approach to approximate dynamic programming. Math. Oper. Res., 29(3).
[5]
Doucet, A., de Freitas, N., & Gordon, N. (2001). Sequential Monte Carlo Methods in Practice. Springer-Verlag.
[6]
Dudík, M., Phillips, S., & Schapire, R. (2004). Performance guarantees for regularized maximum entropy density estimation. In COLT 2004.
[7]
DuMouchel, W., Volinsky, C., Cortes, C., Pregibon, D., & Johnson, T. (1999). Squashing flat files flatter. In KDD 1999.
[8]
Girolami, M., & He, C. (2003). Probability density estimation from optimally condensed data samples. IEEE TPAMI, 25(10), 1253--1264.
[9]
Greenspan, H., Dvir, G., & Rubner, Y. (2002). Context-based image modeling. In ICPR 2002.
[10]
Gretton, A., Borgwardt, K., Rasch, M., Schölkopf, B., & Smola, A. (2007). A kernel method for the two-sample-problem. In NIPS 2007.
[11]
Jebara, T., Kondor, R., & Howard, A. (2004). Probability product kernels. JMLR, 5, 819--844.
[12]
McLachlan, G., & Basford, K. (1988). Mixture Models: Inference and Applications to Clustering. Marcel Dekker.
[13]
Minka, T. (2001). Expectation Propagation for approximate Bayesian inference. Ph.D. thesis, MIT.
[14]
Parzen, E. (1962). On estimation of a probability density function and mode. Ann. Math. Stat., 33, 1065--1076.
[15]
Rubner, Y., Tomasi, C., & Guibas, L. (2000). The earth mover's distance as a metric for image retrieval. Intl. J. Computer Vision, 40(2), 99--121.
[16]
Shawe-Taylor, J., & Dolia, A. (2007). A framework for probability density estimation. In AISTATS 2007.
[17]
Silverman, B. W. (1986). Density estimation for statistical and data analysis. Monographs on statistics and applied probability. Chapman and Hall.
[18]
Smola, A., Gretton, A., Song, L., & Schöölkopf, B. (2007). A Hilbert space embedding for distributions. In ALT 2007.
[19]
Steinwart, I. (2002). The influence of the kernel on the consistency of support vector machines. JMLR, 2, 463--482.
[20]
Vapnik, V., & Mukherjee, S. (2000). Support vector method for multivariate density estimation. In NIPS 12.
[21]
Wainwright, M. J., & Jordan, M. I. (2003). Graphical models, exponential families, and variational inference. Tech. Rep. 649, UC Berkeley.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICML '08: Proceedings of the 25th international conference on Machine learning
July 2008
1310 pages
ISBN:9781605582054
DOI:10.1145/1390156
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • Pascal
  • University of Helsinki
  • Xerox
  • Federation of Finnish Learned Societies
  • Google Inc.
  • NSF
  • Machine Learning Journal/Springer
  • Microsoft Research: Microsoft Research
  • Intel: Intel
  • Yahoo!
  • Helsinki Institute for Information Technology
  • IBM: IBM

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 July 2008

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article

Funding Sources

Conference

ICML '08
Sponsor:
  • Microsoft Research
  • Intel
  • IBM

Acceptance Rates

Overall Acceptance Rate 140 of 548 submissions, 26%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)28
  • Downloads (Last 6 weeks)1
Reflects downloads up to 30 Aug 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Unbalanced Optimal Transport and Maximum Mean Discrepancies: Interconnections and Rapid EvaluationJournal of Scientific Computing10.1007/s10915-024-02586-2100:3Online publication date: 1-Sep-2024
  • (2024)A non-classical parameterization for density estimation using sample momentsStatistical Papers10.1007/s00362-024-01563-zOnline publication date: 20-May-2024
  • (2024)Minimum Kernel Discrepancy EstimatorsMonte Carlo and Quasi-Monte Carlo Methods10.1007/978-3-031-59762-6_6(133-161)Online publication date: 13-Jul-2024
  • (2023)Variational weighting for kernel density ratiosProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3666344(5010-5027)Online publication date: 10-Dec-2023
  • (2023)Parametric and Multivariate Uncertainty Calibration for Regression and Object DetectionComputer Vision – ECCV 2022 Workshops10.1007/978-3-031-25072-9_30(426-442)Online publication date: 18-Feb-2023
  • (2022) Discovering causal structure with reproducing-kernel Hilbert space ε Chaos: An Interdisciplinary Journal of Nonlinear Science10.1063/5.006282932:2(023103)Online publication date: Feb-2022
  • (2022)Predicting the hydrodynamic properties of a bioreactor: Conditional density estimation as a surrogate model for CFD simulationsChemical Engineering Research and Design10.1016/j.cherd.2022.03.042182(342-359)Online publication date: Jun-2022
  • (2020)Predicting Pharmaceutical Particle Size Distributions Using Kernel Mean EmbeddingPharmaceutics10.3390/pharmaceutics1203027112:3(271)Online publication date: 16-Mar-2020
  • (2019)High-Order Sequential Simulation via Statistical Learning in Reproducing Kernel Hilbert SpaceMathematical Geosciences10.1007/s11004-019-09843-3Online publication date: 7-Dec-2019
  • (2019)Near-Optimal Coresets of Kernel Density EstimatesDiscrete & Computational Geometry10.1007/s00454-019-00134-6Online publication date: 25-Sep-2019
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media