Abstract
We consider distance-based similarity measures for real-valued vectors that are of interest in kernel-based machine learning algorithms, in particular a truncated Euclidean similarity measure and a self-normalized similarity measure related to the Canberra distance. We prove that both are positive semi-definite (p.s.d.), which permits their use in kernel-based methods such as the Support Vector Machine, a very popular machine learning tool. These kernels may be better suited than standard kernels (such as the RBF kernel) in certain situations, which are described in the paper. Some rather general results concerning positivity properties are presented in detail, together with some interesting ways of proving the p.s.d. property.
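As an illustration of the p.s.d. property, the following minimal sketch builds Gram matrices for two similarity measures of this kind and inspects their smallest eigenvalues. The abstract does not state the formulas, so the forms used here are assumptions: a one-dimensional truncated similarity max(0, 1 - |x - y| / width), a Canberra-type similarity 1 - |x - y| / (x + y) for strictly positive inputs, and aggregation over coordinates by averaging. The names `truncated_euclidean`, `canberra_similarity`, `gram_matrix`, and `width` are hypothetical.

```python
import numpy as np

def truncated_euclidean(x, y, width=1.0):
    # Assumed form: similarity decays linearly with |x - y| and is cut off at 0.
    return np.maximum(0.0, 1.0 - np.abs(x - y) / width)

def canberra_similarity(x, y):
    # Assumed form: one minus the Canberra term; requires x > 0 and y > 0.
    return 1.0 - np.abs(x - y) / (x + y)

def gram_matrix(kernel_1d, X):
    # Apply the 1-D similarity to every pair of rows, coordinate by coordinate,
    # then average over coordinates (an assumed way of combining them).
    pairwise = kernel_1d(X[:, None, :], X[None, :, :])  # shape (n, n, d)
    return pairwise.mean(axis=2)                        # shape (n, n)

# Synthetic, strictly positive data (illustrative only).
rng = np.random.default_rng(0)
X = rng.uniform(0.1, 3.0, size=(40, 5))

K_trunc = gram_matrix(truncated_euclidean, X)
K_canb = gram_matrix(canberra_similarity, X)

# A p.s.d. Gram matrix has no eigenvalue below zero, up to rounding error.
min_eig_trunc = np.linalg.eigvalsh(K_trunc).min()
min_eig_canb = np.linalg.eigvalsh(K_canb).min()
print("smallest eigenvalue (truncated Euclidean):", min_eig_trunc)
print("smallest eigenvalue (Canberra-type):", min_eig_canb)
```

A numerical check on one sample is only a sanity test and does not replace the proofs in the paper. A Gram matrix built this way could be supplied to a Support Vector Machine implementation that accepts precomputed kernels.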
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Belanche, L., Vázquez, J.L., Vázquez, M. (2008). Distance-Based Kernels for Real-Valued Data. In: Preisach, C., Burkhardt, H., Schmidt-Thieme, L., Decker, R. (eds) Data Analysis, Machine Learning and Applications. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78246-9_1
DOI: https://doi.org/10.1007/978-3-540-78246-9_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78239-1
Online ISBN: 978-3-540-78246-9
eBook Packages: Mathematics and Statistics (R0)