Joint Semi-supervised Similarity Learning for Linear Classification

Nicolae, Maria-Irina; Gaussier, Éric; Habrard, Amaury; Sebban, Marc

doi:10.1007/978-3-319-23528-8_37

Maria-Irina Nicolae^10,11,
Éric Gaussier¹¹,
Amaury Habrard¹⁰ &
…
Marc Sebban¹⁰

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9284))

Included in the following conference series:

Joint European Conference on Machine Learning and Knowledge Discovery in Databases

Abstract

The importance of metrics in machine learning has attracted a growing interest for distance and similarity learning. We study here this problem in the situation where few labeled data (and potentially few unlabeled data as well) is available, a situation that arises in several practical contexts. We also provide a complete theoretical analysis of the proposed approach. It is indeed worth noting that the metric learning research field lacks theoretical guarantees that can be expected on the generalization capacity of the classifier associated to a learned metric. The theoretical framework of $(\epsilon , \gamma , \tau )$-good similarity functions [1] has been one of the first attempts to draw a link between the properties of a similarity function and those of a linear classifier making use of it. In this paper, we extend this theory to a method where the metric and the separator are jointly learned in a semi-supervised way, setting that has not been explored before, and provide a theoretical analysis of this joint learning via Rademacher complexity. Experiments performed on standard datasets show the benefits of our approach over state-of-the-art methods.

Download to read the full chapter text

Chapter PDF

Algorithmic Robustness for Semi-Supervised $$(\epsilon , \gamma , \tau )$$ -Good Metric Learning

Generalization bounds for metric and similarity learning

Article 20 June 2015

UNIT: A unified metric learning framework based on maximum entropy regularization

Article 26 July 2023

Keywords

References

Balcan, M.-F., Blum, A., Srebro, N.: Improved guarantees for learning via similarity functions. In: COLT, pp. 287–298. Omnipress (2008)
Google Scholar
Bao, J.-P., Shen, J.-Y., Liu, X.-D., Liu, H.-Y.: Quick asymmetric text similarity measures. ICMLC 1, 374–379 (2003)
Google Scholar
Baoli, L., Qin, L., Shiwen, Y.: An adaptive k-nearest neighbor text categorization strategy. ACM TALIP (2004)
Google Scholar
Bellet, A., Habrard, A., Sebban, M.: Similarity learning for provably accurate sparse linear classification. In: ICML, pp. 1871–1878 (2012)
Google Scholar
Bellet, A., Habrard, A., Sebban, M.: A survey on metric learning for feature vectors and structured data. arXiv preprint arXiv:1306.6709 (2013)
Bellet, A., Habrard, A., Sebban, M.: Metric Learning. Synthesis Lectures on Artificial Intelligence and Machine Learning. Morgan & Claypool Publishers (2015)
Google Scholar
Boucheron, S., Bousquet, O., Lugosi, G.: Theory of classification : a survey of some recent advances. ESAIM: Probability and Statistics 9, 323–375 (2005)
Google Scholar
Bousquet, O., Elisseeff, A.: Stability and generalization. JMLR 2, 499–526 (2002)
MathSciNet MATH Google Scholar
Davis, J.V., Kulis, B., Jain, P., Sra, S., Dhillon, I.S.: Information-theoretic metric learning. In: ICML, pp. 209–216. ACM, New York (2007)
Google Scholar
Diligenti, M., Maggini, M., Rigutini, L.: Learning similarities for text documents using neural networks. In: ANNPR (2003)
Google Scholar
Freund, Y., Schapire, R.E.: Large margin classification using the perceptron algorithm. Machine Learning 37(3), 277–296 (1999)
Article MATH Google Scholar
Grabowski, M., Szałas, A.: A Technique for Learning Similarities on Complex Structures with Applications to Extracting Ontologies. In: Szczepaniak, P.S., Kacprzyk, J., Niewiadomski, A. (eds.) AWIC 2005. LNCS (LNAI), vol. 3528, pp. 183–189. Springer, Heidelberg (2005)
Chapter Google Scholar
Guo, Z.-C., Ying, Y.: Guaranteed classification via regularized similarity learning. CoRR, abs/1306.3108 (2013)
Google Scholar
Hoi, S.C.H., Liu, W., Chang, S.-F.: Semi-supervised distance metric learning for collaborative image retrieval. In: CVPR (2008)
Google Scholar
Hoi, S.C.H., Liu, W., Chang, S.-F.: Semi-supervised distance metric learning for collaborative image retrieval and clustering. TOMCCAP 6(3) (2010)
Google Scholar
Hust, A.: Learning Similarities for Collaborative Information Retrieval. In: Machine Learning and Interaction for Text-Based Information Retrieval Workshop, TIR 2004, pp. 43–54 (2004)
Google Scholar
Ledoux, M., Talagrand, M.: Probability in Banach Spaces: Isoperimetry and Processes. Springer, New York (1991)
Google Scholar
Nicolae, M.-I., Sebban, M., Habrard, A., Gaussier, É., Amini, M.: Algorithmic robustness for learning via ($\epsilon $, $\gamma $, $\tau $)-good similarity functions. CoRR, abs/1412.6452 (2014)
Google Scholar
Niu, G., Dai, B., Yamada, M., Sugiyama, M.: Information-theoretic semi-supervised metric learning via entropy regularization. In: ICML. Omnipress (2012)
Google Scholar
Qamar, A.M., Gaussier, É.: Online and batch learning of generalized cosine similarities. In: ICDM, pp. 926–931 (2009)
Google Scholar
Qamar, A.M., Gaussier, É., Chevallet, J., Lim, J.: Similarity learning for nearest neighbor classification. In: ICDM, pp. 983–988 (2008)
Google Scholar
Shalev-Shwartz, S., Singer, Y., Ng, A.Y.: Online and batch learning of pseudo-metrics. In: ICML. ACM, New York (2004)
Google Scholar
Weinberger, K., Saul, L.: Fast solvers and efficient implementations for distance metric learning. In: ICML, pp. 1160–1167. ACM (2008)
Google Scholar
Weinberger, K., Saul, L.: Distance metric learning for large margin nearest neighbor classification. JMLR 10, 207–244 (2009)
MATH Google Scholar
Xing, E.P., Ng, A.Y., Jordan, M.I., Russell, S.: Distance metric learning, with application to clustering with side-information. NIPS 15, 505–512 (2002)
Google Scholar
Xu, H., Mannor, S.: Robustness and generalization. In: COLT, pp. 503–515 (2010)
Google Scholar
Xu, H., Mannor, S.: Robustness and generalization. Machine Learning 86(3), 391–423 (2012)
Article MathSciNet MATH Google Scholar
Zha, Z.-J., Mei, T., Wang, M., Wang, Z., Hua, X.-S.: Robust distance metric learning with auxiliary knowledge. In: IJCAI, pp. 1327–1332 (2009)
Google Scholar
Zhu, J., Rosset, S., Hastie, T., Tibshirani, R.: 1-norm support vector machines. In: NIPS, page 16. MIT Press (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

Université Jean Monnet, Laboratoire Hubert Curien, Saint-Étienne, France
Maria-Irina Nicolae, Amaury Habrard & Marc Sebban
Université Grenoble Alpes, CNRS-LIG/AMA, Saint-Martin-d’Héres, France
Maria-Irina Nicolae & Éric Gaussier

Authors

Maria-Irina Nicolae
View author publications
You can also search for this author in PubMed Google Scholar
Éric Gaussier
View author publications
You can also search for this author in PubMed Google Scholar
Amaury Habrard
View author publications
You can also search for this author in PubMed Google Scholar
Marc Sebban
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Maria-Irina Nicolae .

Editor information

Editors and Affiliations

University of Bari Aldo Moro, Bari, Italy
Annalisa Appice
University of Porto, Porto, Portugal
Pedro Pereira Rodrigues
University of Porto - CRACS/INESC TEC, Porto, Portugal
Vítor Santos Costa
University of Porto - INESC TEC, Porto, Portugal
Carlos Soares
University of Porto - INESC TEC, Porto, Portugal
João Gama
University of Porto - INESC TEC, Porto, Portugal
Alípio Jorge

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Nicolae, MI., Gaussier, É., Habrard, A., Sebban, M. (2015). Joint Semi-supervised Similarity Learning for Linear Classification. In: Appice, A., Rodrigues, P., Santos Costa, V., Soares, C., Gama, J., Jorge, A. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2015. Lecture Notes in Computer Science(), vol 9284. Springer, Cham. https://doi.org/10.1007/978-3-319-23528-8_37

Download citation

DOI: https://doi.org/10.1007/978-3-319-23528-8_37
Published: 29 August 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23527-1
Online ISBN: 978-3-319-23528-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Joint Semi-supervised Similarity Learning for Linear Classification

Abstract

Chapter PDF

Similar content being viewed by others

Algorithmic Robustness for Semi-Supervised $$(\epsilon , \gamma , \tau )$$ -Good Metric Learning

Generalization bounds for metric and similarity learning

UNIT: A unified metric learning framework based on maximum entropy regularization

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Joint Semi-supervised Similarity Learning for Linear Classification

Abstract

Chapter PDF

Similar content being viewed by others

Algorithmic Robustness for Semi-Supervised $$(\epsilon , \gamma , \tau )$$ -Good Metric Learning

Generalization bounds for metric and similarity learning

UNIT: A unified metric learning framework based on maximum entropy regularization

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation