Abstract
In this chapter we study the concept of support vector machines (SVMs) as developed by Vapnik and others. SVMs were first proposed as an alternative to neural networks, at a time when neural networks were not living up to the grand expectations that accompanied them. The SVM framework offers a precise mathematical approach to finding the optimal solution for classification or regression. We first study the original SVM theory, which addresses the problem of linear classification. We then see how it can be generalized to nonlinear problems through the use of kernels, and how it is extended to solve regression problems. SVM theory provides an elegant treatment of optimization and generalization and, more importantly, delivered results that neural network-based methods could only hope for at the time.
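As a concrete illustration of the three ideas mentioned above, the following minimal sketch (not taken from the chapter) uses scikit-learn, which the reference list points to, to fit a linear SVM classifier, a kernelized (nonlinear) classifier, and a support vector regressor. All dataset choices, parameter values, and printed scores below are illustrative assumptions rather than the chapter's own examples.

# Minimal sketch, assuming scikit-learn is installed; not the chapter's code.
from sklearn.datasets import make_moons, make_regression
from sklearn.svm import SVC, SVR

# Linear classification: separate two classes with a maximum-margin hyperplane.
X, y = make_moons(n_samples=200, noise=0.2, random_state=0)
linear_clf = SVC(kernel="linear", C=1.0).fit(X, y)
print("linear kernel accuracy:", linear_clf.score(X, y))

# Nonlinear classification: the RBF kernel implicitly maps the data into a
# higher-dimensional space where a linear separator may exist.
rbf_clf = SVC(kernel="rbf", gamma=1.0, C=1.0).fit(X, y)
print("RBF kernel accuracy:", rbf_clf.score(X, y))

# Regression: support vector regression fits a function within an epsilon-tube.
Xr, yr = make_regression(n_samples=200, n_features=3, noise=5.0, random_state=0)
reg = SVR(kernel="rbf", C=10.0, epsilon=0.1).fit(Xr, yr)
print("SVR R^2 on training data:", reg.score(Xr, yr))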
Notes
- 1.
A hyperplane is a generic term for a linear (flat) surface in n-dimensional space. In one-dimensional space it is a point; in two-dimensional space it is a line; and in three-dimensional space it is an ordinary plane. The same concept extends to higher dimensions, where the geometric manifold cannot be visualized directly but can still be modelled mathematically (see the equations after these notes).
- 2.
Sometimes this is also called the kernel trick, although it is far more than a simple trick. A function needs to satisfy certain properties in order to be called a kernel function; a common example is given after these notes. For more details on kernel functions, refer to [55].
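In standard notation (not reproduced from the chapter), the separating hyperplane of note 1 and one commonly used kernel function can be written as
\[
\{\, x \in \mathbb{R}^{n} : w^{\top} x + b = 0 \,\}, \qquad
K(x, x') = \exp\!\left(-\gamma \lVert x - x' \rVert^{2}\right),
\]
where \(w\) is the normal vector of the hyperplane, \(b\) its offset, and \(\gamma > 0\) the width parameter of the radial basis function (RBF) kernel; the RBF kernel satisfies the positive-definiteness (Mercer) property required of a kernel function.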
References
Real world datasets, scikit-learn documentation. https://scikit-learn.org/stable/datasets/realworld.html
V. N. Vapnik, The Nature of Statistical Learning Theory, 2nd edn. (Springer, New York, 1995).
V. N. Vapnik and A. Y. Lerner, Pattern Recognition Using Generalized Portraits. Automation and Remote Control, 24, 1963.
O. Chapelle, J. Weston, L. Bottou, and V. Vapnik, Vicinal Risk Minimization. NIPS, 2000.
V. Vapnik, Principles of Risk Minimization for Learning Theory. NIPS, 1991.
Appendix
The new Lagrangian to be minimized adds a hinge-loss penalty term to the margin objective. Here, the \(\max\) function clips the loss at zero, so points that are classified correctly with the required margin contribute nothing to the objective; only margin violations are penalized.
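A standard soft-margin objective of this form (the exact constants and sign conventions used in the chapter may differ) is
\[
L(w, b) = \frac{1}{2}\,\lVert w \rVert^{2} + C \sum_{i=1}^{m} \max\!\bigl(0,\; 1 - y_{i}\,(w^{\top} x_{i} + b)\bigr),
\]
where \(C > 0\) controls the trade-off between a wide margin and the penalty for margin violations, and \((x_{i}, y_{i})\), \(i = 1, \ldots, m\), are the training samples with labels \(y_{i} \in \{-1, +1\}\).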
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Joshi, A.V. (2023). Support Vector Machines. In: Machine Learning and Artificial Intelligence. Springer, Cham. https://doi.org/10.1007/978-3-031-12282-8_8
DOI: https://doi.org/10.1007/978-3-031-12282-8_8
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-12281-1
Online ISBN: 978-3-031-12282-8
eBook Packages: Computer Science, Computer Science (R0)