Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Support Vector Machines

  • Chapter
  • First Online:
Machine Learning and Artificial Intelligence

Abstract

In this chapter we are going to study the concept of support vector machines as developed by Vapnik and others. The concept was first proposed as an alternative to neural networks, when neural networks were not performing up to the grand expectations that they came with. SVM proposed a very targeted mathematical approach towards finding the optimal solution in case of classification or regression. We will first study the original SVM theory that tries to solve the problem of linear classification. Then, we will see how it can be further generalized for nonlinear problems with use of kernels and also how it is extended for solving the problems of regression. The theory of SVM proposed an elegant solution towards optimization and generalization and more importantly was extremely successful in getting results that neural network-based methods only hoped for at the time.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 59.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 79.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    A hyperplane is a generic term used to represent a linear plane in n dimensions. In one-dimensional space, it is a dot; in two-dimensional space, it is a line; and in three-dimensional space, it is a regular plane. The same concept can be extended to higher dimensions where the geometric manifold cannot be directly visualized, but can be mathematically modelled.

  2. 2.

    Sometimes this is also called as kernel trick, although this is far more than a simple trick. A function needs to satisfy certain properties in order to be able called as kernel function. For more details on kernel functions, refer to [55].

References

  1. Real World Datasets from Sklearn https://scikit-learn.org/stable/datasets/realworld.html

  2. Vladimir N. Vapnik, The Nature of Statistical Learning Theory, 2nd edn. (Springer, New York, 1995).

    Google Scholar 

  3. V. N. Vapnik and A. Y. Lerner Pattern Recognition using Generazlied Portraits Automation and Remote Control, 24, 1963.

    Google Scholar 

  4. Olivier Chapelle, Jason Weston, Leon Bottou and Vladimir Vapnik, Vicinal Risk Minization, NIPS, 2000.

    Google Scholar 

  5. Vladimir Vapnik, Principles of Risk Minimization for Learning Theory, NIPS 1991.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Appendix

Appendix

New Lagrangian to be minimized is given as

$$\displaystyle \begin{aligned} \min_{\mathbf{w}} \left\{ \left(\frac{1}{n}\sum_{i=1}^{n}\max(0,1-y_i[(\mathbf{w}.\mathbf{x}) - w_0])\right) + \lambda \Phi(\mathbf{w}) \right\} \end{aligned} $$
(8.20)

Here, with the \(\max \) function, we are essentially ignoring the cases when there is error in the classification.

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Joshi, A.V. (2023). Support Vector Machines. In: Machine Learning and Artificial Intelligence. Springer, Cham. https://doi.org/10.1007/978-3-031-12282-8_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-12282-8_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-12281-1

  • Online ISBN: 978-3-031-12282-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics