204.6.3 SVM : The Algorithm

Link to the previous post : https://statinfer.com/204-6-2-practice-simple-classifier/

SVM- The large margin classifier

SVM is all about finding the maximum-margin classifier.
"Classifier" is the generic name; the decision boundary itself is called a hyperplane.
- Hyperplane: in a 3-dimensional space, hyperplanes are 2-dimensional planes; in a 2-dimensional space, hyperplanes are 1-dimensional lines.
The SVM algorithm uses the training examples nearest to the boundary (the support vectors) to derive the classifier with the maximum margin.
Each data point is considered as a p-dimensional vector (a list of p numbers).
SVM uses vector algebra and mathematical optimization to find the optimal hyperplane that has maximum margin.

The SVM Algorithm

If a dataset is linearly separable, then we can always find a linear function f(x) such that
- For all negatively labeled records, f(x) < 0
- For all positively labeled records, f(x) > 0
- The hyperplane f(x) = 0 is nothing but the linear classifier
- In two dimensions: $f(x) = w_{1} x_{1} + w_{2} x_{2} + b$
- In general: $f(x) = w^{T} x + b$
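
The separability condition above can be checked directly for any candidate hyperplane. The snippet below is a minimal sketch, not part of the original post: the helper names `decision_value` and `separates`, the toy points, and the chosen `w` and `b` are all hypothetical and picked only for illustration.

```python
def decision_value(w, x, b):
    """Return f(x) = w^T x + b for a single point x."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def separates(w, b, points, labels):
    """True if f(x) > 0 for every +1 record and f(x) < 0 for every -1 record."""
    return all(y * decision_value(w, x, b) > 0 for x, y in zip(points, labels))


# Hypothetical 2-dimensional toy data: two negative and two positive records.
points = [(1.0, 1.0), (2.0, 0.5), (4.0, 4.0), (5.0, 3.5)]
labels = [-1, -1, 1, 1]

# Assumed candidate hyperplane x1 + x2 - 5 = 0.
w = (1.0, 1.0)
b = -5.0

print(separates(w, b, points, labels))                # this candidate separates the classes
print(separates((1.0, -1.0), 0.0, points, labels))    # this candidate does not
```

Many hyperplanes can pass this check for the same data; SVM picks the one with the largest margin.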

Math behind SVM Algorithm

SVM Algorithm – The Math

If you already understand the SVM technique and find this section too technical, you may want to skip it. The tool will take care of this optimization.

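$f(x) = w^{T} x + b$
Let $x^{+}$ and $x^{-}$ be the closest points on either side of the hyperplane, scaled so that $w^{T} x^{+} + b = 1$ and $w^{T} x^{-} + b = -1$
Since $w$ is perpendicular to the hyperplane, $x^{+} = x^{-} + \lambda w$ for some scalar $\lambda$
Substituting into $w^{T} x^{+} + b = 1$
- $w^{T} (x^{-} + \lambda w) + b = 1$
- $w^{T} x^{-} + b + \lambda (w \cdot w) = 1$
- $-1 + \lambda (w \cdot w) = 1$
- $\lambda = 2 / (w \cdot w)$
The margin is $m = \| x^{+} - x^{-} \|$
- $m = \| \lambda w \|$
- $m = (2 / (w \cdot w)) \, \| w \|$
- $m = 2 / \| w \|$, since $w \cdot w = \| w \|^{2}$
The objective is to maximize $2 / \| w \|$
- i.e., minimize $\| w \|$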
A good decision boundary should satisfy
- $w^{T} x + b \geq 1$ for all points with y = 1
- $w^{T} x + b \leq -1$ for all points with y = -1
- i.e., $y (w^{T} x + b) \geq 1$ for all points
Now we have an optimization problem with an objective and constraints
- minimize $\| w \|$, or equivalently $\frac{1}{2} \| w \|^{2}$
- subject to the constraint $y (w^{T} x + b) \geq 1$ for every training point
We can solve the above optimization problem to obtain w and b
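
The margin derivation above can be verified numerically. The following is a small sketch under assumed values: `w = (3, 4)`, `b = -1`, the point `x_minus`, and the toy records are hypothetical, and the helpers `dot`, `margin`, and `satisfies_constraints` are illustrative names. It checks the algebra and the constraints only; it does not solve the optimization problem itself.

```python
import math


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(ui * vi for ui, vi in zip(u, v))


def margin(w):
    """Margin m = 2 / ||w|| from the derivation."""
    return 2.0 / math.sqrt(dot(w, w))


def satisfies_constraints(w, b, points, labels):
    """True if y * (w^T x + b) >= 1 holds for every training point."""
    return all(y * (dot(w, x) + b) >= 1 for x, y in zip(points, labels))


# Assumed hyperplane parameters (hypothetical values).
w = (3.0, 4.0)
b = -1.0

# A point on the negative margin plane: w^T x + b = -1.
x_minus = (0.0, 0.0)

# Step along w by lambda = 2 / (w . w) to reach the positive margin plane.
lam = 2.0 / dot(w, w)
x_plus = tuple(xm + lam * wi for xm, wi in zip(x_minus, w))

print(dot(w, x_plus) + b)            # approximately 1
print(math.dist(x_plus, x_minus))    # distance between the two margin planes
print(margin(w))                     # 2 / ||w||, which should match the distance

# Hypothetical training records for the constraint check.
points = [(0.0, 0.0), (-1.0, 0.0), (1.0, 1.0), (1.0, 0.0)]
labels = [-1, -1, 1, 1]
print(satisfies_constraints(w, b, points, labels))
```

Halving `w` and `b` describes the same hyperplane with a doubled nominal margin, but the point `(0, 0)` then violates the constraint. This is why the objective and the constraints have to be handled together.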

SVM Result

SVM doesn't output a probability. It directly gives the class that a new data point belongs to.
For a new point $x_{k}$

Calculate $w^{T} x_{k} + b$. If this value is positive, the prediction is +1; otherwise it is -1.
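
The prediction rule can be sketched as below. The values of `w` and `b` are assumed to come from an already solved optimization problem and are hypothetical here, as are the function names `predict` and `signed_distance`. The second helper shows that the raw score, divided by the norm of `w`, is a signed distance from the hyperplane and not a probability.

```python
import math


def predict(w, b, x_k):
    """Return +1 if w^T x_k + b is positive, otherwise -1."""
    score = sum(wi * xi for wi, xi in zip(w, x_k)) + b
    return 1 if score > 0 else -1


def signed_distance(w, b, x_k):
    """Signed distance of x_k from the hyperplane: (w^T x_k + b) / ||w||."""
    score = sum(wi * xi for wi, xi in zip(w, x_k)) + b
    return score / math.sqrt(sum(wi * wi for wi in w))


# Hypothetical fitted parameters.
w = (3.0, 4.0)
b = -1.0

print(predict(w, b, (1.0, 1.0)))     # score = 3 + 4 - 1 = 6, so the class is +1
print(predict(w, b, (-1.0, 0.0)))    # score = -3 - 1 = -4, so the class is -1
print(signed_distance(w, b, (1.0, 1.0)))
```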

The next post is about building an SVM model in Python.

Link to the next post : https://statinfer.com/204-6-4-building-svm-model-in-python/

21st June 2017
