• Nenhum resultado encontrado

We first find  (=1, …, 5) by We use the polynomial kernel of degree 2 Suppose we have 5 1D data points Example

N/A
N/A
Protected

Academic year: 2022

Share "We first find  (=1, …, 5) by We use the polynomial kernel of degree 2 Suppose we have 5 1D data points Example"

Copied!
8
0
0

Texto

(1)

Example

Suppose we have 5 1D data points

x1=1, x2=2, x3=4, x4=5, x5=6, with 1, 2, 6 as class 1 and 4, 5 as class 2  y1=1, y2=1, y3=-1, y4=-1, y5=1

We use the polynomial kernel of degree 2

K(x,y) = (xy+1)2

C is set to 100

We first find i (i=1, …, 5) by

(2)

Example

By using a QP solver, we get

1=0, 2=2.5, 3=0, 4=7.333, 5=4.833

Note that the constraints are indeed satisfied

The support vectors are {x2=2, x4=5, x5=6}

The discriminant function is

b is recovered by solving f(2)=1 or by f(5)=-1 or

(3)

Example

Value of discriminant function

1 2 4 5 6

class 2 class 1

class 1

(4)

Multi-class Classification

SVM is basically a two-class classifier

One can change the QP formulation to allow multi- class classification

More commonly, the data set is divided into two parts “intelligently” in different ways and a

separate SVM is trained for each way of division

Multi-class classification is done by combining the output of all the SVM classifiers

(5)

Software

A list of SVM implementation can be found at http://www.kernel-machines.org/software.html

Some implementation (such as LIBSVM) can handle multi-class classification

SVMLight is among one of the earliest implementation of SVM

Several Matlab toolboxes for SVM are also available

(6)

Summary: Steps for Classification

Prepare the pattern matrix

Select the kernel function to use

Select the parameter of the kernel function and the value of C

You can use the values suggested by the SVM software, or you can set apart a validation set to determine the values of the parameter

Execute the training algorithm and obtain the 

(7)

Strengths and Weaknesses of SVM

Strengths

Training is relatively easy

No local optimal, unlike in neural networks

It scales relatively well to high dimensional data

Tradeoff between classifier complexity and error can be controlled explicitly

Non-traditional data like strings and trees can be used as input to SVM, instead of feature vectors

Weaknesses

Need a “good” kernel function

(8)

Resources

http://www.kernel-machines.org/

http://www.support-vector.net/

http://www.support-vector.net/icml-tutorial.pdf

http://www.kernel-

machines.org/papers/tutorial-nips.ps.gz

http://www.clopinet.com/isabelle/Projects/SVM/

applist.html

Referências

Documentos relacionados

Secondly, we construct the balls using core sets from each pair of support vectors and we use this balls in order to catch all the data points (near of classification hyper

Using non-lethal methods for data collection of individuals and participatory monitoring by fishermen, we provide the first empirical evidence of the use of a nursery area by

Since the scattered electric field used as the input for the inverse procedure is also a complex vector, we have 6 × 257 = 1542 data parameters for the first example and 6 × 515 =

We didn’t find studies that used 6minWT concomitant to hypoxia, but despite the lack of comparison, we could suppose that the walked distance reduction observed may be related

[r]

We use instead USPTO patent data to measure the relevant dimensions of the technological regimes prevailing in each sector and, on this basis of this information, we

Contudo também será possível verificar que a introdução do valor desfasado do produto bancário referente ao 1º semestre de 2014 têm uma relevância diminuta na formação do

(2) This reality must be inquired into, if we are to find the best standard of truth (we have said that we ought to determine our thoughts by the given standard of a true idea, and