<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Kernals in Support Vector Machines in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Kernals-in-Support-Vector-Machines/m-p/31963#M170</link>
    <description>An experimenta linear and nonlinear kernel SVM node is planned for EM 7.1 SAS9.3 mid next year. Thanks for you use of the software. There are quite a few other classification and prediction tools you will want to try.  Some users have reported good Gradient Boosting results. Will keep you and the forum up to date.</description>
    <pubDate>Wed, 10 Nov 2010 22:31:04 GMT</pubDate>
    <dc:creator>WayneThompson</dc:creator>
    <dc:date>2010-11-10T22:31:04Z</dc:date>
    <item>
      <title>Kernals in Support Vector Machines</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Kernals-in-Support-Vector-Machines/m-p/31962#M169</link>
      <description>This is more theoretical than SAS related:&lt;BR /&gt;
&lt;BR /&gt;
I continuously see kernel functions in SVMs as k(xi,xj), or k(x,y) or&lt;BR /&gt;
k(x,z)&lt;BR /&gt;
&lt;BR /&gt;
Are kernel functions in SVMs functions (dot products) of independent&lt;BR /&gt;
variables (x's) or a function of independent and dependent&lt;BR /&gt;
variables(x,y)?&lt;BR /&gt;
&lt;BR /&gt;
This also confuses my understanding of the relationship between kernel&lt;BR /&gt;
functions and kernel matrices.&lt;BR /&gt;
&lt;BR /&gt;
Any help?&lt;BR /&gt;
&lt;BR /&gt;
I currently have SAS EM 6.2. As far as I know SVMs aren't possible with this version. Any news on their implementation in future releases?&lt;BR /&gt;
&lt;BR /&gt;
-thanks</description>
      <pubDate>Wed, 10 Nov 2010 19:23:52 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Kernals-in-Support-Vector-Machines/m-p/31962#M169</guid>
      <dc:creator>SlutskyFan</dc:creator>
      <dc:date>2010-11-10T19:23:52Z</dc:date>
    </item>
    <item>
      <title>Re: Kernals in Support Vector Machines</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Kernals-in-Support-Vector-Machines/m-p/31963#M170</link>
      <description>An experimenta linear and nonlinear kernel SVM node is planned for EM 7.1 SAS9.3 mid next year. Thanks for you use of the software. There are quite a few other classification and prediction tools you will want to try.  Some users have reported good Gradient Boosting results. Will keep you and the forum up to date.</description>
      <pubDate>Wed, 10 Nov 2010 22:31:04 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Kernals-in-Support-Vector-Machines/m-p/31963#M170</guid>
      <dc:creator>WayneThompson</dc:creator>
      <dc:date>2010-11-10T22:31:04Z</dc:date>
    </item>
    <item>
      <title>Re: Kernals in Support Vector Machines</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Kernals-in-Support-Vector-Machines/m-p/31964#M171</link>
      <description>Hi Slutsky,&lt;BR /&gt;
&lt;BR /&gt;
To answer your query, the various notations you come across in the kernel literature is dependent on the user group. From a notation perspective x, y (or z) refer to vectors where as xi and xj refers to element i and j within vector x. &lt;BR /&gt;
&lt;BR /&gt;
A kernel matrix is usually donated with capital K whereas k is the kernel function (dot product). Hence, where as K is a matrix k(x,y) will be only an entry scalar in matrix K.&lt;BR /&gt;
&lt;BR /&gt;
To answer your second question regarding dependence and independence. The assumption is that the data is iid (identically and identically distributed). &lt;BR /&gt;
&lt;BR /&gt;
Perhaps this pseudocode will help understand the notation&lt;BR /&gt;
X - matrix of size nxm (n sample and m features)&lt;BR /&gt;
K - matrix of size nxn&lt;BR /&gt;
&lt;BR /&gt;
for i=1 to n&lt;BR /&gt;
  for j=1 to n&lt;BR /&gt;
     K[i,i] = X[i,:]*X[j,:]    % a linear dot product&lt;BR /&gt;
  endfor&lt;BR /&gt;
endfor&lt;BR /&gt;
&lt;BR /&gt;
a good book is 'introduction to support vector machine' by cristinanini and shawe-taylor</description>
      <pubDate>Tue, 16 Nov 2010 03:53:36 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Kernals-in-Support-Vector-Machines/m-p/31964#M171</guid>
      <dc:creator>DavidR_Hardoon</dc:creator>
      <dc:date>2010-11-16T03:53:36Z</dc:date>
    </item>
    <item>
      <title>Re: Kernals in Support Vector Machines</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Kernals-in-Support-Vector-Machines/m-p/31965#M172</link>
      <description>Thanks, this really helps. I think I'm getting a better picture. You said:&lt;BR /&gt;
&lt;BR /&gt;
"where as K is a matrix k(x,y) will be only an entry scalar in matrix K."&lt;BR /&gt;
&lt;BR /&gt;
I think I understand, but based on what you said is my following interpretation correct?&lt;BR /&gt;
&lt;BR /&gt;
#1  k(x,y) is a kernal function (which produces a scaler that becomes and entry in matrix K) &lt;BR /&gt;
&lt;BR /&gt;
If that is true, then does k(x,y) produce a dot product only between xi and xj (elements of the matrix X) or are they dot products also between x and y?&lt;BR /&gt;
&lt;BR /&gt;
I'm thinking the entries in the kernal matrix are only dot products of xi and xj given your pseudocode , and y is just a 'label'. &lt;BR /&gt;
&lt;BR /&gt;
But, I've also seen 'kernal functions' depicted in 2 different ways:&lt;BR /&gt;
&lt;BR /&gt;
gaussian kernal: k(x,y) = exp(-||x-y||^2 / sigma^2)&lt;BR /&gt;
&lt;BR /&gt;
gaussian kernal: k(xi,xj) = exp(-||xi-xj||^2 / sigma^2)&lt;BR /&gt;
&lt;BR /&gt;
So I'm still confused on the notation about what are the 'inputs' into the kernel function, are the elements only of some matrix X, or can they also contain elements of Y?&lt;BR /&gt;
&lt;BR /&gt;
Thanks.</description>
      <pubDate>Wed, 17 Nov 2010 16:07:10 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Kernals-in-Support-Vector-Machines/m-p/31965#M172</guid>
      <dc:creator>SlutskyFan</dc:creator>
      <dc:date>2010-11-17T16:07:10Z</dc:date>
    </item>
    <item>
      <title>Re: Kernals in Support Vector Machines</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Kernals-in-Support-Vector-Machines/m-p/31966#M173</link>
      <description>Thanks so much. Looking forward to it. I've actually got my hands full dealing with all that is available in EM anyway- but I like to stay ahead of the curve.</description>
      <pubDate>Wed, 17 Nov 2010 16:14:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Kernals-in-Support-Vector-Machines/m-p/31966#M173</guid>
      <dc:creator>SlutskyFan</dc:creator>
      <dc:date>2010-11-17T16:14:55Z</dc:date>
    </item>
    <item>
      <title>Re: Kernals in Support Vector Machines</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Kernals-in-Support-Vector-Machines/m-p/31967#M174</link>
      <description>SlutskyFan:&lt;BR /&gt;
The Gaussian kernel that you mentioned fits in the scheme set by David thus:&lt;BR /&gt;
&lt;BR /&gt;
First, rewrite David's pseudo-code as follows:&lt;BR /&gt;
X[1,.],..., X[n,.] - n elements of a (Hilbert) space with inner product &amp;lt;,&amp;gt;&lt;BR /&gt;
K - matrix of size nxn&lt;BR /&gt;
&lt;BR /&gt;
for i=1 to n&lt;BR /&gt;
for j=1 to n&lt;BR /&gt;
K[i,i] = &lt;X&gt;  % a linear dot product&lt;BR /&gt;
endfor&lt;BR /&gt;
endfor &lt;BR /&gt;
&lt;BR /&gt;
&lt;BR /&gt;
To make this work for a sample of size n of m features: c_1, c_2, ..., c_n, apply the pseudo code to the result of transforming the feature vectors according to a function f, i.e. apply the pseudo-code to X[1,] = f(c_1) , X[2,] = f(c_2), ... , X[n,]=f(c_n).&lt;BR /&gt;
&lt;BR /&gt;
Here is the function: let c be an m dimensional vector. To c we will assign an element in an infinite dimensional space, a space of functions defined in m dimensional vectors. f(c) is a function of another variable h defined as:&lt;BR /&gt;
f(c)(h) = exp( -  (||c - h||^2)/(2*sigma^2)  )&lt;BR /&gt;
&lt;BR /&gt;
Here is the definition of the inner product (all technicalities aside):&lt;BR /&gt;
if A and B are (sufficiently nice) functions of h in R^m:&lt;BR /&gt;
&lt;A&gt; = integral over R^m of A(h)B(h) dh&lt;BR /&gt;
&lt;BR /&gt;
Now, it is a long exercise to verify that:&lt;BR /&gt;
&lt;F&gt; = exp( -  (||x - c||^2)/(sigma^2)  )&lt;BR /&gt;
- it is the same computation that verifies that the sum of two independent normally distributed random variables is also normal - &lt;BR /&gt;
&lt;BR /&gt;
In other words:&lt;BR /&gt;
&lt;X&gt; = &lt;F&gt; = exp( -  (||c_1 - c_2||^2)/(sigma^2)  )&lt;BR /&gt;
&lt;BR /&gt;
Different functions f give rise to different kernels. Different mappings of the feature vectors into larger spaces give rise to different point configurations.&lt;/F&gt;&lt;/X&gt;&lt;/F&gt;&lt;/A&gt;&lt;/X&gt;</description>
      <pubDate>Wed, 16 Feb 2011 20:34:30 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Kernals-in-Support-Vector-Machines/m-p/31967#M174</guid>
      <dc:creator>VictorZurkowski</dc:creator>
      <dc:date>2011-02-16T20:34:30Z</dc:date>
    </item>
  </channel>
</rss>

