<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Correlated variables in classifiers in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Correlated-variables-in-classifiers/m-p/386095#M5701</link>
    <description>&lt;P&gt;Correlation among input variables could be a very important issue in classical regression where the structure of the model was critical to generating useful results and interpretation. &amp;nbsp;In most data mining scenarios, you have far more data than was available to historical approaches as well as powerful methods (linear &amp;amp; nonlinear) that allow you to model relationships using flexible models which adapt to your data. &amp;nbsp;You can use holdout data to empirically validate the relationships with data rather than relying on assumptions. &amp;nbsp;The amount of interpretation available differs from model to model. &amp;nbsp; Trees provide simple interpretability while neural network and SVM models do not lend themselves to interpretation. &amp;nbsp;Correlation is a concern for interpretation of simple regression models but interpretation is not meaningful if the model is inadequate which they often are. &amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If you would benefit from broader training in using these methods, check out the training available at&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &lt;A href="http://support.sas.com/training/us/paths/dm.html" target="_self"&gt;http://support.sas.com/training/us/paths/dm.html&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;where you can get a better understand about how these different models can be used.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Hope this helps!&lt;/P&gt;
&lt;P&gt;Doug&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Mon, 07 Aug 2017 17:44:01 GMT</pubDate>
    <dc:creator>DougWielenga</dc:creator>
    <dc:date>2017-08-07T17:44:01Z</dc:date>
    <item>
      <title>Correlated variables in classifiers</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Correlated-variables-in-classifiers/m-p/363949#M5432</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I was wondering if someone can help me? I am using some classifiers and my dataset contains some variables with no correlation and some with high correlation.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am looking to know that how important classifiers like SVM, Neural Network, Decision Tree, Random Forest and Logistic Regression deal with highly correlated variables?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Do they automatically reject them for classification or we have to manage ourself?&lt;/P&gt;&lt;P&gt;If we have to manage ourself then how can I manage that ?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In all of these classifiers how the selection and rejection of features takes place?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 02 Jun 2017 22:09:12 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Correlated-variables-in-classifiers/m-p/363949#M5432</guid>
      <dc:creator>geniusgenie</dc:creator>
      <dc:date>2017-06-02T22:09:12Z</dc:date>
    </item>
    <item>
      <title>Re: Correlated variables in classifiers</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Correlated-variables-in-classifiers/m-p/386095#M5701</link>
      <description>&lt;P&gt;Correlation among input variables could be a very important issue in classical regression where the structure of the model was critical to generating useful results and interpretation. &amp;nbsp;In most data mining scenarios, you have far more data than was available to historical approaches as well as powerful methods (linear &amp;amp; nonlinear) that allow you to model relationships using flexible models which adapt to your data. &amp;nbsp;You can use holdout data to empirically validate the relationships with data rather than relying on assumptions. &amp;nbsp;The amount of interpretation available differs from model to model. &amp;nbsp; Trees provide simple interpretability while neural network and SVM models do not lend themselves to interpretation. &amp;nbsp;Correlation is a concern for interpretation of simple regression models but interpretation is not meaningful if the model is inadequate which they often are. &amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If you would benefit from broader training in using these methods, check out the training available at&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &lt;A href="http://support.sas.com/training/us/paths/dm.html" target="_self"&gt;http://support.sas.com/training/us/paths/dm.html&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;where you can get a better understand about how these different models can be used.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Hope this helps!&lt;/P&gt;
&lt;P&gt;Doug&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 07 Aug 2017 17:44:01 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Correlated-variables-in-classifiers/m-p/386095#M5701</guid>
      <dc:creator>DougWielenga</dc:creator>
      <dc:date>2017-08-07T17:44:01Z</dc:date>
    </item>
  </channel>
</rss>

