<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Logistic regression Procedure in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Logistic-regression-Procedure/m-p/446904#M6817</link>
    <description>&lt;P&gt;It ignores the multi-variate nature of your predictor variables ... it ignores the correlations between the predictors, it ignores interactions between predictors.&amp;nbsp;If you are going to fit a model with multiple predictors, I am skeptical of the approach of looking at predictor variables one at a time.&lt;/P&gt;</description>
    <pubDate>Mon, 19 Mar 2018 18:25:23 GMT</pubDate>
    <dc:creator>PaigeMiller</dc:creator>
    <dc:date>2018-03-19T18:25:23Z</dc:date>
    <item>
      <title>Logistic regression Procedure</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Logistic-regression-Procedure/m-p/446876#M6814</link>
      <description>&lt;P&gt;I am developing a Logistic regression model for predicting complaints in a customer service set-up.&lt;/P&gt;&lt;P&gt;Consider a credit card company scenario, the objective here is to identify the customer who is most likely to complain n the next week or so, based on&amp;nbsp;events that happened&amp;nbsp;in the past couple of months. the attributes can be change in payment behavior, filed for bankruptcy y/n etc.&lt;/P&gt;&lt;P&gt;I had almost 200&amp;nbsp;independent variables&amp;nbsp;in my data. the target variable is binary.&lt;/P&gt;&lt;P&gt;I ran&amp;nbsp; proc logistic for every individual variable against the target variable and calculated the c-scores.&lt;/P&gt;&lt;P&gt;I then chose the variables with c-scores over 0.525. I ran proc logistic again with just these variables (all together now) and chose my final predicting variables. I used forward stepwise selection this time.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;My question is what is wrong in&amp;nbsp;following&amp;nbsp;this approach. I am worried about losing information by&amp;nbsp;short listing&amp;nbsp;the variables based on c-scores. I am following this methodology as that is what I have been told. Any&amp;nbsp;word of advice&amp;nbsp;would be greatly appreciated.&lt;/P&gt;</description>
      <pubDate>Mon, 19 Mar 2018 17:17:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Logistic-regression-Procedure/m-p/446876#M6814</guid>
      <dc:creator>SC_1991</dc:creator>
      <dc:date>2018-03-19T17:17:43Z</dc:date>
    </item>
    <item>
      <title>Re: Logistic regression Procedure</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Logistic-regression-Procedure/m-p/446886#M6815</link>
      <description>&lt;P&gt;What is wrong with this approach?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Go to your favorite search engine and type in "problems with stepwise regression".&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;A better approach, in my opinion, is to use a logistic version of Partial Least Squares Regression. Again, your favorite search engine will find examples. PROC PLS in SAS does the calculations.&lt;/P&gt;</description>
      <pubDate>Mon, 19 Mar 2018 17:37:04 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Logistic-regression-Procedure/m-p/446886#M6815</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2018-03-19T17:37:04Z</dc:date>
    </item>
    <item>
      <title>Re: Logistic regression Procedure</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Logistic-regression-Procedure/m-p/446903#M6816</link>
      <description>What about the way I shortlisted the variables based on single variate c-scores and used only the shortlisted variables in running my logistic regression?</description>
      <pubDate>Mon, 19 Mar 2018 18:08:50 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Logistic-regression-Procedure/m-p/446903#M6816</guid>
      <dc:creator>SC_1991</dc:creator>
      <dc:date>2018-03-19T18:08:50Z</dc:date>
    </item>
    <item>
      <title>Re: Logistic regression Procedure</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Logistic-regression-Procedure/m-p/446904#M6817</link>
      <description>&lt;P&gt;It ignores the multi-variate nature of your predictor variables ... it ignores the correlations between the predictors, it ignores interactions between predictors.&amp;nbsp;If you are going to fit a model with multiple predictors, I am skeptical of the approach of looking at predictor variables one at a time.&lt;/P&gt;</description>
      <pubDate>Mon, 19 Mar 2018 18:25:23 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Logistic-regression-Procedure/m-p/446904#M6817</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2018-03-19T18:25:23Z</dc:date>
    </item>
    <item>
      <title>Re: Logistic regression Procedure</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Logistic-regression-Procedure/m-p/446996#M6819</link>
      <description>&lt;P&gt;You might want to look at decision tree models from &lt;STRONG&gt;proc HpSplit&amp;nbsp;&lt;/STRONG&gt;for this kind of&amp;nbsp; problem. It will give you a better view of your data and might even provide you with a&amp;nbsp;predictive model.&lt;/P&gt;</description>
      <pubDate>Tue, 20 Mar 2018 03:38:50 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Logistic-regression-Procedure/m-p/446996#M6819</guid>
      <dc:creator>PGStats</dc:creator>
      <dc:date>2018-03-20T03:38:50Z</dc:date>
    </item>
  </channel>
</rss>

