<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Logistic regresion (proc logistics) vars preparation questions in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Logistic-regresion-proc-logistics-vars-preparation-questions/m-p/289741#M4338</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;I am using logistic regresion to predict a target var type (1,0).&lt;BR /&gt;One of the vars of my model is a classificarion var.&lt;/P&gt;
&lt;P&gt;a_type = ("high", "medium" , "low"), is a prediction var, not the target&lt;/P&gt;
&lt;P&gt;I use proc logistics.&lt;/P&gt;
&lt;P&gt;I don't know if it is recommended to transform this var in dummy vars like that:&lt;/P&gt;
&lt;P&gt;a_type_high = (1,0)&lt;BR /&gt;a_type_medium = (1,0)&lt;BR /&gt;a_type_low = (1,0)&lt;/P&gt;
&lt;P&gt;I suppose that kind of vars are better for logistic regression, isn't it?&lt;BR /&gt;If I don't transform the vars, does the proc do the transformation automatically?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Another question I also have several continuos/quantitative vars like sales (0-50), mkt_exp (0-1000)&lt;BR /&gt;do I have to no a normalization to transform in a var with avg=0 and std = 1?, is that needed?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Fri, 05 Aug 2016 07:35:58 GMT</pubDate>
    <dc:creator>juanvg1972</dc:creator>
    <dc:date>2016-08-05T07:35:58Z</dc:date>
    <item>
      <title>Logistic regresion (proc logistics) vars preparation questions</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Logistic-regresion-proc-logistics-vars-preparation-questions/m-p/289741#M4338</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;I am using logistic regresion to predict a target var type (1,0).&lt;BR /&gt;One of the vars of my model is a classificarion var.&lt;/P&gt;
&lt;P&gt;a_type = ("high", "medium" , "low"), is a prediction var, not the target&lt;/P&gt;
&lt;P&gt;I use proc logistics.&lt;/P&gt;
&lt;P&gt;I don't know if it is recommended to transform this var in dummy vars like that:&lt;/P&gt;
&lt;P&gt;a_type_high = (1,0)&lt;BR /&gt;a_type_medium = (1,0)&lt;BR /&gt;a_type_low = (1,0)&lt;/P&gt;
&lt;P&gt;I suppose that kind of vars are better for logistic regression, isn't it?&lt;BR /&gt;If I don't transform the vars, does the proc do the transformation automatically?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Another question I also have several continuos/quantitative vars like sales (0-50), mkt_exp (0-1000)&lt;BR /&gt;do I have to no a normalization to transform in a var with avg=0 and std = 1?, is that needed?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 05 Aug 2016 07:35:58 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Logistic-regresion-proc-logistics-vars-preparation-questions/m-p/289741#M4338</guid>
      <dc:creator>juanvg1972</dc:creator>
      <dc:date>2016-08-05T07:35:58Z</dc:date>
    </item>
    <item>
      <title>Re: Logistic regresion (proc logistics) vars preparation questions</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Logistic-regresion-proc-logistics-vars-preparation-questions/m-p/289746#M4339</link>
      <description>&lt;P&gt;Categorical variables should be placed in the CLASS statement.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If it's your first time doing an analysis I like to find a worked example, work through that, then proceed to my data.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The&amp;nbsp;documentation has a good example of analysis with categorical predictors.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Another resource:&lt;/P&gt;
&lt;P&gt;&lt;A href="http://www.ats.ucla.edu/stat/sas/dae/logit.htm" target="_blank"&gt;http://www.ats.ucla.edu/stat/sas/dae/logit.htm&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Normalization is up to you. If you choose to do so, look at proc stdize.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Are res you using SAS Enterprise Miner?&lt;/P&gt;</description>
      <pubDate>Fri, 05 Aug 2016 07:48:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Logistic-regresion-proc-logistics-vars-preparation-questions/m-p/289746#M4339</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2016-08-05T07:48:54Z</dc:date>
    </item>
    <item>
      <title>Re: Logistic regresion (proc logistics) vars preparation questions</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Logistic-regresion-proc-logistics-vars-preparation-questions/m-p/289805#M4341</link>
      <description>Hi, 

When you list the variable at Class statement under proc logistic, the Class statement option Param=EFFECT should do the dummy variable for you. The other commonly used option is Param=GLM. There is no fast rule as to which one is better. In using Proc logistic for predictive modeling,  these two options are most popular. There are another ~8 options you may explore. That is if your work is more design matrix sensitive. 

One issue more important for you actually is missing value status on the categorical variable. There is a Missing option at the Class statement you can read. Generally Proc logistic has been optimized continuously so the user does not have to spend time coding stuff manually. 

As for normalization, the question really relates to if the model is sensitive to distribution of  input variables, the interval variables. If the variable is too NOT normal, you should not normalize. Other factors  include 1. your link function. Many link functions are distribution tolerant, but not all. 2. sample size. Many modelers tend to ignore normality of input variables when the model universe is big. 3. Really normality matters if univariate study of the input variable is critical for fitting the model: in fitting models  like logistic regression, interactions among inputs are often more influential. 4. If one should normalize an interval input, the marginal improvement on its contribution towards the model's overall predictive accuracy tends to be: first, hard to measure. second, if measurable, tends to be insignificant. Hope this helps? Thanks for using SAS. Jason Xin</description>
      <pubDate>Fri, 05 Aug 2016 14:34:00 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Logistic-regresion-proc-logistics-vars-preparation-questions/m-p/289805#M4341</guid>
      <dc:creator>JasonXin</dc:creator>
      <dc:date>2016-08-05T14:34:00Z</dc:date>
    </item>
    <item>
      <title>Re: Logistic regresion (proc logistics) vars preparation questions</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Logistic-regresion-proc-logistics-vars-preparation-questions/m-p/289812#M4343</link>
      <description>&lt;P&gt;In the CLASS statement, look at the parameterization options. AFAIK param = Ref is the most common, and most easily interpretable way of specifying your variables. Make sure you review the design matrix and understand your output.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 05 Aug 2016 14:48:33 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Logistic-regresion-proc-logistics-vars-preparation-questions/m-p/289812#M4343</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2016-08-05T14:48:33Z</dc:date>
    </item>
    <item>
      <title>Re: Logistic regresion (proc logistics) vars preparation questions</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Logistic-regresion-proc-logistics-vars-preparation-questions/m-p/289815#M4344</link>
      <description>&lt;P&gt;Thanks, I am using Enterprise Guide, no Miner.&lt;/P&gt;
&lt;P&gt;I am not sure, when to standarize or not.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks for your help&lt;/P&gt;</description>
      <pubDate>Fri, 05 Aug 2016 15:04:06 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Logistic-regresion-proc-logistics-vars-preparation-questions/m-p/289815#M4344</guid>
      <dc:creator>juanvg1972</dc:creator>
      <dc:date>2016-08-05T15:04:06Z</dc:date>
    </item>
  </channel>
</rss>

