<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Variable selection in Proc Logistic - dealing with missing values in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Variable-selection-in-Proc-Logistic-dealing-with-missing-values/m-p/287996#M15276</link>
    <description>&lt;P&gt;The procedure will only use records that have values for all the model variables.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You might try imputation to replace missing values. There are a number of ways but if you have significant percentage of missing values for a given variable then your resulting model is going to be suspect.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;What percentage of your records are missing one or more of the variables?&lt;/P&gt;</description>
    <pubDate>Thu, 28 Jul 2016 22:47:27 GMT</pubDate>
    <dc:creator>ballardw</dc:creator>
    <dc:date>2016-07-28T22:47:27Z</dc:date>
    <item>
      <title>Variable selection in Proc Logistic - dealing with missing values</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Variable-selection-in-Proc-Logistic-dealing-with-missing-values/m-p/287976#M15275</link>
      <description>&lt;P&gt;Dear All,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Here I have a question about variale selection. Say I have the following data set:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Age&amp;nbsp;&amp;nbsp;&amp;nbsp; Sex&amp;nbsp;&amp;nbsp;&amp;nbsp; weight&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Pass&lt;/P&gt;
&lt;P&gt;15&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 70&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;1&lt;/P&gt;
&lt;P&gt;17 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;1&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;-&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&lt;/P&gt;
&lt;P&gt;16&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;-&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;60&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp; 1&amp;nbsp;&amp;nbsp; ;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;And I want to use logistic regression to predice pass using age, sex and weight . also I want to&amp;nbsp;use backwards variable selection to selsect significant covariate. So I coded as following:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="Courier New" size="2" color="#000080"&gt;&lt;STRONG&gt;proc&lt;/STRONG&gt;&lt;/FONT&gt; &lt;STRONG&gt;&lt;FONT face="Courier New" size="2" color="#000080"&gt;logistic&lt;/FONT&gt;&lt;/STRONG&gt; &lt;FONT face="Courier New" size="2" color="#0000ff"&gt;data&lt;/FONT&gt;&lt;FONT face="Courier New" size="2"&gt;=have &lt;/FONT&gt;&lt;FONT face="Courier New" size="2" color="#0000ff"&gt;descending&lt;/FONT&gt;&lt;FONT face="Courier New" size="2"&gt;; &lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="Courier New" size="2" color="#0000ff"&gt;&lt;FONT face="Courier New" size="2" color="#0000ff"&gt;&lt;FONT face="Courier New" size="2" color="#0000ff"&gt;model&lt;/FONT&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="2"&gt;&lt;FONT face="Courier New" size="2"&gt; pass= age sex weight &lt;/FONT&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="2"&gt;&lt;FONT face="Courier New" size="2"&gt;/&lt;/FONT&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="2" color="#0000ff"&gt;&lt;FONT face="Courier New" size="2" color="#0000ff"&gt;&lt;FONT face="Courier New" size="2" color="#0000ff"&gt;selection&lt;/FONT&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="2"&gt;&lt;FONT face="Courier New" size="2"&gt;=backward &lt;/FONT&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="2" color="#0000ff"&gt;&lt;FONT face="Courier New" size="2" color="#0000ff"&gt;&lt;FONT face="Courier New" size="2" color="#0000ff"&gt;fast&lt;/FONT&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="2"&gt;&lt;FONT face="Courier New" size="2"&gt;;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="Courier New" size="2" color="#000080"&gt;&lt;FONT face="Courier New" size="2" color="#000080"&gt;&lt;FONT face="Courier New" size="2" color="#000080"&gt;&lt;STRONG&gt;run&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="2"&gt;&lt;FONT face="Courier New" size="2"&gt;; &lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;However, since I have missing data here. The program only use the entry that has complete observation&amp;nbsp;based on full model, therefore the first enrty.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;But I want it to enclude more observation&amp;nbsp;as eleminate covariates, that is when model only with age and sex, i want it to use first two entries. Is there a way to realize this in SAS?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thank you very much!!!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Best,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;HereH&lt;/P&gt;</description>
      <pubDate>Thu, 28 Jul 2016 22:15:33 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Variable-selection-in-Proc-Logistic-dealing-with-missing-values/m-p/287976#M15275</guid>
      <dc:creator>Xiaoningdemao</dc:creator>
      <dc:date>2016-07-28T22:15:33Z</dc:date>
    </item>
    <item>
      <title>Re: Variable selection in Proc Logistic - dealing with missing values</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Variable-selection-in-Proc-Logistic-dealing-with-missing-values/m-p/287996#M15276</link>
      <description>&lt;P&gt;The procedure will only use records that have values for all the model variables.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You might try imputation to replace missing values. There are a number of ways but if you have significant percentage of missing values for a given variable then your resulting model is going to be suspect.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;What percentage of your records are missing one or more of the variables?&lt;/P&gt;</description>
      <pubDate>Thu, 28 Jul 2016 22:47:27 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Variable-selection-in-Proc-Logistic-dealing-with-missing-values/m-p/287996#M15276</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2016-07-28T22:47:27Z</dc:date>
    </item>
    <item>
      <title>Re: Variable selection in Proc Logistic - dealing with missing values</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Variable-selection-in-Proc-Logistic-dealing-with-missing-values/m-p/288021#M15278</link>
      <description>&lt;P&gt;You could screen your predictors with PROC HPSPLIT. It offers three methods for dealing with missing predictors. The output decision tree is a crude model but will show you which variables are the most important predictors.&lt;/P&gt;</description>
      <pubDate>Fri, 29 Jul 2016 02:25:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Variable-selection-in-Proc-Logistic-dealing-with-missing-values/m-p/288021#M15278</guid>
      <dc:creator>PGStats</dc:creator>
      <dc:date>2016-07-29T02:25:43Z</dc:date>
    </item>
  </channel>
</rss>

