<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: question about variable selesction in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/question-about-variable-selesction/m-p/289890#M15370</link>
    <description>Hi Rick,&lt;BR /&gt;Thank you for your solution!&lt;BR /&gt;Yes, imputation is a way to address missing values. But I'm not looking for imputation. I just want to use the information I collected, for this is patient information I'm dealing with.&lt;BR /&gt;&lt;BR /&gt;I want SAS to add more observations, ( i.e. increase the dim of X) when less covariates are considered. Do you have a suggestion for this?&lt;BR /&gt;&lt;BR /&gt;Thanks again!!!&lt;BR /&gt;&lt;BR /&gt;Best wishes.</description>
    <pubDate>Fri, 05 Aug 2016 21:37:31 GMT</pubDate>
    <dc:creator>Xiaoningdemao</dc:creator>
    <dc:date>2016-08-05T21:37:31Z</dc:date>
    <item>
      <title>question about variable selesction</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/question-about-variable-selesction/m-p/287961#M15363</link>
      <description>&lt;P&gt;Hi All,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Here&amp;nbsp;I have a question about variable selesction&amp;nbsp;in SAS. Say I have the following data set:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;age gender &amp;nbsp;height&amp;nbsp;&amp;nbsp; &amp;nbsp;weight&amp;nbsp;&amp;nbsp; &amp;nbsp;death&lt;/P&gt;&lt;P&gt;89&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;.&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;110&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 1&lt;/P&gt;&lt;P&gt;70&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 1&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;58&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; .&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;0&lt;/P&gt;&lt;P&gt;50&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;1&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;173&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp; 130&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 1;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;And&amp;nbsp;I want to fit a&amp;nbsp;logistic regression with backwards variable selection, so&amp;nbsp;I coded like this:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#000080" face="Courier New" size="2"&gt;&lt;STRONG&gt;proc&lt;/STRONG&gt;&lt;/FONT&gt; &lt;STRONG&gt;&lt;FONT color="#000080" face="Courier New" size="2"&gt;logistic&lt;/FONT&gt;&lt;/STRONG&gt; &lt;FONT color="#0000ff" face="Courier New" size="2"&gt;data&lt;/FONT&gt;&lt;FONT face="Courier New" size="2"&gt;=have &lt;/FONT&gt;&lt;FONT color="#0000ff" face="Courier New" size="2"&gt;descending&lt;/FONT&gt;&lt;FONT face="Courier New" size="2"&gt;; &lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#0000ff" face="Courier New" size="2"&gt;&lt;FONT color="#0000ff" face="Courier New" size="2"&gt;&lt;FONT color="#0000ff" face="Courier New" size="2"&gt;class&lt;/FONT&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="2"&gt;&lt;FONT face="Courier New" size="2"&gt; gender&lt;/FONT&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="2"&gt;&lt;FONT face="Courier New" size="2"&gt;;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#0000ff" face="Courier New" size="2"&gt;&lt;FONT color="#0000ff" face="Courier New" size="2"&gt;&lt;FONT color="#0000ff" face="Courier New" size="2"&gt;model&lt;/FONT&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="2"&gt;&lt;FONT face="Courier New" size="2"&gt; death =&amp;nbsp;age gender height weight&lt;/FONT&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="2"&gt;&lt;FONT face="Courier New" size="2"&gt;&amp;nbsp;/&lt;/FONT&gt;&lt;/FONT&gt;&lt;FONT color="#0000ff" face="Courier New" size="2"&gt;&lt;FONT color="#0000ff" face="Courier New" size="2"&gt;&lt;FONT color="#0000ff" face="Courier New" size="2"&gt;selection&lt;/FONT&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="2"&gt;&lt;FONT face="Courier New" size="2"&gt;=backward &lt;/FONT&gt;&lt;/FONT&gt;&lt;FONT color="#0000ff" face="Courier New" size="2"&gt;&lt;FONT color="#0000ff" face="Courier New" size="2"&gt;&lt;FONT color="#0000ff" face="Courier New" size="2"&gt;fast&lt;/FONT&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="2"&gt;&lt;FONT face="Courier New" size="2"&gt; ;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#000080" face="Courier New" size="2"&gt;&lt;FONT color="#000080" face="Courier New" size="2"&gt;&lt;FONT color="#000080" face="Courier New" size="2"&gt;&lt;STRONG&gt;run&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="2"&gt;&lt;FONT face="Courier New" size="2"&gt;;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT face="Courier New" size="2"&gt;&lt;FONT face="Courier New" size="2"&gt;But since the data has missing value, only the last obs will be used to do this model selection (i.e program will delete entries that has missing values based on full model). &lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT face="Courier New" size="2"&gt;&lt;FONT face="Courier New" size="2"&gt;But, I want the program to include more obs when it evaluate model with: age gender&amp;nbsp;height, i.e. use obs 2 and 3. Is there a&amp;nbsp;command to make&amp;nbsp;this&amp;nbsp;haapened in SAS ??&amp;nbsp;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT face="Courier New" size="2"&gt;&lt;FONT face="Courier New" size="2"&gt;Thank you very much!&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT face="Courier New" size="2"&gt;&lt;FONT face="Courier New" size="2"&gt;Best,&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 28 Jul 2016 21:04:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/question-about-variable-selesction/m-p/287961#M15363</guid>
      <dc:creator>Xiaoningdemao</dc:creator>
      <dc:date>2016-07-28T21:04:43Z</dc:date>
    </item>
    <item>
      <title>Re: question about variable selesction</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/question-about-variable-selesction/m-p/289787#M15364</link>
      <description>&lt;P&gt;In classical linear models, the regression needs to form the so-called SSCP matrix X`*X. &amp;nbsp;To form this matrix product require removing observations that have missing values.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;See &lt;A href="https://communities.sas.com/t5/General-SAS-Programming/Missing-values-in-logistic-regression/td-p/85056" target="_self"&gt;a&amp;nbsp;previous discussion about this topic for other options,&lt;/A&gt;&amp;nbsp;including multiple imputation with PROC MI.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If you are committed to PROC LOGISTIC, multiple imputation is a good solution. For survey data, SAS provides PROC SURVEYLOGISIC. If you can express your model as a mixed model, PROC GLIMMIX handles missing data differently.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;There is extensive literature in this area. I particulalry like the research and suggestions by Paul Allison, and recommend that you do an internet search for&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;logistic regression "missing data" site:statisticalhorizons.com&lt;/STRONG&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 05 Aug 2016 13:11:26 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/question-about-variable-selesction/m-p/289787#M15364</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2016-08-05T13:11:26Z</dc:date>
    </item>
    <item>
      <title>Re: question about variable selesction</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/question-about-variable-selesction/m-p/289890#M15370</link>
      <description>Hi Rick,&lt;BR /&gt;Thank you for your solution!&lt;BR /&gt;Yes, imputation is a way to address missing values. But I'm not looking for imputation. I just want to use the information I collected, for this is patient information I'm dealing with.&lt;BR /&gt;&lt;BR /&gt;I want SAS to add more observations, ( i.e. increase the dim of X) when less covariates are considered. Do you have a suggestion for this?&lt;BR /&gt;&lt;BR /&gt;Thanks again!!!&lt;BR /&gt;&lt;BR /&gt;Best wishes.</description>
      <pubDate>Fri, 05 Aug 2016 21:37:31 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/question-about-variable-selesction/m-p/289890#M15370</guid>
      <dc:creator>Xiaoningdemao</dc:creator>
      <dc:date>2016-08-05T21:37:31Z</dc:date>
    </item>
    <item>
      <title>Re: question about variable selesction</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/question-about-variable-selesction/m-p/289967#M15375</link>
      <description>&lt;P&gt;I don't understand what you mean by "add more observations." SAS (and all statistical software) analyzes the data you have. Can you provide an example? For the data you've presented, what would you like to happen if only two or three covariates are being analyzed?&lt;/P&gt;</description>
      <pubDate>Sat, 06 Aug 2016 10:38:36 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/question-about-variable-selesction/m-p/289967#M15375</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2016-08-06T10:38:36Z</dc:date>
    </item>
    <item>
      <title>Re: question about variable selesction</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/question-about-variable-selesction/m-p/290317#M15421</link>
      <description>Dear Rick,&lt;BR /&gt;I have around 1,000 obs of 26 variables and one outcome. but only 300 obs are complete. So I will lose a lot information, if I only use the complete observations.&lt;BR /&gt;Say for first 100 obs, only variable1 is missing, when I run the model outcome=variable2-26, I want the first 100 obs to be added to the matrix. In this way I feel like I can make more use of the information collected.&lt;BR /&gt;Is this make sense? If it does, can I use SAS to realize this?&lt;BR /&gt;Thank you!!&lt;BR /&gt;Best wishes.</description>
      <pubDate>Mon, 08 Aug 2016 20:52:27 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/question-about-variable-selesction/m-p/290317#M15421</guid>
      <dc:creator>Xiaoningdemao</dc:creator>
      <dc:date>2016-08-08T20:52:27Z</dc:date>
    </item>
    <item>
      <title>Re: question about variable selesction</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/question-about-variable-selesction/m-p/290431#M15433</link>
      <description>&lt;P&gt;I am not aware of any variable selection technique for logistic regression in which different observations are used for different sets of candidate variables.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;There are other predictive models that are more tolerant of missing values. &amp;nbsp;You might &lt;A href="http://support.sas.com/documentation/cdl/en/statug/68162/HTML/default/viewer.htm#statug_hpsplit_overview.htm" target="_self"&gt;look at PROC HPSPLIT&lt;/A&gt;, which uses tree-based models for building regression models.&lt;/P&gt;</description>
      <pubDate>Tue, 09 Aug 2016 12:01:17 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/question-about-variable-selesction/m-p/290431#M15433</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2016-08-09T12:01:17Z</dc:date>
    </item>
    <item>
      <title>Re: question about variable selesction</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/question-about-variable-selesction/m-p/290772#M15458</link>
      <description>Dear Rick,&lt;BR /&gt;&lt;BR /&gt;Thank you very much! I will look into it.&lt;BR /&gt;&lt;BR /&gt;Have a nice day~&lt;BR /&gt;&lt;BR /&gt;Best wishes.</description>
      <pubDate>Wed, 10 Aug 2016 17:20:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/question-about-variable-selesction/m-p/290772#M15458</guid>
      <dc:creator>Xiaoningdemao</dc:creator>
      <dc:date>2016-08-10T17:20:43Z</dc:date>
    </item>
  </channel>
</rss>

