<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: PROC MIXED Missing Data in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/PROC-MIXED-Missing-Data/m-p/604699#M175335</link>
    <description>Thanks very much for the clarification; that makes sense.&lt;BR /&gt;&lt;BR /&gt;However, it looks to me as if it’s dropping every observation that is missing one or more IV values, assuming that the 94 used observations are exclusively observations with NO missing values on any of the IVs.&lt;BR /&gt;&lt;BR /&gt;Is there any way to avoid this listwise deletion?</description>
    <pubDate>Sat, 16 Nov 2019 14:08:23 GMT</pubDate>
    <dc:creator>ROLuke91</dc:creator>
    <dc:date>2019-11-16T14:08:23Z</dc:date>
    <item>
      <title>PROC MIXED Missing Data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/PROC-MIXED-Missing-Data/m-p/604576#M175296</link>
      <description>&lt;P&gt;Hi all,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am confused about the way PROC MIXED handles missing data.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am trying to use PROC MIXED with a maximum-likelihood specification to run a multiple linear regression (neither repeated measures nor mixed effects; just a straightforward multiple linear regression). I have substantial and varying amounts of missing data across my variables. I have a total N of 761, and the distribution of missing values is as follows:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;TABLE border="0" cellspacing="0" cellpadding="0"&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;Var&lt;/TD&gt;&lt;TD&gt;N&lt;/TD&gt;&lt;TD&gt;N Miss&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Y&lt;/TD&gt;&lt;TD&gt;323&lt;/TD&gt;&lt;TD&gt;438&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;X1&lt;/TD&gt;&lt;TD&gt;757&lt;/TD&gt;&lt;TD&gt;4&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;X2&lt;/TD&gt;&lt;TD&gt;321&lt;/TD&gt;&lt;TD&gt;440&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;X3&lt;/TD&gt;&lt;TD&gt;321&lt;/TD&gt;&lt;TD&gt;440&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;X4&lt;/TD&gt;&lt;TD&gt;761&lt;/TD&gt;&lt;TD&gt;0&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;X5&lt;/TD&gt;&lt;TD&gt;384&lt;/TD&gt;&lt;TD&gt;377&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;X6&lt;/TD&gt;&lt;TD&gt;547&lt;/TD&gt;&lt;TD&gt;214&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;X7&lt;/TD&gt;&lt;TD&gt;752&lt;/TD&gt;&lt;TD&gt;9&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;X8&lt;/TD&gt;&lt;TD&gt;761&lt;/TD&gt;&lt;TD&gt;0&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;X9&lt;/TD&gt;&lt;TD&gt;319&lt;/TD&gt;&lt;TD&gt;442&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;X10&lt;/TD&gt;&lt;TD&gt;319&lt;/TD&gt;&lt;TD&gt;442&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;My understanding was that PROC MIXED is not supposed to delete listwise. However, when I run the analysis, it excludes 667 observations (using only 94).
Is there any way I can configure PROC MIXED, or use any other relevant procedure, to run a multiple linear regression that will NOT listwise delete an observation when it encounters a missing value for an IV?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Luke&lt;/P&gt;</description>
      <pubDate>Fri, 15 Nov 2019 18:28:31 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/PROC-MIXED-Missing-Data/m-p/604576#M175296</guid>
      <dc:creator>ROLuke91</dc:creator>
      <dc:date>2019-11-15T18:28:31Z</dc:date>
    </item>
    <item>
      <title>Re: PROC MIXED Missing Data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/PROC-MIXED-Missing-Data/m-p/604596#M175301</link>
      <description>&lt;P&gt;I think what you are (mis)remembering is that for repeated measures ANOVA, the MIXED procedure does not perform listwise deletion, unlike the GLM formulation. That is because GLM uses the "wide" data format whereas MIXED uses a "long" data format. In the wide format, one missing measurement removes the subject's entire row, whereas in the long format only the row for that measurement is lost. But when you have continuous covariates for a linear regression, even the MIXED procedure will delete observations for which a covariate has a missing value.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;No method will be able to use the 438 observations that have missing response values, so the best you can do is use 323 observations to fit the model.&lt;/P&gt;</description>
      <pubDate>Fri, 15 Nov 2019 19:19:47 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/PROC-MIXED-Missing-Data/m-p/604596#M175301</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2019-11-15T19:19:47Z</dc:date>
    </item>
    <item>
      <title>Re: PROC MIXED Missing Data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/PROC-MIXED-Missing-Data/m-p/604699#M175335</link>
      <description>Thanks very much for the clarification; that makes sense.&lt;BR /&gt;&lt;BR /&gt;However, it looks to me as if it’s dropping every observation that is missing one or more IV values, assuming that the 94 used observations are exclusively observations with NO missing values on any of the IVs.&lt;BR /&gt;&lt;BR /&gt;Is there any way to avoid this listwise deletion?</description>
      <pubDate>Sat, 16 Nov 2019 14:08:23 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/PROC-MIXED-Missing-Data/m-p/604699#M175335</guid>
      <dc:creator>ROLuke91</dc:creator>
      <dc:date>2019-11-16T14:08:23Z</dc:date>
    </item>
    <item>
      <title>Re: PROC MIXED Missing Data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/PROC-MIXED-Missing-Data/m-p/604784#M175374</link>
      <description>&lt;P&gt;Correct. If an observation has a missing value in any IV, that observation cannot be used to fit the model. That is a mathematical fact. It has nothing to do with SAS or any other software.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Think about fitting a line to the following two-dimensional (X,Y) data:&lt;/P&gt;
&lt;P&gt;X Y&lt;/P&gt;
&lt;P&gt;0&amp;nbsp; 0&lt;/P&gt;
&lt;P&gt;.&amp;nbsp; 1&lt;/P&gt;
&lt;P&gt;.&amp;nbsp; 2&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Try to plot these points and then determine the line of best fit. It's impossible because only one observation has complete data.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Similarly, the procedure cannot use the observations that are not complete. It's not that SAS doesn't like missing values; it is simply that those observations do not provide any useful information for the fit.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
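&lt;P&gt;The toy example above can be sketched in a few lines of code. This is only an illustration, written in plain Python rather than SAS to underline that the behavior is not specific to any software. The helper names complete_cases and fit_line are hypothetical and are not part of any SAS procedure; None stands in for the SAS missing value ".".&lt;/P&gt;

```python
# Toy data from the post, as (X, Y) pairs. None marks a missing X value.
data = [(0, 0), (None, 1), (None, 2)]


def complete_cases(rows):
    """Keep only the rows in which no value is missing (listwise deletion)."""
    return [row for row in rows if all(v is not None for v in row)]


def fit_line(rows):
    """Ordinary least squares for y = a + b*x on complete rows.

    Returns (intercept, slope). Raises ValueError when the rows do not
    contain two or more distinct x values, because no unique line exists.
    """
    xs = [x for x, _ in rows]
    ys = [y for _, y in rows]
    if len(set(xs)) in (0, 1):
        raise ValueError("need two or more distinct x values to fit a line")
    n = len(rows)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


usable = complete_cases(data)
print(len(usable), "of", len(data), "rows are complete")  # 1 of 3 rows are complete

try:
    fit_line(usable)
except ValueError as err:
    print("cannot fit:", err)
```

&lt;P&gt;Only the row (0, 0) survives the filter, and a single point does not determine a line, so the fit is refused.&lt;/P&gt;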
&lt;P&gt;This is why some practitioners perform imputation to reduce the impact of missing data. Imputation replaces a missing value with a plausible nonmissing value. You can read the article &lt;A href="https://blogs.sas.com/content/iml/2017/12/04/mean-imputation-sas.html" target="_self"&gt;"Mean imputation in SAS"&lt;/A&gt;&amp;nbsp;to learn more about simple imputation methods.&lt;/P&gt;</description>
      <pubDate>Sun, 17 Nov 2019 11:24:13 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/PROC-MIXED-Missing-Data/m-p/604784#M175374</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2019-11-17T11:24:13Z</dc:date>
    </item>
    <item>
      <title>Re: PROC MIXED Missing Data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/PROC-MIXED-Missing-Data/m-p/605021#M175490</link>
      <description>&lt;P&gt;Thank you very much for all of your help and clarity. I moved on to work with PROC CALIS and FIML, but this was very helpful!&lt;/P&gt;</description>
      <pubDate>Mon, 18 Nov 2019 14:43:48 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/PROC-MIXED-Missing-Data/m-p/605021#M175490</guid>
      <dc:creator>ROLuke91</dc:creator>
      <dc:date>2019-11-18T14:43:48Z</dc:date>
    </item>
  </channel>
</rss>

