<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How differently do SAS and STATA deal with missing values when running regressions? in SAS Forecasting and Econometrics</title>
    <link>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/How-differently-do-SAS-and-STATA-deal-with-missing-values-when/m-p/196539#M1223</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Sorry for the confusion. mhi_grp1-4 were derived from mhi_ctg as mutually exclusive dummy variables. Since mhi_grp2-4 were included in both regressions, I expected both of them to run on the same data set. Thanks.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Wed, 15 Apr 2015 17:27:07 GMT</pubDate>
    <dc:creator>lizzy28</dc:creator>
    <dc:date>2015-04-15T17:27:07Z</dc:date>
    <item>
      <title>How differently do SAS and STATA deal with missing values when running regressions?</title>
      <link>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/How-differently-do-SAS-and-STATA-deal-with-missing-values-when/m-p/196534#M1218</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I used both SAS and STATA to run a log-linear regression on the same dataset, and the coefficient magnitudes came out somewhat different. One of the variables in my dataset has 18% missing values. I was wondering whether the difference arose because SAS applies imputation when running a regression.&lt;/P&gt;&lt;P&gt;Does anyone know how SAS and STATA differ in running regressions on data with missing values?&lt;/P&gt;&lt;P&gt;Thanks a lot.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 15 Apr 2015 13:44:15 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/How-differently-do-SAS-and-STATA-deal-with-missing-values-when/m-p/196534#M1218</guid>
      <dc:creator>lizzy28</dc:creator>
      <dc:date>2015-04-15T13:44:15Z</dc:date>
    </item>
    <item>
      <title>Re: How differently do SAS and STATA deal with missing values when running regressions?</title>
      <link>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/How-differently-do-SAS-and-STATA-deal-with-missing-values-when/m-p/196535#M1219</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I don't know about STATA, but most SAS regression procedures remove from the analysis any record that is missing a value for any of the model variables. The procedure's output should tell you how many records were actually used.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 15 Apr 2015 15:13:45 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/How-differently-do-SAS-and-STATA-deal-with-missing-values-when/m-p/196535#M1219</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2015-04-15T15:13:45Z</dc:date>
    </item>
    <item>
      <title>Re: How differently do SAS and STATA deal with missing values when running regressions?</title>
      <link>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/How-differently-do-SAS-and-STATA-deal-with-missing-values-when/m-p/196536#M1220</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;PRE __jive_macro_name="quote" class="jive_text_macro jive_macro_quote"&gt;
&lt;P&gt;I was wondering whether it was because SAS applied imputation when running regression.&lt;/P&gt;

&lt;/PRE&gt;&lt;P&gt;SAS does not impute missing values in regression. It excludes from the regression calculations any observation that has a missing value in any of the model terms.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 15 Apr 2015 15:28:02 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/How-differently-do-SAS-and-STATA-deal-with-missing-values-when/m-p/196536#M1220</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2015-04-15T15:28:02Z</dc:date>
    </item>
    <item>
      <title>Re: How differently do SAS and STATA deal with missing values when running regressions?</title>
      <link>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/How-differently-do-SAS-and-STATA-deal-with-missing-values-when/m-p/196537#M1221</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thanks, Ballard.&lt;/P&gt;&lt;P&gt;But when I ran the regression with the missing values explicitly excluded, the results differed from those I got without excluding them. The key difference is that the coefficient values were completely different.&lt;/P&gt;&lt;P&gt;I'm attaching my data to this thread. The code I used was:&lt;/P&gt;&lt;P&gt;proc reg data=temp;&lt;/P&gt;&lt;P&gt;&amp;nbsp; model lcost_all_adj=age_diag_gp1 age_diag_gp2 age_diag_gp4 age_diag_gp5 mhi_grp2 mhi_grp3 mhi_grp4 days_debr_grp2 days_debr_grp3 days_debr_grp4 flap&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; negpthp days_dswi_grp2 days_dswi_grp3 days_dswi_grp4 los_cs_grp2 los_cs_grp3 los_cs_grp4 los_cs_grp5 comorbid_grp2 comorbid_grp3&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; sepsis transf_bleedcomp;&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;P&gt;This used all 1198 observations, as shown below:&lt;/P&gt;&lt;TABLE cellpadding="5" cellspacing="0" class="table" frame="box" rules="all" summary="Procedure Reg: Number of Observations"&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TH class="l rowheader" scope="row"&gt;Number of Observations Read&lt;/TH&gt;&lt;TD class="r data"&gt;1198&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TH class="l rowheader" scope="row"&gt;Number of Observations Used&lt;/TH&gt;&lt;TD class="r data"&gt;1198&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&lt;/P&gt;&lt;TABLE cellpadding="5" cellspacing="0" class="table" frame="box" rules="all" summary="Procedure Reg: Analysis of Variance"&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;&lt;/TD&gt;&lt;TD&gt;&lt;/TD&gt;&lt;TD&gt;Analysis of Variance&lt;/TD&gt;&lt;TD&gt;&lt;/TD&gt;&lt;TD&gt;&lt;/TD&gt;&lt;TD&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD colspan="1"&gt;Source&lt;/TD&gt;&lt;TD colspan="1"&gt;DF&lt;/TD&gt;&lt;TD colspan="1"&gt;Sum of Squares&lt;/TD&gt;&lt;TD colspan="1"&gt;Mean Square&lt;/TD&gt;&lt;TD colspan="1"&gt;F Value&lt;/TD&gt;&lt;TD colspan="1"&gt;Pr &amp;gt; F&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TH class="l rowheader" scope="row"&gt;Model&lt;/TH&gt;&lt;TD class="r data"&gt;23&lt;/TD&gt;&lt;TD class="r data"&gt;441.70924&lt;/TD&gt;&lt;TD class="r data"&gt;19.20475&lt;/TD&gt;&lt;TD class="r data"&gt;24.58&lt;/TD&gt;&lt;TD class="r data"&gt;&amp;lt;.0001&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TH class="l rowheader" scope="row"&gt;Error&lt;/TH&gt;&lt;TD class="r data"&gt;1174&lt;/TD&gt;&lt;TD class="r data"&gt;917.16903&lt;/TD&gt;&lt;TD class="r data"&gt;0.78123&lt;/TD&gt;&lt;TD class="r data"&gt;&lt;/TD&gt;&lt;TD class="r data"&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TH class="l rowheader" scope="row"&gt;Corrected Total&lt;/TH&gt;&lt;TD class="r data"&gt;1197&lt;/TD&gt;&lt;TD class="r data"&gt;1358.87827&lt;/TD&gt;&lt;TD class="r data"&gt;&lt;/TD&gt;&lt;TD class="r data"&gt;&lt;/TD&gt;&lt;TD class="r data"&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;However, when I explicitly excluded the missing values, as below:&lt;/P&gt;&lt;P&gt;proc reg data=temp;&lt;/P&gt;&lt;P&gt;&amp;nbsp; model lcost_all_adj=age_diag_gp1 age_diag_gp2 age_diag_gp4 age_diag_gp5 mhi_grp2 mhi_grp3 mhi_grp4 days_debr_grp2 days_debr_grp3 days_debr_grp4 flap&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; negpthp days_dswi_grp2 days_dswi_grp3 days_dswi_grp4 los_cs_grp2 los_cs_grp3 los_cs_grp4 los_cs_grp5 comorbid_grp2 comorbid_grp3&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; sepsis transf_bleedcomp;&lt;/P&gt;&lt;P&gt;&amp;nbsp; where mhi_ctg^=.;&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;P&gt;I got:&lt;/P&gt;&lt;TABLE cellpadding="5" cellspacing="0" class="table" frame="box" rules="all" summary="Procedure Reg: Number of Observations"&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TH class="l rowheader" scope="row"&gt;Number of Observations Read&lt;/TH&gt;&lt;TD class="r data"&gt;992&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TH class="l rowheader" scope="row"&gt;Number of Observations Used&lt;/TH&gt;&lt;TD class="r data"&gt;992&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&lt;/P&gt;&lt;TABLE border="0" cellpadding="0" cellspacing="0" height="149" style="width: 426px; height: 155px;"&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD class="xl64" colspan="6" height="20" width="372"&gt;Analysis of Variance&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD height="20"&gt;Source&lt;/TD&gt;&lt;TD&gt;DF&lt;/TD&gt;&lt;TD class="xl63"&gt;Sum of Squares&lt;/TD&gt;&lt;TD&gt;Mean Square&lt;/TD&gt;&lt;TD&gt;F Value&lt;/TD&gt;&lt;TD class="xl63"&gt;Pr &amp;gt; F&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD height="20"&gt;Model&lt;/TD&gt;&lt;TD align="right"&gt;23&lt;/TD&gt;&lt;TD align="right" class="xl63"&gt;381.081&lt;/TD&gt;&lt;TD align="right"&gt;16.56874&lt;/TD&gt;&lt;TD align="right"&gt;21.28&lt;/TD&gt;&lt;TD class="xl63"&gt;&amp;lt;.0001&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD height="20"&gt;Error&lt;/TD&gt;&lt;TD align="right"&gt;968&lt;/TD&gt;&lt;TD align="right" class="xl63"&gt;753.8316&lt;/TD&gt;&lt;TD align="right"&gt;0.77875&lt;/TD&gt;&lt;TD&gt;&lt;/TD&gt;&lt;TD class="xl63"&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD height="20"&gt;Corrected Total&lt;/TD&gt;&lt;TD align="right"&gt;991&lt;/TD&gt;&lt;TD align="right" class="xl63"&gt;1134.913&lt;/TD&gt;&lt;TD&gt;&lt;/TD&gt;&lt;TD&gt;&lt;/TD&gt;&lt;TD class="xl63"&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 15 Apr 2015 16:11:12 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/How-differently-do-SAS-and-STATA-deal-with-missing-values-when/m-p/196537#M1221</guid>
      <dc:creator>lizzy28</dc:creator>
      <dc:date>2015-04-15T16:11:12Z</dc:date>
    </item>
    <item>
      <title>Re: How differently do SAS and STATA deal with missing values when running regressions?</title>
      <link>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/How-differently-do-SAS-and-STATA-deal-with-missing-values-when/m-p/196538#M1222</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Since your variable mhi_ctg does not appear as a model variable in the first PROC REG step, records with a missing mhi_ctg were not filtered out there. When you add the WHERE statement on mhi_ctg in the second step, you exclude records that have non-missing values for all of the model variables: 206 of them (1198 - 992). I would expect different results with about 17% of the records excluded.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 15 Apr 2015 17:16:30 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/How-differently-do-SAS-and-STATA-deal-with-missing-values-when/m-p/196538#M1222</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2015-04-15T17:16:30Z</dc:date>
    </item>
    <item>
      <title>Re: How differently do SAS and STATA deal with missing values when running regressions?</title>
      <link>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/How-differently-do-SAS-and-STATA-deal-with-missing-values-when/m-p/196539#M1223</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Sorry for the confusion. mhi_grp1-4 were derived from mhi_ctg as mutually exclusive dummy variables. Since mhi_grp2-4 were included in both regressions, I expected both of them to run on the same data set. Thanks.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 15 Apr 2015 17:27:07 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/How-differently-do-SAS-and-STATA-deal-with-missing-values-when/m-p/196539#M1223</guid>
      <dc:creator>lizzy28</dc:creator>
      <dc:date>2015-04-15T17:27:07Z</dc:date>
    </item>
    <item>
      <title>Re: How differently do SAS and STATA deal with missing values when running regressions?</title>
      <link>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/How-differently-do-SAS-and-STATA-deal-with-missing-values-when/m-p/196540#M1224</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I figured out what the problem was. After I recoded mhi_ctg into four dummy variables, mhi_grp1-4, mhi_grp1 was left out of the regression, so the observations with missing values in mhi_grp1 were treated the same way as those with mhi_grp1 equal to 1, that is, as part of the reference group.&lt;/P&gt;&lt;P&gt;Thank you, Ballard and Paige!&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 15 Apr 2015 17:56:41 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/How-differently-do-SAS-and-STATA-deal-with-missing-values-when/m-p/196540#M1224</guid>
      <dc:creator>lizzy28</dc:creator>
      <dc:date>2015-04-15T17:56:41Z</dc:date>
    </item>
  </channel>
</rss>

