<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: how to pick outlier /influential obs in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/how-to-pick-outlier-influential-obs/m-p/567511#M27935</link>
    <description>&lt;P&gt;Thank You for your help. It worked.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Thu, 20 Jun 2019 02:59:06 GMT</pubDate>
    <dc:creator>asifgeneral2</dc:creator>
    <dc:date>2019-06-20T02:59:06Z</dc:date>
    <item>
      <title>how to pick outlier /influential obs</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/how-to-pick-outlier-influential-obs/m-p/566774#M27922</link>
      <description>&lt;P&gt;Hello&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am referring the &lt;STRONG&gt;ANOVA and regression tutorial by SAS&lt;/STRONG&gt;, and here is the code the tutor has used for identifying for potential outlier/influential obs&amp;nbsp; &amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;%let interval=Gr_Liv_area Basement_area Deck_porch_area Lot_area Age_sold Bedroom_abvGr Total_bathroom;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;ods select none;&lt;BR /&gt;proc glmselect data=stat1.ameshousing3 plots=all;&lt;BR /&gt;Stepwise model saleprice = &amp;amp;interval / selection=stepwise details=steps select=SL slentry=0.05 slstay=0.05;&lt;BR /&gt;run;&lt;BR /&gt;quit;&lt;BR /&gt;ods select all;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;ods graphics on;&lt;BR /&gt;ods output RSTUDENTBYPREDICTED=Rstud&lt;BR /&gt;COOKSDPLOT=Cook&lt;BR /&gt;DFFITSPLOT=Dffits&lt;BR /&gt;DFBETASPANEL=Dfbs;&lt;BR /&gt;proc reg data=stat1.ameshousing3&lt;BR /&gt;plots(only label)=&lt;BR /&gt;(RSTUDENTBYPREDICTED COOKSD DFFITS DFBETAS);&lt;BR /&gt;Siglimit: model salesprice=&amp;amp;_GLSIND;&lt;BR /&gt;title 'siglimit model plots of diagnostics stats';&lt;BR /&gt;run;&lt;BR /&gt;quit;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;My question how can I identify potential outlier and influential obs, if I am working with a binary dependent variable and using proc logistic. I have a binary dependent variable where a bad customer coded as 0 and good coded as 1. Can you please help. Thanks&amp;nbsp;&amp;nbsp;&lt;/STRONG&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 18 Jun 2019 03:58:57 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/how-to-pick-outlier-influential-obs/m-p/566774#M27922</guid>
      <dc:creator>asifgeneral2</dc:creator>
      <dc:date>2019-06-18T03:58:57Z</dc:date>
    </item>
    <item>
      <title>Re: how to pick outlier /influential obs</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/how-to-pick-outlier-influential-obs/m-p/566829#M27923</link>
      <description>&lt;P&gt;In practice, you can often use the binary response variable as the response variable in a linear regression model and it works surprisingly well. But don't tell anyone that I said that! &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;For linear regression, the influence diagnostics include&lt;A href="https://blogs.sas.com/content/iml/2019/06/17/influence-regression-dfbeta.html" target="_self"&gt; the DFBETAS statistics&lt;/A&gt;, the DFFITS statistics, and Cook's distance (D). Some people also look at the leverage statistic (H). Similar "deletion diagnostics" statistics are available and &lt;A href="https://go.documentation.sas.com/?docsetId=statug&amp;amp;docsetTarget=statug_logistic_details52.htm&amp;amp;docsetVersion=15.1&amp;amp;locale=en" target="_self"&gt;documented in PROC LOGISTIC.&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;- The DFBETAS=_ALL_ option writes the DFBETAS to the output data set.&lt;/P&gt;
&lt;P&gt;- The H= option outputs the leverage statistics&amp;nbsp;&lt;/P&gt;
&lt;P&gt;- There are various kinds of residuals in logistic models, so I'll let you read about the other options.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You can use the PLOTS=INFLUENCE option on the PROC LOGISTIC statement to get plots. You can use the INFLUENCE option on the MODEL statement to display a table.&lt;/P&gt;</description>
      <pubDate>Tue, 18 Jun 2019 12:22:11 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/how-to-pick-outlier-influential-obs/m-p/566829#M27923</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2019-06-18T12:22:11Z</dc:date>
    </item>
    <item>
      <title>Re: how to pick outlier /influential obs</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/how-to-pick-outlier-influential-obs/m-p/566831#M27924</link>
      <description>&lt;P&gt;sDo you check the documentation of PROC LOGISTIC ,especially its example .&lt;/P&gt;
&lt;P&gt;Check Cbar and H(Cook D) statistic.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;proc logistic data=want outest=est(keep=intercept &amp;amp;varlist);&lt;BR /&gt;model good_bad(event='good')= &amp;amp;varlist &lt;BR /&gt;/outroc=x.roc lackfit scale=none aggregate rsquare firth corrb /* selection=stepwise sle=0.1 sls=0.1*/ ;&lt;BR /&gt;output out=output &lt;STRONG&gt;h=h c=c&lt;/STRONG&gt; cbar=cbar predicted=PredProb;&lt;BR /&gt;run;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;proc sort data=output out=check_c ;&lt;BR /&gt;by &lt;STRONG&gt;descending c&lt;/STRONG&gt;;&lt;BR /&gt;run;&lt;BR /&gt;proc sort data=output out=check_h ;&lt;BR /&gt;by descending h;&lt;BR /&gt;run;&lt;/P&gt;</description>
      <pubDate>Tue, 18 Jun 2019 12:26:02 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/how-to-pick-outlier-influential-obs/m-p/566831#M27924</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2019-06-18T12:26:02Z</dc:date>
    </item>
    <item>
      <title>Re: how to pick outlier /influential obs</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/how-to-pick-outlier-influential-obs/m-p/566867#M27926</link>
      <description>&lt;P&gt;All good points from&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13684"&gt;@Rick_SAS&lt;/a&gt;&amp;nbsp;and&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/18408"&gt;@Ksharp&lt;/a&gt;&amp;nbsp;.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I would add that DFBETAS, DFFITS and Cook's D from PROC REG really don't apply in the logistic case where the response is binary or ordinal or nominal, because these statistics from PROC REG assume you have continuous Y values, and I would not trust them if Y is not continuous. On the other hand the H (leverage) statistic does not use the value of Y, so it doesn't matter is Y is continuous or not. The other diagnostic statistics from PROC LOGISTIC that have been mentioned all use the proper estimation (maximum likelihood) for the effect on the regression line which takes into account that the response is binary or ordinal or nominal.&lt;/P&gt;</description>
      <pubDate>Tue, 18 Jun 2019 13:44:47 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/how-to-pick-outlier-influential-obs/m-p/566867#M27926</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2019-06-18T13:44:47Z</dc:date>
    </item>
    <item>
      <title>Re: how to pick outlier /influential obs</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/how-to-pick-outlier-influential-obs/m-p/567511#M27935</link>
      <description>&lt;P&gt;Thank You for your help. It worked.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 20 Jun 2019 02:59:06 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/how-to-pick-outlier-influential-obs/m-p/567511#M27935</guid>
      <dc:creator>asifgeneral2</dc:creator>
      <dc:date>2019-06-20T02:59:06Z</dc:date>
    </item>
  </channel>
</rss>

