<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Analyzing positively skewed continuous outcome variable in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Analyzing-positively-skewed-continuous-outcome-variable/m-p/723874#M35088</link>
    <description>&lt;P&gt;Positively valued and skewed responses are often modeled using the gamma or inverse gaussian distribution as are available with the DIST= option in PROC GENMOD.&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Fri, 05 Mar 2021 14:19:39 GMT</pubDate>
    <dc:creator>StatDave</dc:creator>
    <dc:date>2021-03-05T14:19:39Z</dc:date>
    <item>
      <title>Analyzing positively skewed continuous outcome variable</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Analyzing-positively-skewed-continuous-outcome-variable/m-p/723858#M35086</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I am trying to model a continuous outcome variable which is highly skewed. I have several predictor variables in the model both continuous and categorical. The q-q plot of the residuals is shown below. As you can see, the normality assumption is clearly violated. I tried log transforming the outcome variable but it doesn't seem to fix the problem. Any body has an idea of how to remedy this issue ? Does the central limit theorem apply here?&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;&lt;P&gt;Here is the code used:&lt;/P&gt;&lt;P&gt;proc glmselect data=b;&lt;BR /&gt;class a b&amp;nbsp; c d e / param=reference;&lt;BR /&gt;model y=a&amp;nbsp; b c d e f ;&lt;BR /&gt;output out=check r=residuals;&lt;BR /&gt;run;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;proc univariate data=check;&lt;BR /&gt;var residuals;&lt;BR /&gt;histogram residuals / normal kernel;&lt;BR /&gt;qqplot residuals / normal(mu=est sigma=est);&lt;BR /&gt;run;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="chepchep_0-1614950220928.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/55495i11A89FD81D8F67BE/image-size/medium?v=v2&amp;amp;px=400" role="button" title="chepchep_0-1614950220928.png" alt="chepchep_0-1614950220928.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 05 Mar 2021 13:21:24 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Analyzing-positively-skewed-continuous-outcome-variable/m-p/723858#M35086</guid>
      <dc:creator>chepchep</dc:creator>
      <dc:date>2021-03-05T13:21:24Z</dc:date>
    </item>
    <item>
      <title>Re: Analyzing positively skewed continuous outcome variable</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Analyzing-positively-skewed-continuous-outcome-variable/m-p/723874#M35088</link>
      <description>&lt;P&gt;Positively valued and skewed responses are often modeled using the gamma or inverse gaussian distribution as are available with the DIST= option in PROC GENMOD.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 05 Mar 2021 14:19:39 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Analyzing-positively-skewed-continuous-outcome-variable/m-p/723874#M35088</guid>
      <dc:creator>StatDave</dc:creator>
      <dc:date>2021-03-05T14:19:39Z</dc:date>
    </item>
    <item>
      <title>Re: Analyzing positively skewed continuous outcome variable</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Analyzing-positively-skewed-continuous-outcome-variable/m-p/723933#M35095</link>
      <description>&lt;P&gt;What does the histogram of the residuals look like? Is there more than one mode? This would signal that you are missing some important effect, or some important interaction(s).&lt;/P&gt;</description>
      <pubDate>Fri, 05 Mar 2021 18:01:52 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Analyzing-positively-skewed-continuous-outcome-variable/m-p/723933#M35095</guid>
      <dc:creator>PGStats</dc:creator>
      <dc:date>2021-03-05T18:01:52Z</dc:date>
    </item>
    <item>
      <title>Re: Analyzing positively skewed continuous outcome variable</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Analyzing-positively-skewed-continuous-outcome-variable/m-p/723940#M35096</link>
      <description>&lt;P&gt;This is how the histogram looks like:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="chepchep_0-1614968368545.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/55500iF0D6F7B769B0EA1E/image-size/medium?v=v2&amp;amp;px=400" role="button" title="chepchep_0-1614968368545.png" alt="chepchep_0-1614968368545.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 05 Mar 2021 18:20:45 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Analyzing-positively-skewed-continuous-outcome-variable/m-p/723940#M35096</guid>
      <dc:creator>chepchep</dc:creator>
      <dc:date>2021-03-05T18:20:45Z</dc:date>
    </item>
    <item>
      <title>Re: Analyzing positively skewed continuous outcome variable</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Analyzing-positively-skewed-continuous-outcome-variable/m-p/723943#M35097</link>
      <description>&lt;P&gt;Thanks for your input. So how do I know which of the two to use, can either one of them work?&lt;/P&gt;</description>
      <pubDate>Fri, 05 Mar 2021 18:22:40 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Analyzing-positively-skewed-continuous-outcome-variable/m-p/723943#M35097</guid>
      <dc:creator>chepchep</dc:creator>
      <dc:date>2021-03-05T18:22:40Z</dc:date>
    </item>
    <item>
      <title>Re: Analyzing positively skewed continuous outcome variable</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Analyzing-positively-skewed-continuous-outcome-variable/m-p/723955#M35098</link>
      <description>&lt;P&gt;Personally, I would work on developing a better model. Use regression diagnostic plots to analyze whether you should include second-order interaction terms in the model. Since you are using PROC GLMSELECT, you can add in all second-order terms and use variable selection to see if any interactions improve the fit enough to make it into the final model.&lt;/P&gt;</description>
      <pubDate>Fri, 05 Mar 2021 18:31:14 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Analyzing-positively-skewed-continuous-outcome-variable/m-p/723955#M35098</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2021-03-05T18:31:14Z</dc:date>
    </item>
    <item>
      <title>Re: Analyzing positively skewed continuous outcome variable</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Analyzing-positively-skewed-continuous-outcome-variable/m-p/723967#M35099</link>
      <description>&lt;P&gt;You can use PROC SEVERITY in SAS/ETS to assess the fit of several distributions, including gamma and inverse gaussian and others. For example:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;proc severity data=b crit=aicc;
   loss y;
   dist _predefined_;
run;
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Fri, 05 Mar 2021 18:59:00 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Analyzing-positively-skewed-continuous-outcome-variable/m-p/723967#M35099</guid>
      <dc:creator>StatDave</dc:creator>
      <dc:date>2021-03-05T18:59:00Z</dc:date>
    </item>
    <item>
      <title>Re: Analyzing positively skewed continuous outcome variable</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Analyzing-positively-skewed-continuous-outcome-variable/m-p/724536#M35124</link>
      <description>&lt;P&gt;Thank you so much for your input. So I did use the proc severity to select the the distribution that best fits my data and the Burr distribution was selected. This is a distribution that I am not very familiar with. How do I fit a Burr distribution in SAS?&lt;/P&gt;&lt;P&gt;Here is the partial output from proc severity:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;Distribution Converged AICC SelectedBurrExpGammaIgaussLognParetoGpdWeibull &lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;Yes&lt;/TD&gt;&lt;TD&gt;361869&lt;/TD&gt;&lt;TD&gt;Yes&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Yes&lt;/TD&gt;&lt;TD&gt;382973&lt;/TD&gt;&lt;TD&gt;No&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Yes&lt;/TD&gt;&lt;TD&gt;371532&lt;/TD&gt;&lt;TD&gt;No&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Yes&lt;/TD&gt;&lt;TD&gt;368401&lt;/TD&gt;&lt;TD&gt;No&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Yes&lt;/TD&gt;&lt;TD&gt;365528&lt;/TD&gt;&lt;TD&gt;No&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Yes&lt;/TD&gt;&lt;TD&gt;382977&lt;/TD&gt;&lt;TD&gt;No&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Yes&lt;/TD&gt;&lt;TD&gt;382975&lt;/TD&gt;&lt;TD&gt;No&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Yes&lt;/TD&gt;&lt;TD&gt;377067&lt;/TD&gt;&lt;TD&gt;No&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Mon, 08 Mar 2021 15:47:15 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Analyzing-positively-skewed-continuous-outcome-variable/m-p/724536#M35124</guid>
      <dc:creator>chepchep</dc:creator>
      <dc:date>2021-03-08T15:47:15Z</dc:date>
    </item>
    <item>
      <title>Re: Analyzing positively skewed continuous outcome variable</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Analyzing-positively-skewed-continuous-outcome-variable/m-p/724541#M35125</link>
      <description>&lt;P&gt;I suggest you look at the plots (CDF/EDF and PDF) to visually assess how close the other distributions are to the EDF of the observed data. It's not so much a matter of picking the one with the lowest AICC as it is rejecting distributions that clearly don't fit well and picking one that does fit reasonably well.&lt;/P&gt;</description>
      <pubDate>Mon, 08 Mar 2021 16:00:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Analyzing-positively-skewed-continuous-outcome-variable/m-p/724541#M35125</guid>
      <dc:creator>StatDave</dc:creator>
      <dc:date>2021-03-08T16:00:55Z</dc:date>
    </item>
    <item>
      <title>Re: Analyzing positively skewed continuous outcome variable</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Analyzing-positively-skewed-continuous-outcome-variable/m-p/724558#M35127</link>
      <description>&lt;P&gt;Thank you so much!&lt;/P&gt;</description>
      <pubDate>Mon, 08 Mar 2021 16:34:48 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Analyzing-positively-skewed-continuous-outcome-variable/m-p/724558#M35127</guid>
      <dc:creator>chepchep</dc:creator>
      <dc:date>2021-03-08T16:34:48Z</dc:date>
    </item>
  </channel>
</rss>

