<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic ANOVA for skewed data with complex survey design? (SURVEYREG? PROC GLM?) in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/ANOVA-for-skewed-data-with-complex-survey-design-SURVEYREG-PROC/m-p/573929#M28217</link>
    <description>&lt;P&gt;Hi SAS friends! Hoping for some advice (and new ideas) here...&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am trying to use ANOVA to evaluate the relationship between an independent categorical variable with multiple levels and a dependent continuous variable.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I used PROC GLM to conduct my test and also requested some nonparametric test options and tests for unequal variance (Levene's, Welch's ANOVA). The distribution of my dependent variable is heavily skewed.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Here's my original code:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;ods graphics on;

proc glm data = mydata plots(maxpoints=none)=diagnostics;
	class  independent;
	model dependent = independent;
	means  independent/hovtest welch;
run;

ods graphics off;&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Then I realized that, since the survey design includes weighting and stratification variables, I needed to take those into account. PROC GLM allowed me to add the weighting variable but doesn't appear to have options for nonparametric tests. I switched to PROC SURVEYREG, which allowed for the inclusion of both weighting and stratification variables but still offers no test options beyond the initial ANOVA.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Here's my amended code:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;proc surveyreg data = mydata;
     weight weightvar;
     strata stratavar;
     class independent;   /* treat the grouping variable as categorical */
     model dependent = independent / anova;
run;&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Should I be using a different PROC? A totally different test? Is there an option that I'm missing in SURVEYREG? Help!&lt;/P&gt;</description>
    <pubDate>Tue, 16 Jul 2019 16:32:14 GMT</pubDate>
    <dc:creator>clambert22</dc:creator>
    <dc:date>2019-07-16T16:32:14Z</dc:date>
    <item>
      <title>ANOVA for skewed data with complex survey design? (SURVEYREG? PROC GLM?)</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/ANOVA-for-skewed-data-with-complex-survey-design-SURVEYREG-PROC/m-p/573929#M28217</link>
      <description>&lt;P&gt;Hi SAS friends! Hoping for some advice (and new ideas) here...&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am trying to use ANOVA to evaluate the relationship between an independent categorical variable with multiple levels and a dependent continuous variable.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I used PROC GLM to conduct my test and also requested some nonparametric test options and tests for unequal variance (Levene's, Welch's ANOVA). The distribution of my dependent variable is heavily skewed.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Here's my original code:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;ods graphics on;

proc glm data = mydata plots(maxpoints=none)=diagnostics;
	class  independent;
	model dependent = independent;
	means  independent/hovtest welch;
run;

ods graphics off;&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Then I realized that, since the survey design includes weighting and stratification variables, I needed to take those into account. PROC GLM allowed me to add the weighting variable but doesn't appear to have options for nonparametric tests. I switched to PROC SURVEYREG, which allowed for the inclusion of both weighting and stratification variables but still offers no test options beyond the initial ANOVA.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Here's my amended code:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;proc surveyreg data = mydata;
     weight weightvar;
     strata stratavar;
     class independent;   /* treat the grouping variable as categorical */
     model dependent = independent / anova;
run;&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Should I be using a different PROC? A totally different test? Is there an option that I'm missing in SURVEYREG? Help!&lt;/P&gt;</description>
      <pubDate>Tue, 16 Jul 2019 16:32:14 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/ANOVA-for-skewed-data-with-complex-survey-design-SURVEYREG-PROC/m-p/573929#M28217</guid>
      <dc:creator>clambert22</dc:creator>
      <dc:date>2019-07-16T16:32:14Z</dc:date>
    </item>
    <item>
      <title>Re: ANOVA for skewed data with complex survey design? (SURVEYREG? PROC GLM?)</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/ANOVA-for-skewed-data-with-complex-survey-design-SURVEYREG-PROC/m-p/573939#M28218</link>
      <description>&lt;P&gt;The data can be skewed; this isn't a problem for GLM or SURVEYREG. The actual condition required is that the residuals (the differences between the actual and predicted values) are normally distributed. You can examine the residuals and see whether they follow a normal distribution.&lt;/P&gt;
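&lt;P&gt;As a rough sketch of that check (one way among several): the OUTPUT statement in PROC GLM can save the residuals, and PROC UNIVARIATE can then test and plot them. The data set name resids and the variable name resid are placeholders made up for this example; mydata, dependent and independent come from your code.&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;/* Fit the one-way model and save the residuals */
proc glm data = mydata;
	class independent;
	model dependent = independent;
	output out = resids r = resid;   /* resids and resid are hypothetical names */
run;
quit;

/* Normality tests, histogram and Q-Q plot of the saved residuals */
proc univariate data = resids normal;
	var resid;
	histogram resid / normal;
	qqplot resid / normal(mu=est sigma=est);
run;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;This version ignores the survey weights, so treat it as a first look at the shape of the residuals rather than a design-based check.&lt;/P&gt;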
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Assuming the residuals are normally distributed, I would think that SURVEYREG would handle the weighting properly.&lt;/P&gt;</description>
      <pubDate>Tue, 16 Jul 2019 17:08:27 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/ANOVA-for-skewed-data-with-complex-survey-design-SURVEYREG-PROC/m-p/573939#M28218</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2019-07-16T17:08:27Z</dc:date>
    </item>
    <item>
      <title>Re: ANOVA for skewed data with complex survey design? (SURVEYREG? PROC GLM?)</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/ANOVA-for-skewed-data-with-complex-survey-design-SURVEYREG-PROC/m-p/573943#M28219</link>
      <description>&lt;P&gt;Thanks for the reply!! &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I checked and unfortunately the residuals are also heavily skewed. I'm thinking that maybe I'm just not looking at this correctly and need to adjust which test I'm using/my research question?&lt;/P&gt;</description>
      <pubDate>Tue, 16 Jul 2019 17:21:15 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/ANOVA-for-skewed-data-with-complex-survey-design-SURVEYREG-PROC/m-p/573943#M28219</guid>
      <dc:creator>clambert22</dc:creator>
      <dc:date>2019-07-16T17:21:15Z</dc:date>
    </item>
    <item>
      <title>Re: ANOVA for skewed data with complex survey design? (SURVEYREG? PROC GLM?)</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/ANOVA-for-skewed-data-with-complex-survey-design-SURVEYREG-PROC/m-p/573944#M28220</link>
      <description>&lt;P&gt;Can you show us a screen capture of the residual plot?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If they are skewed, perhaps a transformation of the data would help (depending on the severity of the skewing) to achieve the normal distribution of the residuals.&lt;/P&gt;</description>
      <pubDate>Tue, 16 Jul 2019 17:25:41 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/ANOVA-for-skewed-data-with-complex-survey-design-SURVEYREG-PROC/m-p/573944#M28220</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2019-07-16T17:25:41Z</dc:date>
    </item>
    <item>
      <title>Re: ANOVA for skewed data with complex survey design? (SURVEYREG? PROC GLM?)</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/ANOVA-for-skewed-data-with-complex-survey-design-SURVEYREG-PROC/m-p/573998#M28223</link>
      <description>&lt;P&gt;Here are all of the diagnostic plots (let me know if this is what you meant!). Thank you so much for your help!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-right" image-alt="screencap.png" style="width: 600px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/31080i81D82CEE80C8E880/image-size/large?v=v2&amp;amp;px=999" role="button" title="screencap.png" alt="screencap.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 16 Jul 2019 20:19:58 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/ANOVA-for-skewed-data-with-complex-survey-design-SURVEYREG-PROC/m-p/573998#M28223</guid>
      <dc:creator>clambert22</dc:creator>
      <dc:date>2019-07-16T20:19:58Z</dc:date>
    </item>
    <item>
      <title>Re: ANOVA for skewed data with complex survey design? (SURVEYREG? PROC GLM?)</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/ANOVA-for-skewed-data-with-complex-survey-design-SURVEYREG-PROC/m-p/574141#M28230</link>
      <description>&lt;P&gt;Obviously, the residuals are not normally distributed, and it's not obvious to me that you can transform the data to make them normal. So, I would then consider non-parametric methods, although I'm not sure how the survey weights would apply.&lt;/P&gt;</description>
      <pubDate>Wed, 17 Jul 2019 12:10:30 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/ANOVA-for-skewed-data-with-complex-survey-design-SURVEYREG-PROC/m-p/574141#M28230</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2019-07-17T12:10:30Z</dc:date>
    </item>
    <item>
      <title>Re: ANOVA for skewed data with complex survey design? (SURVEYREG? PROC GLM?)</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/ANOVA-for-skewed-data-with-complex-survey-design-SURVEYREG-PROC/m-p/951703#M47605</link>
      <description>&lt;P&gt;I agree with what &lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/10892"&gt;@PaigeMiller&lt;/a&gt;&amp;nbsp;has said. The residuals of your model are clearly non-normal, so neither ANOVA nor linear regression is applicable.&lt;/P&gt;
&lt;P&gt;My suggestion is to use nonparametric tests or quantile regression. Both compare quantiles across groups rather than means, which are the statistic that ANOVA compares among groups.&lt;/P&gt;
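&lt;P&gt;To give a rough idea of what comparing quantiles could look like in SAS, here is a minimal sketch using the variable names from the original post. It is only an illustration, not the program from any of the papers cited in this reply, and it assumes your SAS/STAT release supports quantiles in domain analysis. PROC SURVEYMEANS estimates design-based quartiles (including the median) for each group via the DOMAIN statement. PROC QUANTREG fits a weighted median regression, but its standard errors do not take the strata into account, so I would treat its p-values as approximate at best.&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;/* Design-based quartiles of the outcome within each group */
proc surveymeans data = mydata quartiles;
	strata stratavar;
	weight weightvar;
	domain independent;
	var dependent;
run;

/* Weighted median regression; the strata are NOT used for the standard errors */
proc quantreg data = mydata;
	class independent;
	model dependent = independent / quantile = 0.5;
	weight weightvar;
run;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;The first step gives descriptive, design-based estimates with confidence limits for each group; a formal test of equal medians would need something more.&lt;/P&gt;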
&lt;P&gt;For instance,&amp;nbsp;&lt;A href="https://www.sciencedirect.com/science/article/pii/S0169260714002958" target="_blank"&gt;Median and quantile tests under complex survey design using SAS and R - ScienceDirect&lt;/A&gt;&amp;nbsp;contains a SAS program for testing for the equality of medians and other quantiles among groups.&amp;nbsp;&lt;A href="https://academic.oup.com/jssam/article-abstract/7/2/157/5146447?redirectedFrom=fulltext&amp;amp;login=false" target="_blank"&gt;Quantile Regression Analysis of Survey Data Under Informative Sampling | Journal of Survey Statistics and Methodology | Oxford Academic&lt;/A&gt;&amp;nbsp;and other research papers discuss quantile regression of complex survey data.&lt;/P&gt;
&lt;P&gt;There is also a caveat about the ANOVA option in the SURVEYREG procedure. As page 104 of&amp;nbsp;&lt;A href="https://www.taylorfrancis.com/books/mono/10.1201/9781315366906/complex-survey-data-analysis-sas-taylor-lewis" target="_self"&gt;Complex Survey Data Analysis with SAS | Taylor H. Lewis | Taylor &amp;amp; Francis&lt;/A&gt;&amp;nbsp;says, this option should be avoided because the &lt;EM&gt;F&lt;/EM&gt; statistic in the ANOVA table reported by PROC SURVEYREG does not carry the same interpretation as it has for simple random sampling data and should be ignored. In other words, the &lt;EM&gt;F&lt;/EM&gt; statistic reported in the ANOVA table is not the test statistic for the null hypothesis that all parameters except the intercept are jointly equal to zero. Instead, the author advises analysts to refer to the "Model" row of the table entitled "Tests of Model Effects" in the SURVEYREG output for the correct&amp;nbsp;statistic.&lt;/P&gt;</description>
      <pubDate>Sat, 23 Nov 2024 13:12:35 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/ANOVA-for-skewed-data-with-complex-survey-design-SURVEYREG-PROC/m-p/951703#M47605</guid>
      <dc:creator>Season</dc:creator>
      <dc:date>2024-11-23T13:12:35Z</dc:date>
    </item>
  </channel>
</rss>

