<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Unbalanced data using MIXED, ANOVA, GLM in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Unbalanced-data-using-MIXED-ANOVA-GLM/m-p/920789#M45739</link>
    <description>&lt;P&gt;It is best just to avoid PROC ANOVA. It was meant as a procedure for textbook-type problems. The method behind PROC ANOVA requires balanced data and there is no way to work around that requirement. Use GLM for your modeling needs that use only fixed effects. For models with random effects and/or repeated measures, use PROC MIXED.&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Mon, 18 Mar 2024 16:56:52 GMT</pubDate>
    <dc:creator>StatsMan</dc:creator>
    <dc:date>2024-03-18T16:56:52Z</dc:date>
    <item>
      <title>Unbalanced data using MIXED, ANOVA, GLM</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Unbalanced-data-using-MIXED-ANOVA-GLM/m-p/920654#M45729</link>
      <description>&lt;P&gt;Dear SAS community,&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I would like to understand how I can use PROC MIXED to fit a repeated measures ANOVA with unbalanced data in a reliable way.&amp;nbsp;I found several posts:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;&lt;A href="https://communities.sas.com/t5/Statistical-Procedures/PROC-MIXED-vs-ANOVA/td-p/15152" target="_blank"&gt;Solved: PROC MIXED vs. ANOVA - SAS Support Communities&lt;/A&gt;&lt;OL&gt;&lt;LI&gt;No code is provided, but the thread discusses the pros and cons of both procedures.&lt;/LI&gt;&lt;/OL&gt;&lt;/LI&gt;&lt;LI&gt;&lt;A href="https://www.lexjansen.com/wuss/2009/pos/POS-Greene.pdf" target="_blank"&gt;Microsoft Word - A beginner's example of PROC MIXED- Sarah R Greene.doc (lexjansen.com)&lt;/A&gt;&lt;OL&gt;&lt;LI&gt;Transposing data (wide format) and use of the RANDOM statement&lt;/LI&gt;&lt;LI&gt;CODE:&lt;OL&gt;&lt;LI&gt;Proc MIXED DATA=mydata.alldata_analysis1;&lt;BR /&gt;CLASS word_type word_length subject;&lt;BR /&gt;MODEL rt = word_type word_length word_type*word_length / ddfm=bw;&lt;BR /&gt;RANDOM intercept / sub=subject type=un;&lt;BR /&gt;LSMEANS word_type*word_length;&lt;BR /&gt;run;&lt;/LI&gt;&lt;/OL&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;When I fit an ANOVA model in PROC MIXED,&amp;nbsp;I expected behaviour similar to ANOVA (such as in GLM) with respect to the unbalanced observations (that is, they are not included in the model). However, when I use PROC MIXED, the unbalanced observations are used.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks in advance&amp;nbsp;&lt;/P&gt;&lt;P&gt;Kind regards&lt;/P&gt;&lt;P&gt;Philippe&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 18 Mar 2024 09:50:15 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Unbalanced-data-using-MIXED-ANOVA-GLM/m-p/920654#M45729</guid>
      <dc:creator>tadgerviloria</dc:creator>
      <dc:date>2024-03-18T09:50:15Z</dc:date>
    </item>
    <item>
      <title>Re: Unbalanced data using MIXED, ANOVA, GLM</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Unbalanced-data-using-MIXED-ANOVA-GLM/m-p/920667#M45730</link>
      <description>&lt;P&gt;Do NOT use PROC ANOVA for unbalanced data. It is my understanding that both PROC MIXED and PROC GLM handle unbalanced data properly, and the complete unbalanced data is used in the analysis. I don't know what it means to say the unbalanced observations are not used in the model -- the concept doesn't even make sense to me.&lt;/P&gt;</description>
      <pubDate>Mon, 18 Mar 2024 10:58:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Unbalanced-data-using-MIXED-ANOVA-GLM/m-p/920667#M45730</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2024-03-18T10:58:29Z</dc:date>
    </item>
    <item>
      <title>Re: Unbalanced data using MIXED, ANOVA, GLM</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Unbalanced-data-using-MIXED-ANOVA-GLM/m-p/920675#M45731</link>
      <description>&lt;P&gt;Dear Paige&amp;nbsp;&lt;/P&gt;&lt;P&gt;Many thanks for your prompt response.&amp;nbsp;&lt;/P&gt;&lt;P&gt;My understanding is that balanced and unbalanced data are terms equivalent to complete and incomplete cases.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;"A repeated measures ANOVA requires a balanced number of repeated measurements for each experimental unit. Due to this requirement, experimental units with missing measurements are completely excluded from the analysis" (&lt;A href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10231988/" target="_blank"&gt;Guidelines for repeated measures statistical analysis approaches with basic science research considerations - PMC (nih.gov)&lt;/A&gt;)&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;When I said that "the unbalanced observations [are] not being included in the model", I meant something similar to the guideline paper's statement that "[the] missing measurements are completely excluded from the analysis". This is the behaviour that I would expect to see in any ANOVA procedure. I wonder if this could be done in PROC MIXED, since the way I am fitting the model keeps the unbalanced observations (incomplete cases) in the model. Please see the code I used below:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;PRE&gt;proc mixed data = DATA_ANOVA;
  class ID  TRT VISIT ;
  model chg = TRT TRT*VISIT / solution cl;
  repeated / subject=ID type =   AR(1);
run;&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Thanks in advance&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Philippe&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 18 Mar 2024 11:52:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Unbalanced-data-using-MIXED-ANOVA-GLM/m-p/920675#M45731</guid>
      <dc:creator>tadgerviloria</dc:creator>
      <dc:date>2024-03-18T11:52:54Z</dc:date>
    </item>
    <item>
      <title>Re: Unbalanced data using MIXED, ANOVA, GLM</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Unbalanced-data-using-MIXED-ANOVA-GLM/m-p/920701#M45732</link>
      <description>&lt;P&gt;Balanced/Unbalanced data refers to the counts in each cell of your design (same number of observations per treatment group). PROC ANOVA requires balanced data in the design, PROC GLM and PROC MIXED do not.&lt;/P&gt;
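&lt;P&gt;As a minimal sketch of checking balance in this sense, the cell counts can be tabulated before choosing a procedure. The data set and variable names (DATA_ANOVA, TRT, VISIT, chg) are taken from the code posted earlier in this thread and are assumptions about your data. Equal counts in every TRT-by-VISIT cell would indicate a balanced design.&lt;/P&gt;
&lt;PRE&gt;/* Sketch only: names are assumed from the earlier post in this thread. */
/* Count the non-missing responses in each TRT-by-VISIT cell.           */
proc freq data=DATA_ANOVA;
  where chg is not missing;                   /* ignore rows with no response */
  tables TRT*VISIT / norow nocol nopercent;   /* show cell frequencies only   */
run;&lt;/PRE&gt;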
&lt;P&gt;The data situation you describe is slightly different. Subjects with incomplete repeated measures are not included in PROC GLM. The method of moments used in GLM requires complete data for each subject. Subjects with incomplete data are used in PROC MIXED. Maximum likelihood methods do not require that subjects have observations for all time points. MIXED allows only one observation per time point for each subject. So, GLM and MIXED will not agree if you have incomplete data on your subjects in a repeated measures analysis.&lt;/P&gt;</description>
      <pubDate>Mon, 18 Mar 2024 12:48:19 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Unbalanced-data-using-MIXED-ANOVA-GLM/m-p/920701#M45732</guid>
      <dc:creator>StatsMan</dc:creator>
      <dc:date>2024-03-18T12:48:19Z</dc:date>
    </item>
    <item>
      <title>Re: Unbalanced data using MIXED, ANOVA, GLM</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Unbalanced-data-using-MIXED-ANOVA-GLM/m-p/920710#M45733</link>
      <description>&lt;P&gt;Dear StatsMan&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Many thanks for your response!&lt;BR /&gt;Indeed, that is my experience: PROC MIXED keeps the incomplete cases in a repeated measures ANOVA,&amp;nbsp;but PROC GLM does not. It is nice to know the reason behind the discrepancy (MLE in PROC MIXED, and MoM in PROC GLM).&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;When you said "&lt;SPAN&gt;PROC ANOVA requires balanced data in the design, PROC GLM and PROC MIXED do not.&lt;/SPAN&gt;" are you referring to a pre-post design (with no repeated measures in between)? Why are there differences between PROC GLM and PROC ANOVA in this context?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Do you know what PROC ANOVA is expected to do in&amp;nbsp;a repeated measures ANOVA when&amp;nbsp;incomplete cases are observed?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks in advance&lt;/P&gt;&lt;P&gt;Philippe&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 18 Mar 2024 13:16:28 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Unbalanced-data-using-MIXED-ANOVA-GLM/m-p/920710#M45733</guid>
      <dc:creator>tadgerviloria</dc:creator>
      <dc:date>2024-03-18T13:16:28Z</dc:date>
    </item>
    <item>
      <title>Re: Unbalanced data using MIXED, ANOVA, GLM</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Unbalanced-data-using-MIXED-ANOVA-GLM/m-p/920789#M45739</link>
      <description>&lt;P&gt;It is best just to avoid PROC ANOVA. It was meant as a procedure for textbook-type problems. The method behind PROC ANOVA requires balanced data and there is no way to work around that requirement. Use GLM for your modeling needs that use only fixed effects. For models with random effects and/or repeated measures, use PROC MIXED.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 18 Mar 2024 16:56:52 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Unbalanced-data-using-MIXED-ANOVA-GLM/m-p/920789#M45739</guid>
      <dc:creator>StatsMan</dc:creator>
      <dc:date>2024-03-18T16:56:52Z</dc:date>
    </item>
    <item>
      <title>Re: Unbalanced data using MIXED, ANOVA, GLM</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Unbalanced-data-using-MIXED-ANOVA-GLM/m-p/920790#M45740</link>
      <description>&lt;P&gt;From the online help for PROC ANOVA, in the Overview section at the start:&lt;/P&gt;
&lt;BLOCKQUOTE&gt;
&lt;DIV class="xisDoc-refProc"&gt;
&lt;DIV id="statug.anova.anova_sec_overview" class="aa-section"&gt;
&lt;DIV class="xisDoc-title"&gt;
&lt;DIV&gt;
&lt;DIV&gt;
&lt;H2 class="xisDoc-title"&gt;Overview: ANOVA Procedure&lt;/H2&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;P class="xisDoc-paragraph"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class="xisDoc-paragraph"&gt;The ANOVA procedure performs &lt;SPAN&gt;&lt;EM&gt;analysis of variance&lt;/EM&gt;&lt;/SPAN&gt; (ANOVA) for &lt;FONT size="5"&gt;&lt;STRONG&gt;balanced data&lt;/STRONG&gt;&lt;/FONT&gt; from a wide variety of experimental designs. In analysis of variance, a continuous response variable, known as a &lt;SPAN&gt;&lt;EM&gt;dependent variable&lt;/EM&gt;&lt;/SPAN&gt;, is measured under experimental conditions identified by classification variables, known as &lt;SPAN&gt;&lt;EM&gt;independent variables&lt;/EM&gt;&lt;/SPAN&gt;. &amp;nbsp; &lt;STRONG&gt;The variation in the response is assumed to be due to effects in the classification, with rando&lt;A id="statug.anova.anova_a0000000006" class="indexterm" target="_blank"&gt;&lt;/A&gt;m error accounting for the remaining variation.&lt;/STRONG&gt;&lt;/P&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;Emphasis added.&lt;/P&gt;
&lt;P&gt;Traditional ANOVA, in my personal opinion, had exactly one advantage: the calculations could be done by hand. (And I've done them that way, because the computers I had available in the 1970s didn't have appropriate software.)&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 18 Mar 2024 17:07:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Unbalanced-data-using-MIXED-ANOVA-GLM/m-p/920790#M45740</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2024-03-18T17:07:43Z</dc:date>
    </item>
  </channel>
</rss>

