<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Incomplete stratification? in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Incomplete-stratification/m-p/241370#M12756</link>
    <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/15128"&gt;@plf515﻿&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Your intuition is right on the nose. &amp;nbsp;The advantage to the means model approach is that I don't have to come up with code and level labels, which is big for me because I am really lazy.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I think means model approaches were touted by Milliken and Johnson because the&amp;nbsp;Type IV hypotheses (designed for missing cell analyses) in PROC GLM were not unique, whereas the means model resulted in unique hypothesis tests. &amp;nbsp;Now that the LSMESTIMATE statement is available, I can see a lot more analyses going this route.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Steve Denham&lt;/P&gt;</description>
    <pubDate>Thu, 31 Dec 2015 13:08:35 GMT</pubDate>
    <dc:creator>SteveDenham</dc:creator>
    <dc:date>2015-12-31T13:08:35Z</dc:date>
    <item>
      <title>Incomplete stratification?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Incomplete-stratification/m-p/241078#M12743</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;This is more of a modeling question than a SAS question. &amp;nbsp;I have an experiment I need to analyze where there are two stratification variables: type of tissue and level of viral challenge. &amp;nbsp;The type of tissue is either ectocervical or endocervical. &amp;nbsp;The level of viral challenge is either 50 or 500. &amp;nbsp;There should be 4 combinations: ecto 50, ecto 500, endo 50, and endo 500. &amp;nbsp;The problem is that the lab didn't do one of the combination: endo 50. &amp;nbsp;Here's a table summary of what data exists.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;TABLE style="border-collapse: collapse; width: 144pt;" border="0" width="192" cellspacing="0" cellpadding="0"&gt;&lt;COLGROUP&gt;&lt;COL style="width: 48pt;" span="3" width="64" /&gt; &lt;/COLGROUP&gt;
&lt;TBODY&gt;
&lt;TR style="height: 15.0pt;"&gt;
&lt;TD width="64" height="20" class="xl65" style="height: 15.0pt; width: 48pt;"&gt;&amp;nbsp;&lt;/TD&gt;
&lt;TD width="64" class="xl64" style="width: 48pt;"&gt;500&lt;/TD&gt;
&lt;TD width="64" class="xl64" style="width: 48pt;"&gt;50&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR style="height: 15.0pt;"&gt;
&lt;TD height="20" class="xl66" style="height: 15.0pt;"&gt;Ecto&lt;/TD&gt;
&lt;TD class="xl63"&gt;X&lt;/TD&gt;
&lt;TD class="xl63"&gt;X&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR style="height: 15.0pt;"&gt;
&lt;TD height="20" class="xl66" style="height: 15.0pt;"&gt;Endo&lt;/TD&gt;
&lt;TD class="xl63"&gt;X&lt;/TD&gt;
&lt;TD class="xl63"&gt;-&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;In a model of outcome = tissue|virus, given that I am missing one type of data completely, I cannot test for the overall interaction, but I can get LSMeans for the three pairwise differences of the existing data. &amp;nbsp;I can somewhat assume there is an interaction if the adjusted pairwise differences show significance somewhere.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;As it turns out, this model gives the same LSMeans results as if I just combined the two variables into one variable with 3 levels: ecto500, ecto050, and endo500. &amp;nbsp;The model would then just be outcome = tissueVirus. &amp;nbsp;The math to get to the results is slightly different, but the end results are exactly the same estimates, differences, variances, test statistics, and p-values.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The plot thickens, because these two factors are not actually the variables of clinical importance. &amp;nbsp;Instead, there are a whole host of other measurements that need to be tested. &amp;nbsp;This is an exploratory analysis. &amp;nbsp;For each measurement, I'll want to stratify by tissue and virus level. &amp;nbsp;The model would either look like 1) outcome = measurement|tissue|virus or 2) outcome = measurement|tissueVirus.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;My question is: should I use 1 or 2? &amp;nbsp;2 seems simpler, has fewer things to estimate, and doesn't display a ton of non-estimatable results. &amp;nbsp;However, 1 is actually closer to the conceptual design of the experiment. &amp;nbsp;It seems, though, that because of the missing level, that perhaps there is no conceptual difference between 2x2 missing a cell and 1x3.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks in advance!&lt;/P&gt;
&lt;P&gt;Michael&lt;/P&gt;</description>
      <pubDate>Mon, 28 Dec 2015 21:12:39 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Incomplete-stratification/m-p/241078#M12743</guid>
      <dc:creator>Kastchei</dc:creator>
      <dc:date>2015-12-28T21:12:39Z</dc:date>
    </item>
    <item>
      <title>Re: Incomplete stratification?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Incomplete-stratification/m-p/241253#M12747</link>
      <description>&lt;P&gt;I would do 2).&amp;nbsp;&amp;nbsp; This seems a lot simpler.&amp;nbsp; The original intent was to do one thing, but it didn't work out that way (so often the case) so iI think you have to adjust.&lt;/P&gt;</description>
      <pubDate>Wed, 30 Dec 2015 17:54:17 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Incomplete-stratification/m-p/241253#M12747</guid>
      <dc:creator>plf515</dc:creator>
      <dc:date>2015-12-30T17:54:17Z</dc:date>
    </item>
    <item>
      <title>Re: Incomplete stratification?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Incomplete-stratification/m-p/241285#M12754</link>
      <description>&lt;P&gt;In Milliken and Johnson's classic text&amp;nbsp;&lt;EM&gt;Analysis of Messy Data&lt;/EM&gt;, they cover this type of thing extensively. &amp;nbsp;Rather than recoding to a single variable, they analyze a "means model", which in this case would look like:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;model dependent_var=tissue*virus;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Note that there are no main effects in this--it is a one way ANOVA, so any comparisons have to be done with ESTIMATE or (even better) LSMESTIMATE statements. &amp;nbsp;This has the advantages of retaining the original design and not having to write code to convert two variables to one in a DATA step&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Steve Denham&lt;/P&gt;</description>
      <pubDate>Wed, 30 Dec 2015 19:46:56 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Incomplete-stratification/m-p/241285#M12754</guid>
      <dc:creator>SteveDenham</dc:creator>
      <dc:date>2015-12-30T19:46:56Z</dc:date>
    </item>
    <item>
      <title>Re: Incomplete stratification?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Incomplete-stratification/m-p/241302#M12755</link>
      <description>&lt;P&gt;Hi Steve&lt;/P&gt;&lt;P&gt;This sounds like it works out the same as doing the combination in the DATA step.....That' smy intuition anyway.&amp;nbsp; Do you know if my intuition is right?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Peter&lt;/P&gt;</description>
      <pubDate>Wed, 30 Dec 2015 21:07:10 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Incomplete-stratification/m-p/241302#M12755</guid>
      <dc:creator>plf515</dc:creator>
      <dc:date>2015-12-30T21:07:10Z</dc:date>
    </item>
    <item>
      <title>Re: Incomplete stratification?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Incomplete-stratification/m-p/241370#M12756</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/15128"&gt;@plf515﻿&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Your intuition is right on the nose. &amp;nbsp;The advantage to the means model approach is that I don't have to come up with code and level labels, which is big for me because I am really lazy.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I think means model approaches were touted by Milliken and Johnson because the&amp;nbsp;Type IV hypotheses (designed for missing cell analyses) in PROC GLM were not unique, whereas the means model resulted in unique hypothesis tests. &amp;nbsp;Now that the LSMESTIMATE statement is available, I can see a lot more analyses going this route.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Steve Denham&lt;/P&gt;</description>
      <pubDate>Thu, 31 Dec 2015 13:08:35 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Incomplete-stratification/m-p/241370#M12756</guid>
      <dc:creator>SteveDenham</dc:creator>
      <dc:date>2015-12-31T13:08:35Z</dc:date>
    </item>
  </channel>
</rss>

