<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: What is the best transfomation for my data? in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/What-is-the-best-transfomation-for-my-data/m-p/599778#M29176</link>
    <description>&lt;P&gt;I don't think there is any transformation that will result in normal residuals. You can see from the Q-Q plot that about half of the data have zero residuals. This can happen if, for example, the Y6 variable is rounded so that it has a small number of discrete values.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;It might be that Y6 has a large number of repeated values.&amp;nbsp;You can check the distribution of the Y6 variable by running&amp;nbsp;&lt;/P&gt;
&lt;PRE class="sas"&gt;&lt;SPAN&gt;proc freq&lt;/SPAN&gt; &lt;SPAN&gt;data&lt;/SPAN&gt;=AU &lt;SPAN&gt;ORDER&lt;/SPAN&gt;=FREQ;
  tables Y6 / maxlevels=&lt;SPAN&gt;10&lt;/SPAN&gt;;   /* only print top 10 */
&lt;SPAN&gt;run&lt;/SPAN&gt;;&lt;/PRE&gt;
&lt;P&gt;For details see&amp;nbsp;&lt;A href="https://blogs.sas.com/content/iml/2018/06/04/top-10-table-bar-chart.html" target="_blank"&gt;https://blogs.sas.com/content/iml/2018/06/04/top-10-table-bar-chart.html&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Mon, 28 Oct 2019 13:07:13 GMT</pubDate>
    <dc:creator>Rick_SAS</dc:creator>
    <dc:date>2019-10-28T13:07:13Z</dc:date>
    <item>
      <title>What is the best transfomation for my data?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/What-is-the-best-transfomation-for-my-data/m-p/599715#M29166</link>
      <description>&lt;P&gt;Dear all.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;How do I know how to best transform my data so that I have residual normally distributed?&lt;/P&gt;&lt;P&gt;I am fitting the following model to my data:&lt;/P&gt;&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;PROC MIXED DATA=AU;
CLASS LOCAL ESPE REP ESTA;
MODEL Y6=LOCAL|ESPE|ESTA /HTYPE=3 RESIDUAL INFLUENCE;
RANDOM REP(LOCAL*ESPE);
REPEATED ESTA/TYPE=TOEP(2) SUB=REP(LOCAL*ESPE) GROUP=LOCAL*ESPECIE;&lt;BR /&gt;ODS OUTPUT "Influence Diagnostics"=DIAGNOSTIC;
RUN;&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;However, the residual did not have a normal distribution as a result below:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="DIAGNOSTIC.jpg" style="width: 600px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/33432i46536CEC27A56D3B/image-size/large?v=v2&amp;amp;px=999" role="button" title="DIAGNOSTIC.jpg" alt="DIAGNOSTIC.jpg" /&gt;&lt;/span&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="normal.jpg" style="width: 447px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/33431iE9193A757F293D87/image-size/large?v=v2&amp;amp;px=999" role="button" title="normal.jpg" alt="normal.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Is there any procedure to know how to transform the data?&amp;nbsp;I think Box-Cox transformation is not a good idea, because my response variable is categorical factors (LOCAL, ESPE, ESTA), not continuous values.&lt;/P&gt;&lt;P&gt;I appreciate all the help. Thanks.&lt;/P&gt;</description>
      <pubDate>Mon, 28 Oct 2019 02:56:28 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/What-is-the-best-transfomation-for-my-data/m-p/599715#M29166</guid>
      <dc:creator>vitormacedo</dc:creator>
      <dc:date>2019-10-28T02:56:28Z</dc:date>
    </item>
    <item>
      <title>Re: What is the best transfomation for my data?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/What-is-the-best-transfomation-for-my-data/m-p/599765#M29168</link>
      <description>&lt;P&gt;You could check ASSESS statement of PROC GENMOD ,&amp;nbsp;&lt;/P&gt;
&lt;P&gt;But I don't know if it could handle category variable ,since you claim your variable is category .&lt;/P&gt;</description>
      <pubDate>Mon, 28 Oct 2019 12:42:48 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/What-is-the-best-transfomation-for-my-data/m-p/599765#M29168</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2019-10-28T12:42:48Z</dc:date>
    </item>
    <item>
      <title>Re: What is the best transfomation for my data?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/What-is-the-best-transfomation-for-my-data/m-p/599778#M29176</link>
      <description>&lt;P&gt;I don't think there is any transformation that will result in normal residuals. You can see from the Q-Q plot that about half of the data have zero residuals. This can happen if, for example, the Y6 variable is rounded so that it has a small number of discrete values.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;It might be that Y6 has a large number of repeated values.&amp;nbsp;You can check the distribution of the Y6 variable by running&amp;nbsp;&lt;/P&gt;
&lt;PRE class="sas"&gt;&lt;SPAN&gt;proc freq&lt;/SPAN&gt; &lt;SPAN&gt;data&lt;/SPAN&gt;=AU &lt;SPAN&gt;ORDER&lt;/SPAN&gt;=FREQ;
  tables Y6 / maxlevels=&lt;SPAN&gt;10&lt;/SPAN&gt;;   /* only print top 10 */
&lt;SPAN&gt;run&lt;/SPAN&gt;;&lt;/PRE&gt;
&lt;P&gt;For details see&amp;nbsp;&lt;A href="https://blogs.sas.com/content/iml/2018/06/04/top-10-table-bar-chart.html" target="_blank"&gt;https://blogs.sas.com/content/iml/2018/06/04/top-10-table-bar-chart.html&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 28 Oct 2019 13:07:13 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/What-is-the-best-transfomation-for-my-data/m-p/599778#M29176</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2019-10-28T13:07:13Z</dc:date>
    </item>
    <item>
      <title>Re: What is the best transfomation for my data?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/What-is-the-best-transfomation-for-my-data/m-p/599930#M29182</link>
      <description>&lt;P&gt;My variable response represents proportion of plant parts. Y6 is the proportion of flowers, which in many observations is zero.&lt;/P&gt;&lt;P&gt;Should I then work with the GLIMMIX procedure instead of transforming the data?&lt;/P&gt;&lt;P&gt;My goal is to test the difference between the factors (LOCAL, ESPE, ESTA), and the effect of interactions. What's command should I use?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you!&lt;/P&gt;</description>
      <pubDate>Mon, 28 Oct 2019 20:26:08 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/What-is-the-best-transfomation-for-my-data/m-p/599930#M29182</guid>
      <dc:creator>vitormacedo</dc:creator>
      <dc:date>2019-10-28T20:26:08Z</dc:date>
    </item>
    <item>
      <title>Re: What is the best transfomation for my data?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/What-is-the-best-transfomation-for-my-data/m-p/600068#M29185</link>
      <description>&lt;P&gt;If your response is a proportion, you need to account for that. If you Google "sas regression proportion" you'll find many references and papers. Two from SAS include&lt;/P&gt;
&lt;P&gt;&lt;A href="http://support.sas.com/kb/57/480.html" target="_blank"&gt;http://support.sas.com/kb/57/480.html&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;A href="http://support.sas.com/kb/56/992.html" target="_blank"&gt;http://support.sas.com/kb/56/992.html&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;Since you might have an inflated number of zeros, you can also search for papers about zero-inflated models.&lt;/P&gt;</description>
      <pubDate>Tue, 29 Oct 2019 13:55:05 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/What-is-the-best-transfomation-for-my-data/m-p/600068#M29185</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2019-10-29T13:55:05Z</dc:date>
    </item>
  </channel>
</rss>

