<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Box Cox-Two Distributions in same data set in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Box-Cox-Two-Distributions-in-same-data-set/m-p/313311#M16499</link>
    <description>&lt;P&gt;My goal is to do a linear regression with the data which if not normal would violate the assumptions for the regression.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I chose that bin size because it appeared to coincide with the observed histogram. &amp;nbsp;There may be subpopulations involved but at this point we are not sure but we are investigating.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I was not aware of Proc FMM. &amp;nbsp; Are there any normality requirements for using Proc FMM?&lt;/P&gt;</description>
    <pubDate>Tue, 22 Nov 2016 00:23:09 GMT</pubDate>
    <dc:creator>jacksonan123</dc:creator>
    <dc:date>2016-11-22T00:23:09Z</dc:date>
    <item>
      <title>Box Cox-Two Distributions in same data set</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Box-Cox-Two-Distributions-in-same-data-set/m-p/313185#M16497</link>
      <description>&lt;P&gt;I have non-transformed data &amp;nbsp;whose histograms show&amp;nbsp;two distributions.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Therefore, would it be okay to apply the Box Cox analysis seperately to the distributions above and below CLud &amp;gt; 250000 and Clud&amp;lt;250000 in this case? &amp;nbsp;I ask because when I do seperate analyses, the resulting tests for normality and the QQ plots are vastly improved.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;A follow-up question would be should one analyze as two distributions if the transformed data show two distinct distributions?&lt;/P&gt;</description>
      <pubDate>Mon, 21 Nov 2016 18:33:30 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Box-Cox-Two-Distributions-in-same-data-set/m-p/313185#M16497</guid>
      <dc:creator>jacksonan123</dc:creator>
      <dc:date>2016-11-21T18:33:30Z</dc:date>
    </item>
    <item>
      <title>Re: Box Cox-Two Distributions in same data set</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Box-Cox-Two-Distributions-in-same-data-set/m-p/313310#M16498</link>
      <description>&lt;P&gt;What is your goal? Why do you want to transform the data to normality? For example, the data could be distributed like an exponential or gamma.&lt;/P&gt;
&lt;P&gt;Try a smaller bin size (maybe 50,000) and see if a decreasing decaying&amp;nbsp;distribution appears.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Are there subpopulations involved? For example, if you plot the heights of students in a classroom, you might get a distribution that is a MIXTURE of the heights of men and women. In that case, you should model the distribution as a finite mixture model. PROC FMM can &amp;nbsp;do that. &amp;nbsp;See &lt;A href="http://blogs.sas.com/content/iml/2011/09/23/modeling-finite-mixtures-with-the-fmm-procedure.html" target="_self"&gt;"Modeling finite mixtures with the FMM procedure."&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 22 Nov 2016 00:14:30 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Box-Cox-Two-Distributions-in-same-data-set/m-p/313310#M16498</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2016-11-22T00:14:30Z</dc:date>
    </item>
    <item>
      <title>Re: Box Cox-Two Distributions in same data set</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Box-Cox-Two-Distributions-in-same-data-set/m-p/313311#M16499</link>
      <description>&lt;P&gt;My goal is to do a linear regression with the data which if not normal would violate the assumptions for the regression.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I chose that bin size because it appeared to coincide with the observed histogram. &amp;nbsp;There may be subpopulations involved but at this point we are not sure but we are investigating.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I was not aware of Proc FMM. &amp;nbsp; Are there any normality requirements for using Proc FMM?&lt;/P&gt;</description>
      <pubDate>Tue, 22 Nov 2016 00:23:09 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Box-Cox-Two-Distributions-in-same-data-set/m-p/313311#M16499</guid>
      <dc:creator>jacksonan123</dc:creator>
      <dc:date>2016-11-22T00:23:09Z</dc:date>
    </item>
    <item>
      <title>Re: Box Cox-Two Distributions in same data set</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Box-Cox-Two-Distributions-in-same-data-set/m-p/313316#M16500</link>
      <description>&lt;P&gt;Linear regression does not require that the variables be normal. Neither the response variable nor the explanatory variable(s) have to be normally distributed.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="http://support.sas.com/documentation/cdl/en/statug/68162/HTML/default/viewer.htm#statug_introreg_sect029.htm" target="_self"&gt;The assumptions for linear regression is presented in the SAS/STAT documentation. T&lt;/A&gt;he important assumption for estimating the parameters is that the &amp;nbsp;errors are identical and are independently distrivbuted. &amp;nbsp; For some inferential statistics (standard errors, confidence intervals,...), the errors&amp;nbsp;are assumed to be normally distributed. Thus for these statistics to be valid a plot of the RESIDUALS&amp;nbsp;should look approximately normal. This is much different than saying that the response is normal.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 22 Nov 2016 01:05:02 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Box-Cox-Two-Distributions-in-same-data-set/m-p/313316#M16500</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2016-11-22T01:05:02Z</dc:date>
    </item>
    <item>
      <title>Re: Box Cox-Two Distributions in same data set</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Box-Cox-Two-Distributions-in-same-data-set/m-p/313326#M16501</link>
      <description>&lt;PRE&gt;

Or you could try non-parameter regression PROC LOESS .

&lt;/PRE&gt;</description>
      <pubDate>Tue, 22 Nov 2016 02:33:51 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Box-Cox-Two-Distributions-in-same-data-set/m-p/313326#M16501</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2016-11-22T02:33:51Z</dc:date>
    </item>
    <item>
      <title>Re: Box Cox-Two Distributions in same data set</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Box-Cox-Two-Distributions-in-same-data-set/m-p/313458#M16502</link>
      <description>&lt;P&gt;Not sure if that would help but I may try it later.&lt;/P&gt;</description>
      <pubDate>Tue, 22 Nov 2016 13:18:25 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Box-Cox-Two-Distributions-in-same-data-set/m-p/313458#M16502</guid>
      <dc:creator>jacksonan123</dc:creator>
      <dc:date>2016-11-22T13:18:25Z</dc:date>
    </item>
  </channel>
</rss>

