<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Proc Logistic or Proc Corr? in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-or-Proc-Corr/m-p/506126#M26016</link>
    <description>&lt;P&gt;It probably is easier to use Proc Logistic.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;You can calculate &lt;SPAN&gt;the point-biserial correlation coefficient for 0/1 coded categorical&amp;nbsp;variable with two categories&amp;nbsp;using the Pearson's method.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;With respect to continuous variables,&amp;nbsp;&lt;/P&gt;&lt;P&gt;it depends on whether your data is 1. normally distributed and 2. likely to have outliers. The following thesis showed that Pearson's can be used for non-normally distributed data, but, because it is sensitive to outliers, Pearson's is not good for datasets where outliers are very likely:&lt;/P&gt;&lt;BLOCKQUOTE&gt;&lt;P&gt;Pearson’s, Spearman’s and &lt;FONT color="#800080"&gt;Kendall’s&lt;/FONT&gt; correlation coefficients are the most commonly used measures of monotone association, with the latter two &lt;FONT color="#800080"&gt;usually suggested&lt;/FONT&gt;&lt;FONT color="#800080"&gt;for non-normally distributed data&lt;/FONT&gt;.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Pearson’s correlation coefficient could offer a substantial improvement in statistical power even for distributions with moderate skewness...&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Nonetheless, because of its known sensitivity to outliers, &lt;FONT color="#800080"&gt;Pearson’s correlation leads to a less powerful statistical test for distributions with &lt;EM&gt;extreme&lt;/EM&gt; skewness or &lt;EM&gt;excess&lt;/EM&gt; of kurtosis (where the datasets with outliers are more likely)&lt;/FONT&gt;.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In conclusion,... Pearson’s correlation coefficient could have significant advantages for continuous non-normal data which does not have obvious outliers.&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;U&gt;Reference&lt;/U&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;U&gt;&lt;A href="http://d-scholarship.pitt.edu/8056/1/Chokns_etd2010.pdf" target="_blank"&gt;Pearson's vs. Kendall's Correlation for Continuous Data&lt;/A&gt;&lt;/U&gt;&lt;/P&gt;</description>
    <pubDate>Fri, 19 Oct 2018 19:15:18 GMT</pubDate>
    <dc:creator>pink_poodle</dc:creator>
    <dc:date>2018-10-19T19:15:18Z</dc:date>
    <item>
      <title>Proc Logistic or Proc Corr?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-or-Proc-Corr/m-p/506087#M26015</link>
      <description>&lt;P&gt;Hi Guys,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I've posted a bit on this topic, so sorry if you've seen this before but I figured this might be a better board to get my specific question asked.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am honestly fairly new to SAS and was thrown in the deep end, so a bunch of the technical jargon goes over my head, FYI.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I'm trying to run a Multivariable Analysis that wants to see if there is a correlation between status of disease (1= they have it 0= they don't) and 7 risk factors (symptoms).&lt;/P&gt;&lt;P&gt;5 of them&amp;nbsp; are categorical (1= yes 0-no)&amp;nbsp;&lt;/P&gt;&lt;P&gt;2 of them are continuous&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I've dug it down to the fact that&amp;nbsp; I need to use Proc Corr ( I think)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;but do I use Kendall's Tau as my R2 and P value or Pearson's for my my R2 and P-value? ..or both depending on whether it's continuous or categorical?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="proc corr.PNG" style="width: 600px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/24214iEB86CE28800BB91B/image-size/large?v=v2&amp;amp;px=999" role="button" title="proc corr.PNG" alt="proc corr.PNG" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;----&lt;/P&gt;&lt;P&gt;But HONESTLY, wouldn't it be easier to use proc logistic since it doesn't worry about categorical or continuous and the outcome is binary?&lt;/P&gt;</description>
      <pubDate>Fri, 19 Oct 2018 17:56:44 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-or-Proc-Corr/m-p/506087#M26015</guid>
      <dc:creator>hpatel3</dc:creator>
      <dc:date>2018-10-19T17:56:44Z</dc:date>
    </item>
    <item>
      <title>Re: Proc Logistic or Proc Corr?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-or-Proc-Corr/m-p/506126#M26016</link>
      <description>&lt;P&gt;It probably is easier to use Proc Logistic.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;You can calculate &lt;SPAN&gt;the point-biserial correlation coefficient for 0/1 coded categorical&amp;nbsp;variable with two categories&amp;nbsp;using the Pearson's method.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;With respect to continuous variables,&amp;nbsp;&lt;/P&gt;&lt;P&gt;it depends on whether your data is 1. normally distributed and 2. likely to have outliers. The following thesis showed that Pearson's can be used for non-normally distributed data, but, because it is sensitive to outliers, Pearson's is not good for datasets where outliers are very likely:&lt;/P&gt;&lt;BLOCKQUOTE&gt;&lt;P&gt;Pearson’s, Spearman’s and &lt;FONT color="#800080"&gt;Kendall’s&lt;/FONT&gt; correlation coefficients are the most commonly used measures of monotone association, with the latter two &lt;FONT color="#800080"&gt;usually suggested&lt;/FONT&gt;&lt;FONT color="#800080"&gt;for non-normally distributed data&lt;/FONT&gt;.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Pearson’s correlation coefficient could offer a substantial improvement in statistical power even for distributions with moderate skewness...&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Nonetheless, because of its known sensitivity to outliers, &lt;FONT color="#800080"&gt;Pearson’s correlation leads to a less powerful statistical test for distributions with &lt;EM&gt;extreme&lt;/EM&gt; skewness or &lt;EM&gt;excess&lt;/EM&gt; of kurtosis (where the datasets with outliers are more likely)&lt;/FONT&gt;.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In conclusion,... Pearson’s correlation coefficient could have significant advantages for continuous non-normal data which does not have obvious outliers.&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;U&gt;Reference&lt;/U&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;U&gt;&lt;A href="http://d-scholarship.pitt.edu/8056/1/Chokns_etd2010.pdf" target="_blank"&gt;Pearson's vs. Kendall's Correlation for Continuous Data&lt;/A&gt;&lt;/U&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 19 Oct 2018 19:15:18 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-or-Proc-Corr/m-p/506126#M26016</guid>
      <dc:creator>pink_poodle</dc:creator>
      <dc:date>2018-10-19T19:15:18Z</dc:date>
    </item>
    <item>
      <title>Re: Proc Logistic or Proc Corr?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-or-Proc-Corr/m-p/506158#M26017</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/237382"&gt;@hpatel3&lt;/a&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Why do you start a new thread for essentially the exact same question? That's not necessary, it's not good, and people get confused by trying to assimilate replies in two different threads. Why don't we continue this at your original thread which is: &lt;A href="https://communities.sas.com/t5/SAS-Programming/Multivariate-analysis-Do-I-use-Proc-Corr-or-Proc-GLM-or-Proc-Reg/m-p/505569#M135401" target="_blank"&gt;https://communities.sas.com/t5/SAS-Programming/Multivariate-analysis-Do-I-use-Proc-Corr-or-Proc-GLM-or-Proc-Reg/m-p/505569#M135401&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 19 Oct 2018 20:07:15 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-or-Proc-Corr/m-p/506158#M26017</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2018-10-19T20:07:15Z</dc:date>
    </item>
    <item>
      <title>Re: Proc Logistic or Proc Corr?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-or-Proc-Corr/m-p/506452#M26023</link>
      <description>&lt;P&gt;Will do, thanks!&lt;/P&gt;</description>
      <pubDate>Mon, 22 Oct 2018 14:21:01 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-or-Proc-Corr/m-p/506452#M26023</guid>
      <dc:creator>hpatel3</dc:creator>
      <dc:date>2018-10-22T14:21:01Z</dc:date>
    </item>
  </channel>
</rss>

