<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Pearson Correlation test? in New SAS User</title>
    <link>https://communities.sas.com/t5/New-SAS-User/Pearson-Correlation-test/m-p/782842#M31946</link>
    <description>&lt;P&gt;&lt;SPAN&gt;Yes, Pearson correlation is an important step. But you should also make a scatter plot, as the relationship could be non-linear.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Cheers, Markus&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Mon, 29 Nov 2021 09:08:06 GMT</pubDate>
    <dc:creator>MarkusWeick</dc:creator>
    <dc:date>2021-11-29T09:08:06Z</dc:date>
    <item>
      <title>Pearson Correlation test?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/Pearson-Correlation-test/m-p/782827#M31941</link>
      <description>I have two data sets and I want to see if they are related. I want to know if the air pollution levels by county (data set one) are related to number of poultry farms by county (data set two). This would be a Pearson correlation, correct?</description>
      <pubDate>Mon, 29 Nov 2021 07:40:33 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/Pearson-Correlation-test/m-p/782827#M31941</guid>
      <dc:creator>Pixydust12</dc:creator>
      <dc:date>2021-11-29T07:40:33Z</dc:date>
    </item>
    <item>
      <title>Re: Pearson Correlation test?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/Pearson-Correlation-test/m-p/782842#M31946</link>
      <description>&lt;P&gt;&lt;SPAN&gt;Yes, Pearson correlation is an important step. But you should also make a scatter plot, as the relationship could be non-linear.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Cheers, Markus&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 29 Nov 2021 09:08:06 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/Pearson-Correlation-test/m-p/782842#M31946</guid>
      <dc:creator>MarkusWeick</dc:creator>
      <dc:date>2021-11-29T09:08:06Z</dc:date>
    </item>
    <item>
      <title>Re: Pearson Correlation test?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/Pearson-Correlation-test/m-p/782848#M31948</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You need a single data set with three columns rather than two separate data sets, so merge them by county:&lt;/P&gt;
&lt;UL class="lia-list-style-type-disc"&gt;
&lt;LI&gt;County = ID-variable&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN&gt;pollution level (col1)&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN&gt;number of poultry farms (col2)&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
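&lt;P&gt;A minimal sketch of that merge, assuming two hypothetical data sets named pollution and farms that share a county variable (all names here are placeholders, not taken from your data):&lt;/P&gt;
&lt;PRE&gt;/* Hypothetical names: pollution, farms, county, merged */
/* MERGE with BY requires both inputs sorted by the BY variable */
proc sort data=pollution; by county; run;
proc sort data=farms; by county; run;

data merged;
  merge pollution(in=inPollution) farms(in=inFarms);
  by county;
  /* keep only counties that appear in both data sets */
  if inPollution and inFarms;
run;&lt;/PRE&gt;
&lt;P&gt;The subsetting IF drops counties that are missing from either source; remove it if you would rather keep them with missing values.&lt;/P&gt;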
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;As&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/401244"&gt;@MarkusWeick&lt;/a&gt;&amp;nbsp;said, first plot your data to get a better idea of which analysis might be appropriate.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Pearson correlation is useful, but it only measures the strength of a linear relationship.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;You can also try a simple linear regression, but maybe a spline fits the data better?&lt;/SPAN&gt;&lt;/P&gt;
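&lt;P&gt;&lt;SPAN&gt;One way to compare the two fits visually is to overlay them on the scatter plot. This is only a sketch, assuming a merged data set called merged with hypothetical variables n_farms and pollution_level:&lt;/SPAN&gt;&lt;/P&gt;
&lt;PRE&gt;/* Hypothetical names: merged, n_farms, pollution_level */
proc sgplot data=merged;
  /* raw data points */
  scatter x=n_farms y=pollution_level;
  /* straight-line fit */
  reg x=n_farms y=pollution_level / nomarkers;
  /* penalized B-spline fit, which can follow a curved pattern */
  pbspline x=n_farms y=pollution_level / nomarkers;
run;&lt;/PRE&gt;
&lt;P&gt;&lt;SPAN&gt;If the spline curve stays close to the straight line, a linear measure such as Pearson correlation is probably a reasonable summary.&lt;/SPAN&gt;&lt;/P&gt;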
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Thanks,&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Koen&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 29 Nov 2021 09:28:36 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/Pearson-Correlation-test/m-p/782848#M31948</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2021-11-29T09:28:36Z</dc:date>
    </item>
    <item>
      <title>Re: Pearson Correlation test?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/Pearson-Correlation-test/m-p/783217#M31981</link>
      <description>Hi Markus,&lt;BR /&gt;So once I do a scatter plot and I see that the relationship is not linear, does that sufficiently answer my research question (To evaluate if air pollution levels are greater in counties near poultry farms), or should I do another test?</description>
      <pubDate>Tue, 30 Nov 2021 17:10:49 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/Pearson-Correlation-test/m-p/783217#M31981</guid>
      <dc:creator>Pixydust12</dc:creator>
      <dc:date>2021-11-30T17:10:49Z</dc:date>
    </item>
    <item>
      <title>Re: Pearson Correlation test?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/Pearson-Correlation-test/m-p/783219#M31982</link>
      <description>&lt;P&gt;Hi Pixydust,&lt;/P&gt;&lt;P&gt;It depends on what the plot shows. Maybe you'll find another hypothesis for the relationship.&lt;/P&gt;&lt;P&gt;Would you like to share the plot?&lt;/P&gt;&lt;P&gt;Cheers&lt;/P&gt;&lt;P&gt;Markus&lt;/P&gt;</description>
      <pubDate>Tue, 30 Nov 2021 17:19:10 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/Pearson-Correlation-test/m-p/783219#M31982</guid>
      <dc:creator>MarkusWeick</dc:creator>
      <dc:date>2021-11-30T17:19:10Z</dc:date>
    </item>
    <item>
      <title>Re: Pearson Correlation test?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/Pearson-Correlation-test/m-p/783222#M31983</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/406060"&gt;@Pixydust12&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Of course, the relationship might be non-linear, and it could go in either direction:&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Air pollution levels can be greater in counties near poultry farms.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Air pollution levels can be smaller / lower in counties near poultry farms.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;The plot will tell you (unless it looks like a random scatter), but you should back the plot up with a hypothesis test or a small model that "proves" this.&lt;/SPAN&gt;&lt;/P&gt;
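&lt;P&gt;&lt;SPAN&gt;A minimal sketch of such a test with PROC CORR, assuming a merged data set called merged with hypothetical variables pollution_level and n_farms:&lt;/SPAN&gt;&lt;/P&gt;
&lt;PRE&gt;/* Hypothetical names: merged, pollution_level, n_farms */
proc corr data=merged pearson spearman;
  /* each coefficient is printed with a p-value for H0: Rho=0 */
  var pollution_level n_farms;
run;&lt;/PRE&gt;
&lt;P&gt;&lt;SPAN&gt;The SPEARMAN option adds a rank-based coefficient, which can pick up a monotone relationship even when it is not linear.&lt;/SPAN&gt;&lt;/P&gt;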
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Koen&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 30 Nov 2021 17:31:13 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/Pearson-Correlation-test/m-p/783222#M31983</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2021-11-30T17:31:13Z</dc:date>
    </item>
    <item>
      <title>Re: Pearson Correlation test?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/Pearson-Correlation-test/m-p/783232#M31986</link>
      <description>Question: Would I graph or use a scatter plot? I know air pollution levels are a continuous variable, but is number of poultry farms in each county continuous or discrete?</description>
      <pubDate>Tue, 30 Nov 2021 17:52:58 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/Pearson-Correlation-test/m-p/783232#M31986</guid>
      <dc:creator>Pixydust12</dc:creator>
      <dc:date>2021-11-30T17:52:58Z</dc:date>
    </item>
    <item>
      <title>Re: Pearson Correlation test?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/Pearson-Correlation-test/m-p/783243#M31988</link>
      <description>&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Pixydust12_0-1638297098996.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/66291i65219673DFCA2115/image-size/medium?v=v2&amp;amp;px=400" role="button" title="Pixydust12_0-1638297098996.png" alt="Pixydust12_0-1638297098996.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;My code:&lt;/P&gt;&lt;P&gt;proc sgplot data=Iowa;&lt;BR /&gt;reg x=Poultry_Farms y=Air_Pollution/ clm cli;&lt;BR /&gt;run;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Should I even do a scatter plot since one variable is continuous and one is discrete? I think I'm confusing myself. My goal is to see if there is a correlation between the two.&lt;/P&gt;</description>
      <pubDate>Tue, 30 Nov 2021 18:33:38 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/Pearson-Correlation-test/m-p/783243#M31988</guid>
      <dc:creator>Pixydust12</dc:creator>
      <dc:date>2021-11-30T18:33:38Z</dc:date>
    </item>
    <item>
      <title>Re: Pearson Correlation test?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/Pearson-Correlation-test/m-p/783328#M31989</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;A href="https://communities.sas.com/t5/user/viewprofilepage/user-id/406060" target="_blank"&gt;@Pixydust12&lt;/A&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;That's the plot I asked for. It's no problem to mix continuous and discrete values. To me it looks as if there is no relationship. Correspondingly, the correlation coefficient should be close to zero.&lt;/P&gt;&lt;P&gt;Cheers Markus&lt;/P&gt;</description>
      <pubDate>Wed, 01 Dec 2021 08:23:03 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/Pearson-Correlation-test/m-p/783328#M31989</guid>
      <dc:creator>MarkusWeick</dc:creator>
      <dc:date>2021-12-01T08:23:03Z</dc:date>
    </item>
    <item>
      <title>Re: Pearson Correlation test?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/Pearson-Correlation-test/m-p/783416#M32006</link>
      <description>Hi Mark,&lt;BR /&gt;Last question. If I have 5 years' worth of air pollution data, but only 1 year's worth of poultry farm data, could I compare the mean air pollution to the number of poultry farms with a Pearson correlation? The scatter plot above is an example of the mean levels for the past 5 years and the # of poultry farms for one year.</description>
      <pubDate>Wed, 01 Dec 2021 16:43:52 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/Pearson-Correlation-test/m-p/783416#M32006</guid>
      <dc:creator>Pixydust12</dc:creator>
      <dc:date>2021-12-01T16:43:52Z</dc:date>
    </item>
    <item>
      <title>Re: Pearson Correlation test?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/Pearson-Correlation-test/m-p/783436#M32009</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;A href="https://communities.sas.com/t5/user/viewprofilepage/user-id/406060" target="_blank" rel="noopener"&gt;@Pixydust12&lt;/A&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I would assume the number of poultry farms to be rather stable. So taking the 5-year mean of the air pollution should be OK. But to be on the safe side (if there is a safe side in statistics), I would also make the plot with the corresponding 1-year air pollution data.&lt;/P&gt;&lt;P&gt;Cheers Markus&lt;/P&gt;</description>
      <pubDate>Wed, 01 Dec 2021 18:36:38 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/Pearson-Correlation-test/m-p/783436#M32009</guid>
      <dc:creator>MarkusWeick</dc:creator>
      <dc:date>2021-12-01T18:36:38Z</dc:date>
    </item>
  </channel>
</rss>

