<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Correlating categorical variables in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Correlating-categorical-variables/m-p/787552#M38602</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you in advance for your help. I am trying to a correlation between an ordinal variable and a grouped discrete variable using SAS studio. The first variable is (referred to as "Genome") is likert scale and has 3 levels (agree, undecided, and disagree). The second (referred to as "Events") has 5 levels (0-1, 2-3, 4-5, 6+). I have attached a spreadsheet with this data for reference. I want to know wether event attendance influences acceptance of certain technologies although am running into issues converting the ordinal variables to ranks. For example, I want the variables to be coded: (agree= 1, undecided= 2, disagree= 3) and ( 0-1=1, 2-3=2, 4-5=3 and 6+ =4). To my understanding converting the ordinal variables to ranks is the only way to correlate this data. Is it true?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Tue, 28 Dec 2021 21:27:50 GMT</pubDate>
    <dc:creator>KeeganT</dc:creator>
    <dc:date>2021-12-28T21:27:50Z</dc:date>
    <item>
      <title>Correlating categorical variables</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Correlating-categorical-variables/m-p/787552#M38602</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you in advance for your help. I am trying to a correlation between an ordinal variable and a grouped discrete variable using SAS studio. The first variable is (referred to as "Genome") is likert scale and has 3 levels (agree, undecided, and disagree). The second (referred to as "Events") has 5 levels (0-1, 2-3, 4-5, 6+). I have attached a spreadsheet with this data for reference. I want to know wether event attendance influences acceptance of certain technologies although am running into issues converting the ordinal variables to ranks. For example, I want the variables to be coded: (agree= 1, undecided= 2, disagree= 3) and ( 0-1=1, 2-3=2, 4-5=3 and 6+ =4). To my understanding converting the ordinal variables to ranks is the only way to correlate this data. Is it true?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 28 Dec 2021 21:27:50 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Correlating-categorical-variables/m-p/787552#M38602</guid>
      <dc:creator>KeeganT</dc:creator>
      <dc:date>2021-12-28T21:27:50Z</dc:date>
    </item>
    <item>
      <title>Re: Correlating categorical variables</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Correlating-categorical-variables/m-p/787559#M38603</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I would build a model (predicting the acceptance from event attendance).&lt;/P&gt;
&lt;P&gt;You can do that with PROC LOGISTIC for example.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Or you can use CORRESPONDENCE analysis.&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Correspondence analysis is a technique to uncover patterns and structures in&amp;nbsp;&lt;/SPAN&gt;&lt;EM&gt;categorical&lt;/EM&gt;&lt;SPAN&gt;&amp;nbsp;data.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;SAS/STAT® 15.2 User's Guide&lt;BR /&gt;The CORRESP Procedure&lt;BR /&gt;Getting Started: CORRESP Procedure&lt;BR /&gt;&lt;A href="https://go.documentation.sas.com/doc/en/statug/15.2/statug_corresp_gettingstarted.htm" target="_blank"&gt;https://go.documentation.sas.com/doc/en/statug/15.2/statug_corresp_gettingstarted.htm&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Koen&lt;/P&gt;</description>
      <pubDate>Tue, 28 Dec 2021 22:50:32 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Correlating-categorical-variables/m-p/787559#M38603</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2021-12-28T22:50:32Z</dc:date>
    </item>
    <item>
      <title>Re: Correlating categorical variables</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Correlating-categorical-variables/m-p/787605#M38607</link>
      <description>&lt;P&gt;For category variables, it is called association (correlation).&lt;/P&gt;
&lt;P&gt;Try PROC FREQ.&lt;/P&gt;
&lt;P&gt;or calling&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13633"&gt;@StatDave&lt;/a&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;proc import datafile='c:\temp\Genome and Event Correlations.xlsx' out=have dbms=xlsx replace;
run;


proc freq data=have;
table genome*events/ chisq;
run;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Ksharp_0-1640781443326.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/66980i414605AF3AF72F5E/image-size/medium?v=v2&amp;amp;px=400" role="button" title="Ksharp_0-1640781443326.png" alt="Ksharp_0-1640781443326.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 29 Dec 2021 12:37:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Correlating-categorical-variables/m-p/787605#M38607</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2021-12-29T12:37:29Z</dc:date>
    </item>
    <item>
      <title>Re: Correlating categorical variables</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Correlating-categorical-variables/m-p/788010#M38640</link>
      <description>&lt;P&gt;See the descriptions of the Lambda and Uncertainty coefficients available from the MEASURES option in the FREQ procedure documentation. The variables in your data set can be used as is with no conversion. For example, the Uncertainty Coefficient (C|R) tells you that the Events variable explains about 4% of the uncertainty in the Genome variable.&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;Alternatively, if you assume that both are ordinal variables and you want to test whether increasing Events increases Genome, then use the JT option after making sure that the levels of both variables are in increasing order.&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;proc format; 
value $ genf "Disagree"="1 Disgree" "Undecided"="2 Undecided" "Agree"="3 Agree"; 
run;
data z; 
input Genome:$10.	Events $;
datalines;
Undecided	4-5
Undecided	0-1
Agree	0-1
...
;
proc freq order=formatted; 
format genome $genf.;
table events*genome/measures jt; 
run;
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Sat, 01 Jan 2022 23:07:37 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Correlating-categorical-variables/m-p/788010#M38640</guid>
      <dc:creator>StatDave</dc:creator>
      <dc:date>2022-01-01T23:07:37Z</dc:date>
    </item>
  </channel>
</rss>

