Statistical Procedures

drteju · Posted 01-27-2020 10:13 PM

I am trying to write code in SAS University edition to do a 3-way cross tabulation. For example, if I have 3 categorical variables and I want to look at presisting condition status(preex_status) and insurance coverage(coverage_status) by year whether before or after the policy went into effect (year_class). Each variable has two categories the year_Class has 'pre' and 'post' and the other two have 0 and 1 for yes or no response. I wrote following code:

proc freq data = mydata;

tables preex_status*coverage_status*year_Class / chisq;

run;

However it gives a 2*2 table controlling for preex_status. I cannot use the 'By' statement gives me an error saying "data is not sorted in ascending sequence. I was expecting a table as below:

	Pre	Post
Coverage status	Pre-existing condition	No	Pre-existing condition	No
Yes
No

Is this possible or am I using the wrong statistical test? Can someone help me solve it? Thank yo

drteju · Posted 01-27-2020 11:27 PM

Thanks but that won't work. But i think i might have solved it. I just created a new variable using condition statement and created four categories for the new variable where 1. a person has condition and insurance. 2. Has condition but no insurance 3. No condition but has insurance 4. no condition and no insurance then used this new variable for a two way table with year variable using code:

proc freq data=mythesis.jdata ;

  tables condition_coverage*year_class/ chisq;
  weight perweight1;
run;

and got following output:

Table of condition_coverage by year_class
condition_coverage	year_class
Frequency Percent Row Pct Col Pct	post	pre	Total
nocond_hascovera	3.364E7 21.26 51.85 42.74	3.124E7 19.74 48.15 39.28	6.488E7 41.01
nocond_nocoverag	5818615 3.68 37.69 7.39	9618986 6.08 62.31 12.10	1.544E7 9.76
preex_nocoverage	3492139 2.21 36.03 4.44	6198915 3.92 63.97 7.80	9691053 6.13
prex_hascoverage	3.575E7 22.60 52.41 45.43	3.246E7 20.52 47.59 40.82	6.821E7 43.11
Total	7.87E7 49.74	7.951E7 50.26	1.582E8 100.00

Looks like that solves the question. Will update once i get it verified if this solution is correct. Thanks again.

View solution in original post

unison · Posted 01-27-2020 10:22 PM

Looks like a similar question was asked here: https://communities.sas.com/t5/Statistical-Procedures/3-way-cross-tabulations-chi-square-tests/td-p/...

-unison

drteju · Posted 01-27-2020 10:24 PM

Hi, thank you for the reply. I have tried that solution but does not work in my case. It gives me an error message that, "data is not sorted in ascending sequence'. Thanks.

unison · Posted 01-27-2020 10:30 PM

Try running this prior to proc freq:

proc sort data=mydata;
by preex_status coverage_status year_Class;
run;

-unison

drteju · Posted 01-27-2020 10:37 PM

Thank you @unison that definitely helped and I could run the code. But i still got 2 tables. Is there anyway i could get one table like below:

	Before	Policy	after	policy
Coverage status	Pre-existing condition	No	Pre-existing condition	No
Yes
No

Thank you again 🙂

unison · Posted 01-27-2020 10:59 PM

What do you want to go in the blank spots?

if it’s just the frequencies then you can use

out=outfreq

On the table statement (next to your chisq option). Then you would do something like this:

proc report data=outfreq;
column sex height, count weight,count;
define sex/group;
define height/across;
define weight/across;
define count/analysis sum;
run;

-unison

drteju · Posted 01-27-2020 11:27 PM

Thanks but that won't work. But i think i might have solved it. I just created a new variable using condition statement and created four categories for the new variable where 1. a person has condition and insurance. 2. Has condition but no insurance 3. No condition but has insurance 4. no condition and no insurance then used this new variable for a two way table with year variable using code:

proc freq data=mythesis.jdata ;

  tables condition_coverage*year_class/ chisq;
  weight perweight1;
run;

and got following output:

Table of condition_coverage by year_class
condition_coverage	year_class
Frequency Percent Row Pct Col Pct	post	pre	Total
nocond_hascovera	3.364E7 21.26 51.85 42.74	3.124E7 19.74 48.15 39.28	6.488E7 41.01
nocond_nocoverag	5818615 3.68 37.69 7.39	9618986 6.08 62.31 12.10	1.544E7 9.76
preex_nocoverage	3492139 2.21 36.03 4.44	6198915 3.92 63.97 7.80	9691053 6.13
prex_hascoverage	3.575E7 22.60 52.41 45.43	3.246E7 20.52 47.59 40.82	6.821E7 43.11
Total	7.87E7 49.74	7.951E7 50.26	1.582E8 100.00

Looks like that solves the question. Will update once i get it verified if this solution is correct. Thanks again.

Statistical Procedures

3 way cross tabulation (chi square test)

Re: 3 way cross tabulation (chi square test)

Re: 3 way cross tabulation (chi square test)

Re: 3 way cross tabulation (chi square test)

Re: 3 way cross tabulation (chi square test)

Re: 3 way cross tabulation (chi square test)

Re: 3 way cross tabulation (chi square test)

Re: 3 way cross tabulation (chi square test)

Cross Tabulation 3 categorical

Chi square test?

chi-square test / compute variable worth

Chi squared test for unequal sample sizes

code for chi square test from cards input

Follow Us

What is...

Statistical Procedures

Our biggest data and AI event of the year.

Follow Us

What is...