SAS Programming

DATA Step, Macro, Functions and more
holy9ner
Calcite | Level 5

Afternoon everyone, 

 

I was wondering how you go about proving that categorical variables are normal. For instance, I have married and not married, but how do I show that the distribution is normal for these two categories? A P-P or Q-Q plot will not work, as those are meant for continuous variables. Can anyone help me? What are some visual ways to show that it is normal? I am only asking because we have to show that our data come from a normal distribution in order to use it.

 

Thanks

4 REPLIES
PGStats
Opal | Level 21

The Normal distribution is for continuous quantities (weight, height, blood pressure). It is not appropriate for ordinal (age group, produce grade, education level) or nominal (gender, race) variables.

PG
holy9ner
Calcite | Level 5

Thanks PG, 

 

When we are doing a significance test or a difference in proportions, we only have the numerical test and no graphs to judge normality and independence for categorical variables.

PGStats
Opal | Level 21

What test(s) are you performing? Please share the SAS code.

PG
holy9ner
Calcite | Level 5
*95% confidence interval of the difference between not married and married;
proc freq data = project.termlifepartb;
tables marcat*policy/ norow riskdiff(cl=(wald mn));
run;

*Hypothesis test (alpha = 0.05) of married vs. unmarried by policy (yes or no);
proc freq data=project.termlifepartb order=data;
tables marcat*policy/ nopercent norow chisq relrisk;
run;

The first bit of code gives a 95% confidence interval for the difference in the proportion holding an insurance policy between the marriage categories, and the second bit of code is a hypothesis test of that difference in proportions. I was hypothesizing that people who are married will buy more insurance. In the output from my second bit of code, the chi-square p-value is less than .05, so I reject the hypothesis that there is no difference and say that married people are almost 2.5 times as likely to possess term life insurance.

 

My issue is that there are two conditions for a difference in proportions: the sample proportions must be based on independent, randomly selected samples from two populations, and np and n(1-p) must be greater than or equal to 10. My question is: how do I know the samples are random if I have no way to test categorical variables?

 

I attached my file as well.
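
The sample-size condition mentioned above (np and n(1-p) at least 10 in each group) can be read straight off the observed cell counts of the 2x2 table, since within each marriage category the two counts are n times the sample proportion and n times its complement. A minimal sketch, assuming the same dataset and variable names as the code above (project.termlifepartb, marcat, policy); the EXPECTED option additionally prints the expected counts under independence, which a common rule of thumb for the chi-square test wants to be at least 5 in every cell:

```sas
*Observed and expected cell counts for the 2x2 table;
*Each observed count should be at least 10 for the large-sample interval and test;
proc freq data=project.termlifepartb;
tables marcat*policy / norow nocol nopercent expected;
run;
```

Note that this only addresses the sample-size condition. Whether the samples are random and independent is a property of how the data were collected, so it has to be justified from the study design rather than from a plot or a test on the data.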

