I want to check categorical variable distribution over a period of time. For example, the distribution of variable district code has changed over a period of time ( Time 0 - Period of model Development, time1 - after one year).
I perform the following analysis for numeric variables.
Step 1 - Rank variable into 10 group (decile)
Step 2- (% of records based on variable in Scoring Sample (A) - % of records based on variable in Training Sample (B)) * In(A/ B)
Step 3 - Then sum up the scores in step2 on 10 groups
I'm little bit skeptical about the same analysis for categorical variables. I guess chi square analysis isn't correct technique to check it. Am i correct? What's the correct way?
Take a look at the the options in proc freq, there's a test for trends over time.
Which test?
This categorical variable is nominal or ordered ?
and how many levels does this variable have ?
If two level , you could try TRAND analysis.
tables var * time / trend cl ;
Otherwise , try MEASURE analysis.
tables var* Dose / measures cl;
Xia Keshan
TREND test
Trend test is only for 2*N matrix , if I was right . while MEASURE(Association test) is for N*M matrix.
OP mentioned 2*N - Time before and Time after measurements.
Available on demand!
Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.
ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.
Find more tutorials on the SAS Users YouTube channel.