08-04-2015 05:04 PM
I want to check categorical variable distribution over a period of time. For example, the distribution of variable district code has changed over a period of time ( Time 0 - Period of model Development, time1 - after one year).
I perform the following analysis for numeric variables.
Step 1 - Rank variable into 10 group (decile)
Step 2- (% of records based on variable in Scoring Sample (A) - % of records based on variable in Training Sample (B)) * In(A/ B)
Step 3 - Then sum up the scores in step2 on 10 groups
I'm little bit skeptical about the same analysis for categorical variables. I guess chi square analysis isn't correct technique to check it. Am i correct? What's the correct way?
08-05-2015 09:02 AM
This categorical variable is nominal or ordered ?
and how many levels does this variable have ?
If two level , you could try TRAND analysis.
tables var * time / trend cl ;
Otherwise , try MEASURE analysis.
tables var* Dose / measures cl;