Hi,
I am working in a logistic regression using 'proc logistic'. One of the input variables is 'age' (from 18 to 78). I have created a new var 'age_levels' that is the result of making ranges from 'age' using a clustering, it has 4 levels.
Using proc freq I can see that there is a dependency between 'age_levels' and the target var of the model.
My doubt is wheter var I have to use in the model of proc logistic , 'age' or 'age_levels' as input var.
Can anybody help me??
Thanks
You can use any variable(s) that you think are appropriate in such a model.
I usually avoid clustering of input variables such as age into four levels. This seems to me to throw away potentially useful information that is contained in age.
Available on demand!
Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.
ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.
Find more tutorials on the SAS Users YouTube channel.