Hi,
This can be tested experimentally, but I would like to prime the work with some discussion of variable distributions and primitive vs. advanced normalization. I have a distribution of times to be used in logistic regression. Majority of the times are short, so the distribution is skewed. I want to rank the times (i.e., bin them into groups of equal size) and use them as an ordinal variable.
Is it the same as setting a unit change in time to group size in the UNITS statement?
Ranked times will have a uniform distribution - is it legit to use such variable in PROC LOGISTICS?
Should I be setting some sort of a reference level, or can I use the ranked time as a continuous-ordinal variable?
Many thanks!
... View more