Good morning all!
I'm doing logistic regression modeling with 7 categorical predictor variables and it seems everything is coming up significant and is being included in the final model. Is this simply due to having a rather large sample size (50k+)?
Thanks,
Brian
That's actually 4+5+5+1+14+1+8+4 = 42 variables but still with 50,000 rows you're likely fine. Assuming no interaction. I would be adding confidence intervals to my estimates and seeing if the effects are large enough to matter.
Reeza,
Thanks for the reply! The eight variables have 5, 6, 6, 2, 15, 2, 9, and 4 levels.
Brian
That's actually 4+5+5+1+14+1+8+4 = 42 variables but still with 50,000 rows you're likely fine. Assuming no interaction. I would be adding confidence intervals to my estimates and seeing if the effects are large enough to matter.
If you are using a WEIGHT statement, this is quite common unless you also use the NORMALIZE option in that statement.
StatDave,
Thanks for the reply. I'm not using the weight statement as my data is not aggregated. I have one line per person and a response variable, comply, that is 0 or 1depending on if the person is in compliance.
Brian
Available on demand!
Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.
ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.
Find more tutorials on the SAS Users YouTube channel.