Good morning all!
I'm doing logistic regression modeling with 7 categorical predictor variables and it seems everything is coming up significant and is being included in the final model. Is this simply due to having a rather large sample size (50k+)?
Thanks,
Brian
That's actually 4+5+5+1+14+1+8+4 = 42 variables but still with 50,000 rows you're likely fine. Assuming no interaction. I would be adding confidence intervals to my estimates and seeing if the effects are large enough to matter.
Reeza,
Thanks for the reply! The eight variables have 5, 6, 6, 2, 15, 2, 9, and 4 levels.
Brian
That's actually 4+5+5+1+14+1+8+4 = 42 variables but still with 50,000 rows you're likely fine. Assuming no interaction. I would be adding confidence intervals to my estimates and seeing if the effects are large enough to matter.
If you are using a WEIGHT statement, this is quite common unless you also use the NORMALIZE option in that statement.
StatDave,
Thanks for the reply. I'm not using the weight statement as my data is not aggregated. I have one line per person and a response variable, comply, that is 0 or 1depending on if the person is in compliance.
Brian
Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!
ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.
Find more tutorials on the SAS Users YouTube channel.