BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
sachin01663
Obsidian | Level 7

Hello,

 

I am doing the logistic regression and ranking the final output. In the output, when I rank the observations into deciles, I dont get equal number of obs in each decile. Infact, in some deciles its as low as 35 and highest in a decile is 150. I understand the ranking is done based on predicticted probabilities, but I am wondering is this an indication of wrong prediction or ranking as most of the obs have exactly same prediction score?

 

Can it be becuase I am using a very few predictors  (4 to 5) and all of them are categorical, hence many obs have exactly same predicted probabilities?

 

Thanks as always.

Sachin

 

1 ACCEPTED SOLUTION

Accepted Solutions
Reeza
Super User

Can it be becuase I am using a very few predictors  (4 to 5) and all of them are categorical, hence many obs have exactly same predicted probabilities?

 

Yes, if you have few predictors that are all categorical you only have a few values. 

For example, if you have 3 predictors with 3 levels each, then you have  a total of 3*3 = 9 possible predicted values total.

 

View solution in original post

5 REPLIES 5
PaigeMiller
Diamond | Level 26

How are you doing the ranking? Are there ties in the data?

 

 

--
Paige Miller
ballardw
Super User

@sachin01663 wrote:

Hello,

 

I am doing the logistic regression and ranking the final output. In the output, when I rank the observations into deciles, I dont get equal number of obs in each decile. Infact, in some deciles its as low as 35 and highest in a decile is 150. I understand the ranking is done based on predicticted probabilities, but I am wondering is this an indication of wrong prediction or ranking as most of the obs have exactly same prediction score?

 

Can it be becuase I am using a very few predictors  (4 to 5) and all of them are categorical, hence many obs have exactly same predicted probabilities?

 

Thanks as always.

Sachin

 


How are you ranking "the final output"? It might help to show the code used.

Reeza
Super User

Can it be becuase I am using a very few predictors  (4 to 5) and all of them are categorical, hence many obs have exactly same predicted probabilities?

 

Yes, if you have few predictors that are all categorical you only have a few values. 

For example, if you have 3 predictors with 3 levels each, then you have  a total of 3*3 = 9 possible predicted values total.

 

StatDave
SAS Super FREQ

There is no reason to expect that the decile groups, determined using the predicted probabilities, will have equal sizes. It is not an indication of any problem. 

sachin01663
Obsidian | Level 7

Thanks everyone.

 

I was expecting the same as I have very few predictors with less levels hence quite a few customers are getting same probability score.

 

Appreciate your time !!

Sachin

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 5 replies
  • 1664 views
  • 4 likes
  • 5 in conversation