Statistical Procedures

sachin01663 · Posted 01-23-2018 04:35 AM

Hello,

I am doing the logistic regression and ranking the final output. In the output, when I rank the observations into deciles, I dont get equal number of obs in each decile. Infact, in some deciles its as low as 35 and highest in a decile is 150. I understand the ranking is done based on predicticted probabilities, but I am wondering is this an indication of wrong prediction or ranking as most of the obs have exactly same prediction score?

Can it be becuase I am using a very few predictors (4 to 5) and all of them are categorical, hence many obs have exactly same predicted probabilities?

Thanks as always.

Sachin

Reeza · Posted 01-23-2018 10:48 AM

Can it be becuase I am using a very few predictors (4 to 5) and all of them are categorical, hence many obs have exactly same predicted probabilities?

Yes, if you have few predictors that are all categorical you only have a few values.

For example, if you have 3 predictors with 3 levels each, then you have a total of 3*3 = 9 possible predicted values total.

View solution in original post

PaigeMiller · Posted 01-23-2018 08:17 AM

How are you doing the ranking? Are there ties in the data?

--
Paige Miller

ballardw · Posted 01-23-2018 10:43 AM

@sachin01663 wrote:

Hello,

I am doing the logistic regression and ranking the final output. In the output, when I rank the observations into deciles, I dont get equal number of obs in each decile. Infact, in some deciles its as low as 35 and highest in a decile is 150. I understand the ranking is done based on predicticted probabilities, but I am wondering is this an indication of wrong prediction or ranking as most of the obs have exactly same prediction score?

Can it be becuase I am using a very few predictors (4 to 5) and all of them are categorical, hence many obs have exactly same predicted probabilities?

Thanks as always.

Sachin

How are you ranking "the final output"? It might help to show the code used.

Reeza · Posted 01-23-2018 10:48 AM

Can it be becuase I am using a very few predictors (4 to 5) and all of them are categorical, hence many obs have exactly same predicted probabilities?

Yes, if you have few predictors that are all categorical you only have a few values.

For example, if you have 3 predictors with 3 levels each, then you have a total of 3*3 = 9 possible predicted values total.

StatDave · Posted 01-23-2018 02:36 PM

There is no reason to expect that the decile groups, determined using the predicted probabilities, will have equal sizes. It is not an indication of any problem.

sachin01663 · Posted 01-29-2018 10:58 AM

Thanks everyone.

I was expecting the same as I have very few predictors with less levels hence quite a few customers are getting same probability score.

Appreciate your time !!

Sachin

Statistical Procedures

Not equal volume in Deciles - Logistic Regression

Re: Not equal volume in Deciles - Logistic Regression

Re: Not equal volume in Deciles - Logistic Regression

Re: Not equal volume in Deciles - Logistic Regression

Re: Not equal volume in Deciles - Logistic Regression

Re: Not equal volume in Deciles - Logistic Regression

Re: Not equal volume in Deciles - Logistic Regression

Logistic Regression

score new data-logistic regression

Logistic regression- restore in permanent library

A Guide to Logistic Regression in SAS

Data-Driven Analytics in SAS Viya – Logistic Regression Lift and ROC C...

Follow Us

What is...

Statistical Procedures

Join us for our biggest event of the year!

Follow Us

What is...