BookmarkSubscribeRSS Feed
nathanb_1993
Calcite | Level 5

Hi,

 

I have a dataset which has been created with both predicted and actual values from a logistic regression model (the only other information in the dataset are the original inputs into the model). 

 

The dataset was already created so no PROC LOGISTIC / GLM has been used.

 

I know all of the individual parameters in the model and their estimates, however I need to do some analysis on the residuals which are not included in the dataset.

 

Probably a longshot, but does anyone know how I could compute the residuals from these logistic regression model estimates and actuals?

 

Thanks!

 

Nathan

5 REPLIES 5
SAS_Rob
SAS Employee

I assume when you say you have the predicted values these are not predicted probabilities but some kind of classification based on the predicted probability.  If it is actually the predicted probability then you should be able to compute the residual in a data step.

 

If my assumption is correct then you could put the parameter estimates into a data set that is in the correct format for use as input into Proc LOGISTIC with the INEST= option and MAXITER=0.  You could then use the OUTPUT statement to recreate the predicted probabilities/residuals. 

 

You could also use one of the methods here as well http://support.sas.com/kb/33/307.html

 

 

 

Reeza
Super User

Residuals are actuals - estimate. 

nathanb_1993
Calcite | Level 5
Hi Reeza,

Agreed, though due to this being a logistic regression, rather than simple liner, can it be computed in a manner as simple as this?

I actually think my statistical knowledge is getting all a bit confused!
Reeza
Super User

I believe so, but if you're really, really concerned, test it. Run one of the demo codes for logistic regressions and see if the residuals are the same calculating it both ways.

StatDave
SAS Super FREQ

You say you have the actual response values and all the inputs, which suggests you can simply refit the model in PROC LOGISTIC and use the OUTPUT statement to generate the regression diagnostics of interest. 

SAS Innovate 2025: Register Now

Registration is now open for SAS Innovate 2025 , our biggest and most exciting global event of the year! Join us in Orlando, FL, May 6-9.
Sign up by Dec. 31 to get the 2024 rate of just $495.
Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 5 replies
  • 2426 views
  • 0 likes
  • 4 in conversation