Hello @AlainX,
@AlainX wrote:
Should I take into account the 0.50 Prob Level in order to compare?
Yes, I think this comes closest to the SPSS table. In both cases the known "(non-)responses" (i.e., values of variable Coupon) in the analysis dataset are compared to the predicted (non-)responses, where a subject is predicted to "respond" if the response probability according to the logistic regression model is >=0.5. I suspect that SPSS uses a different definition of the response probability than SAS and that this explains the small differences in the results: SPSS counts 20 correctly predicted responders, whereas SAS shows 19 in column "Correct Event" for Prob Level 0.500.
Indeed, the definition used by SAS, as described in Predicted Probability of an Event for Classification, does not simply use the estimated coefficients from the "Analysis of Maximum Likelihood Estimates" table. Compare this to the SPSS documentation https://www.ibm.com/docs/en/spss-statistics/SaaS?topic=ucslracr-classification mentioning the "model-predicted logit" (possibly rather the simple definition) and that "[c]ases are weighted by finalweight" (not sure if this applies to your data at all, unless you're dealing with complex survey data).
By using an OUTPUT statement like
output out=pred predprobs=(i x);
in your PROC LOGISTIC step, you can create a dataset PRED that contains both types of response probabilities (variables IP_... use the simple definition, variables XP_... the other one) for each subject.Then you can count the observations where these response probabilities are >=0.5.
@AlainX wrote:
And why we say "Percent Conc/Disc", we are talking about something else?
Exactly. These percentages belong to a different consideration: Each of the possible 40*60=2400 pairs of the 40 responders and the 60 non-responders is assessed with respect to the (simply) predicted response probability: whether that probability is greater for the responder of the pair (meaning "concordance") or for the non-responder ("discordance") or whether there is no difference ("tie"). So the denominator of these percentages is the number of pairs (here: 2400), not a number of subjects as in the classification tables.
... View more