Hi,
I have built a logistic regression model to predict customers who are more likely to click on a marketing offer. My model is not great as I have a AUC=0.58 and 50% capture only 60%. So I have built the model on a training sample (9 months) then kept 2 months of data as sample held out from model dev (Validation) to test my prediction. Once I finished to build the model on the training, I have scored the validation (2 months dataset) then sorted the scored validation, and sliced the target variable and probabilities into deciles ..like below...It doesn't seem that my model is ranking well ...Is it because the AUC is low..so the predictive power is not great? Your help will be much appreciated....Many Thanks
proc rank data=VALIDATION (KEEP=CLICK_FLAG EM_EVENTPROBABILITY) groups=10 out=pred_valid;
var EM_EVENTPROBABILITY; ranks decile;
run;
data pred_valid;
set pred_valid;
decile=decile+1;
run;
proc sort data=pred_valid;
by descending EM_EVENTPROBABILITY;
run;
proc means data=pred_valid n mean sum;
var CLICK_FLAG;
class decile;
run;
proc means data=pred_validn mean sum;
var EM_EVENTPROBABILITY;
class decile;
run;
| Decile | Training Response Rate | Training Mean_Score | Validation Response Rate | Validation Mean_Score |
| 10 | 10% | 0.10 | 6% | 0.10 |
| 9 | 9% | 0.08 | 6% | 0.08 |
| 8 | 7% | 0.07 | 7% | 0.07 |
| 7 | 6% | 0.06 | 7% | 0.06 |
| 6 | 6% | 0.06 | 7% | 0.06 |
| 5 | 6% | 0.06 | 7% | 0.06 |
| 4 | 6% | 0.05 | 7% | 0.05 |
| 3 | 5% | 0.05 | 6% | 0.05 |
| 2 | 5% | 0.05 | 5% | 0.05 |
| 1 | 4% | 0.03 | 7% | 0.03 |
Kanyange, would you mind sharing your modeling code? It would help to understand the options you selected, along with a snapshot of the output.
Thanks,
Jonathan
Hi Jonathan,
Thank you for your reply,...yes sure , I have attached the Code and the output...Please let me know if you can open the attachement,....
Cheers
Kanyange, this question would be best posted in the SAS Stat Community
Given your needs with PROC LOGISTIC, they would be best to respond appropriately.
Let me know if you need any more help.
Thanks,
Jonathan
April 27 – 30 | Gaylord Texan | Grapevine, Texas
Walk in ready to learn. Walk out ready to deliver. This is the data and AI conference you can't afford to miss.
Register now and lock in 2025 pricing—just $495!
Still thinking about your presentation idea? The submission deadline has been extended to Friday, Nov. 14, at 11:59 p.m. ET.
Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.
Find more tutorials on the SAS Users YouTube channel.