Building models with SAS Enterprise Miner, SAS Factory Miner, SAS Visual Data Mining and Machine Learning or just with programming

Predictive modelling on non-random data

Occasional Learner
Posts: 1

Predictive modelling on non-random data

Hi all

I am using predictive modelling to predict which customers to send out email newsletters to. I am using e.g. decision trees, logistic regression etc.

I have some datasets where all the customers (one ID per customer) have received a newsletter, which means I can do a simple random sample based on this newsletter. However, I also have datasets where the newsletter where sent out to only a subset of customers based on a predefined affinity logic. How would you deal with such a dataset? Is it even possible to use it for predictive modelling as it will not have a random representation of people who are interested in the newsletter and people who are not.

Second, how would I keep validating a model that has been created? Because the data I will now have is going to be non-random as it is based upon the model I have created. When I then want to improve the model to validate whether the model is still stable or should be changed, I guess I need to sent out the newsletter to a random selection of those who otherwise are predicted to not receive the newsletter?

Thanks a lot for your help in advance,


Super Contributor
Posts: 336

Re: Predictive modelling on non-random data

You can do an incremental response model (a.k.a. net lift model) for the data with campains when you only sent the offer to certain customers.


A paper that uses direct marketing as an example:

Net Lift Model for Effective Direct Marketing Campaigns at


A paper that goes a little deeper into theroy and an example:

Incremental Response Modeling Using SAS® Enterprise Miner™


I hope this helps!



Ask a Question
Discussion stats
  • 1 reply
  • 2 in conversation