SAS Miner - Impute new scoring data?

giant_wolf00 — Tue, 07 Nov 2017 09:35:18 GMT

Hi,

I'm relatively new to SAS Miner so please forgive me if this is a really stupid question!

I created a model which uses 18 variables in it, of which 9 are imputed variables, due to some columns having a high proportion of nulls. Based on the scoring outputs of the test partition, the created model looked to be fairly predictive (50% of those which had the outcome I was trying to predict, featured in the 10% of scores, c80% in the top 20%).

However, when I came to score some brand new data, whilst the top decile still performed ok (c.50% of the top decile had the outcome I was trying to predict vs. an overall 28%), there were large swathes of records with identical model scores which means some of the "middle" deciles are not performing as expected as they are smeared in the middle. There are similar levels of nulls in this data too and it is these nulls and the fact that I have used imputed columns in the model creation which prompts my question.

When scoring new data - does Miner factor in the previously used impute, or do I need to feed the new data through an impute before scoring too? If the former - could something else be wrong which is causing my issue?

Below is my model diagram - the model is the flow on the right, from data all the way to scoring the test partition. The flow on the left (starting highlighted yellow), is the new data I'm trying to score.

Any help, very gratefully received. Thank you.

Re: SAS Miner - Impute new scoring data?

MikeStockstill — Wed, 08 Nov 2017 21:13:15 GMT

The Score node contains the imputation scoring code that was passed to it by the Impute node. When new data is passed to the Score node for scoring, the imputation code is applied automatically to the new data.

You can see exactly what the Score node code is going to do by viewing the score code itself.

- After the Score node finishes running, right-click the Score node, and select Results.

- In the Results window, select View -> Scoring -> SAS Code. This is the code that is used to score new data.

topic SAS Miner - Impute new scoring data? in SAS Data Science

SAS Miner - Impute new scoring data?

Re: SAS Miner - Impute new scoring data?