I have used start groups and end groups nodes to perform 5-fold cross validation on a modelling node in SAS EM, grouping on a random variable in my training data which I created for this purpose. I now wish to use the model I have created to score up a new dataset. When I export the scoring code I can see that it is referencing the random variable that I created for the purpose of cross-validation in the scoring code, but this variable is not present in my new data as it was only created for the purpose of the cross-validation. Unless I am mis-reading the code it appears to use the value of the random variable to score each of the 5 segments of the data differently. The datasets which I am scoring in the live environment could be fairly small (only a fewthousand records at a time so I don't feel that this would be appropriate) How do I apply the scoring code to my new data so that every observation is scored consistently?
... View more