08-14-2015 02:26 PM
I am using the Save Data node in Enteprise Miner. Both the P_DV & V_DV variables are created. P_DV is the predicted value of the dependent variable. Likewise, V_DV is the validation value of the dependent variable, but I am not so sure as to why they would be different on a record basis.
But what is the R_DV variable?
08-14-2015 03:14 PM
R_DV is the residual, DV - P_DV.
And just a little more about V_DV since that can be confusing. The V_ variables are only calculated for decision trees since for all other models, the V_ and P_ columns would be the same. For decisions trees, V_ represents the proportion of validation obs in the same leaf as the current obs. that have a certain target level for a classification tree, or the mean of the target for the validation obs. in the same leaf as the current obs. for a regression tree.