Solved: Re: Decision Tree Output- difference between "Validated:Target_B=1" an...

ravi4 · Posted 12-19-2016 09:48 AM

Hello,
I'd be grateful for some clarity regarding two variables that are created after running a Decision Tree to predict a binary target:
1) label "Validated:Target_B=1" with variable name "V_TARGET_B1", and
2) label "Predicted:Target_B=1" with variable name "P_TARGET_B1".
There seems to be a small difference between the two probabilities, mostly in the third digit after the decimal. What is the difference between these variables? I find them in both the training and the validation datasets of the output.
Thanks in advance,
Ravi.

WendyCzika · Posted 12-19-2016 04:21 PM

The V_ variables are the predictions based on the validation data: the proportion of validation observations with Target_B=1 in whichever leaf the current observation falls into. The P_ variables are the same thing computed from the training partition, and the actual prediction is based on the P_ value.
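
To make the distinction concrete, here is a minimal sketch in Python (not SAS code, and not how Enterprise Miner is implemented internally). It assumes each observation has already been assigned to a leaf; the leaf IDs, the toy data, and the helper names leaf_proportions and score are all hypothetical. It computes the per-leaf proportion of Target_B=1 once from the training partition and once from the validation partition, which is why the two values are close but not identical.

```python
from collections import defaultdict

def leaf_proportions(leaf_ids, targets):
    """Return {leaf_id: share of observations in that leaf with target == 1}."""
    counts = defaultdict(lambda: [0, 0])  # leaf -> [n_events, n_total]
    for leaf, y in zip(leaf_ids, targets):
        counts[leaf][0] += int(y == 1)
        counts[leaf][1] += 1
    return {leaf: events / total for leaf, (events, total) in counts.items()}

# Hypothetical toy data: leaf assignment and binary target per observation.
train_leaves = [1, 1, 1, 1, 2, 2, 2, 2]
train_target = [1, 1, 1, 0, 0, 0, 1, 0]
valid_leaves = [1, 1, 1, 2, 2]
valid_target = [1, 1, 0, 0, 0]

# Same calculation, different partition.
p_by_leaf = leaf_proportions(train_leaves, train_target)  # role of P_TARGET_B1
v_by_leaf = leaf_proportions(valid_leaves, valid_target)  # role of V_TARGET_B1

def score(leaf):
    """Both values depend only on the leaf, so any observation
    (training or validation) landing in that leaf gets the same pair."""
    return {"P_TARGET_B1": p_by_leaf[leaf], "V_TARGET_B1": v_by_leaf.get(leaf)}

print(score(1))
print(score(2))
```

Because both values are looked up by leaf, they appear on every scored row regardless of which partition the row belongs to, which matches seeing both columns in the training and the validation output.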
