BookmarkSubscribeRSS Feed
Zachary
Obsidian | Level 7

I am using the Save Data node in Enteprise Miner. Both the P_DV & V_DV variables are created. P_DV is the predicted value of the dependent variable. Likewise, V_DV is the validation value of the dependent variable, but I am not so sure as to why they would be different on a record basis.

But what is the R_DV variable?

Thank you.

1 REPLY 1
WendyCzika
SAS Employee

R_DV is the residual, DV - P_DV.

And just a little more about V_DV since that can be confusing.  The V_ variables are only calculated for decision trees since for all other models, the V_ and P_ columns would be the same.  For decisions trees, V_ represents the proportion of validation obs in the same leaf as the current obs. that have a certain target level for a classification tree, or the mean of the target for the validation obs. in the same leaf as the current obs. for a regression tree.

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 1253 views
  • 0 likes
  • 2 in conversation