About StuartE

StuartE · ‎05-26-2017

Hi Funda_SAS I should have clarified my question; what I am trying to understand is how within a specific modelling node (e.g. a regression node) does the modelling node utilise the training and validation data to arrive at the chosen model? Given the case where I partition my data 60/40 training/validation and no test data, when I pass the data to a modelling node (regression, decision tree, neural network etc), I am guessing EM will use the combination of training and validation data to iteratively select the best model (i.e. training the model on the training data and using the validation data to generalise the model and avoid overfitting - you mention hyperparameter tuning in your reply). This is before the result is sent to a model comparison node to select the best from a range of models. So my question really is about what goes on with the training and validation data sets within an individual modelling node and how does this differ from other techniques such as cross validation?

StuartE · ‎05-21-2017

When using SAS Enterprise Miner to perform logistic regression on partitioned data, how does EMiner select the "best" model? Assuming you have partitioned data into Training and Validation data sets (and have selected Validation Misclassification Error as the metric to optimise - ignore test data sets for now), how does EMiner iterate through the two data sets to arrive at the best model? How does the default methodology compare to or differ from other model training techniques such as k-fold cross validation, and what would be the equivalent methodology if modelling in R?

Online Status	Offline
Date Last Visited	‎05-27-2017 11:40 PM

Re: How does SAS Enterprise Miner select the best model in a regressio...

How does SAS Enterprise Miner select the best model in a regression no...

Re: How does SAS Enterprise Miner select the best model in a regressio...

Re: How does SAS Enterprise Miner select the best model in a regressio...

How does SAS Enterprise Miner select the best model in a regression no...