About mnil

mnil · ‎06-26-2017

Just realized through trial and error that Neural Network node in SAS EM 6.2 can not handle missing values. Imputing the missing values in my data resolved the issue. Would be greatful if someone can provide any SAS documentation on how NN node in SAS EM processes missing data.

mnil · ‎06-23-2017

Hello, I have a data for prediction (binary target) with 80K observation and 100 input variables. Methods like Gradient Boosting fit the data quite well with a validation Gini of over 70%. When I fit a Neural Network with all 100 variables, I get a Gini of around 15% (both training and validation). When I do a variable selection and use 25-odd variables in NN, the validation Gini increases to 30% - which is still materially worse than the other models. I tried the default NN in EM 6.2 with the following changes: 1. Architecture : MLP 2. #Hidden Units : 2, 3, 5, 10, 20 3. Decay : 0, 0.05, 0.1, 0.5, 1, 5, 10, 25, 50. Decay seems to hardly impact model performance. 4. Standardization : Standard Deviation and Range 5. Sufficient #iterations to ensure model convergence. No other changes to optimization properties. (Have also tried to play with some other properties like RBU, Act Function, Combination functions, direct connections etc without any material change in model) Clearly the model is converging to a local minima; the 25-variable model to a slightly better minima. Am I missing some basic setting/feature which is leading to such poor NN models? Thanks, Nil

mnil · ‎11-25-2015

Hi Miguel, Many thanks for your post. Basically, to provide a background, suppose for each split I need to check if some business rules/tests are satisfied before moving on. This is typically done in a separate SAS session. An example would be a Basel PD model, where the modeler would like to check if the (n-way) split maintains the same ordering over 5 years of historical data. If the test result is ok, you move on to the next node. If not, you try the next most significant variable. However, the first variable with (n-1)-way split may be better in terms of log-worth than this second variable and also pass the offline test. Please see below the response to the two options you shared. Option 1 : Is it a good alternative to run several trees non-interactively, and just compare their fit statistics on an unbiased test partition? Response : I am afraid no. As noted above, the objective is not to compare 3-way vs 2-way trees. Rather, we need to compae 3-way vs 2-way splits for a specific node or subtree (or business segment). Option 2 : Something that should definetly help is that someone demonstrated that trees that only have 2-way splits are less likely to overfit. Response : Please share the document. Overfitting can be addressed by meaningfully pruning the tree using a hold-out sample. The challenge with using 2-way trees is they are typically much deeper (ie with more levels). The lower levels are difficult to interpret. Also, would you know of any literature on the stability of a tree with many levels from one sample to another? Broadly, for building an interactive tree it is critical the modeler has the option to build custom sub-trees against business rules/tests or business segments. If this feature is resource intensive, then the user can always decide not to use it for a short project. Thanks, Nil

mnil · ‎11-23-2015

Hello, In the interactive decision tree window in SAS Enterprise Miner, it was possible to set different tree propoerties for different nodes in older versions (5.1) of SAS EM. This was particularly useful for developing custom sub-trees for various business segments. With EM 6.1 and later, this option has been withdrawn. Consequently, the user has to create multiple 'tree's in the 'diagram' and open individual 'interactive' windows for each one. Additionally, if I am unsatisfied with the 4-way split with variable X1, and want to look at the best 3-way split, then I need to close the interative window, and start afresh. Lastly, for large datasets, these cases leads to increased run-time and delays. Thanks, PS: Refer attached screen shots for details

Online Status	Offline
Date Last Visited	‎07-13-2017 05:19 AM

Re: Neural Network in EM 6.2 performing materially worse than other ap...

Neural Network in EM 6.2 performing materially worse than other approa...

Re: Change Decision Tree Properties in the 'interactive' window of SAS...

Change Decision Tree Properties in the 'interactive' window of SAS Ent...

Re: Neural Network in EM 6.2 performing materially worse than other ap...

Neural Network in EM 6.2 performing materially worse than other approa...

Re: Change Decision Tree Properties in the 'interactive' window of SAS...

Change Decision Tree Properties in the 'interactive' window of SAS Ent...