BookmarkSubscribeRSS Feed
mnil
Calcite | Level 5

Hello,

 

I have a data for prediction (binary target) with 80K observation and 100 input variables. Methods like Gradient Boosting fit the data quite well with a validation Gini of over 70%.  When I fit a Neural Network with all 100 variables, I get a Gini of around 15% (both training and validation). When I do a variable selection and use 25-odd variables in NN, the validation Gini increases to 30% - which is still materially worse than the other models. I tried the default NN in EM 6.2 with the following changes:

 

1. Architecture : MLP

2. #Hidden Units : 2, 3, 5, 10, 20

3. Decay : 0, 0.05, 0.1, 0.5, 1, 5, 10, 25, 50. Decay seems to hardly impact model performance.

4. Standardization : Standard Deviation and Range

5. Sufficient #iterations to ensure model convergence. No other changes to optimization properties.

(Have also tried to play with some other properties like RBU, Act Function, Combination functions, direct connections etc without any material change in model)

 

Clearly the model is converging to a local minima; the 25-variable model to a slightly better minima. Am I missing some basic setting/feature which is leading to such poor NN models?

 

Thanks,

Nil

1 REPLY 1
mnil
Calcite | Level 5

Just realized through trial and error that Neural Network node in SAS EM 6.2 can not handle missing values. Imputing the missing values in my data resolved the issue. Would be greatful if someone can provide any SAS documentation on how NN node in SAS EM processes missing data.

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 622 views
  • 0 likes
  • 1 in conversation