About gabon

DougWielenga · ‎08-08-2017

SAS Enterprise Miner allows you to build multiple parallel modeling paths which you can connect back to a common Model Comparison node. You can also connect multiple sampling nodes to the same Input Data Source and build separate paths through multiple models from each and then compare all of the models in a Model Comparison node. Comparing competing models is very straightforward when all of the data comes from the same data set. Combining different sampling strategies and modeling strategies for a specific data set confounds the results in that you cannot be sure whether the impact is due to the sampling strategy or the modeling strategy. Are there number of observations the same in all samples? If not, comparing the models becomes more problematic. Are you using the same Training and Validate (and possibly Test) data set to sample from? If not, you are comparing different models on data from different populations which makes the results less clear. Since SAS Enterprise Miner is intended for modeling extremely large data sets, it is often not necessary to sample at all except in extremely rare event scenarios. There are analytical scenarios which can benefit from sampling but the best sampling strategy and modeling strategy for a given data set will not necessarily provide the best strategy for another data set. SAS Enterprise Miner provides a wealth of modeling methods precisely because different data sets can be 'best' solved by different modeling strategies. By 'best', I mean taking the business objective into account. Is the analyst hoping for more interpretability or simply looking for the best prediction according to some metric? Are the models which perform 'best' including variables which are inexpensive or difficult to come by when a simple model using readily available data performs just as well? The challenge in any scenario is that depending on your definition of 'best', you might get very different answers. Hope this helps! Doug

Ksharp · ‎06-16-2017

http://support.sas.com/kb/24/188.html

gabon · ‎02-08-2017

Thanks a lot for letting me know, I'm looking forward to it 🙂

gabon · ‎02-08-2017

Thank you very much! 🙂

Ksharp · ‎10-20-2016

OK. Here is my example . written by me,Matt and Arthur.T. http://support.sas.com/resources/papers/proceedings15/2785-2015.pdf

Maggie · ‎09-20-2015

Hi Gabon, Could you please show how you specified FREQ freq statement in proc tabulate? I am stuggling with exact same problem here. My proc tabulate reqult is diff from that of proc freq.

gabon · ‎02-09-2015

Thanks, at first I tried to recode the data with hundreds and zeros to get my mean correct, but after your advice that formatting can help, I found proc template for proc means and changed format for mean to percent and it worked, so I can have my data recoded to ones and zeros.

gabon · ‎02-08-2015

Thanks, it looks like it offers much more customization than proc freq, I will start using it more.

Online Status	Offline
Date Last Visited	‎06-21-2017 07:04 AM

Need help with GLMs and predictive modeling for car insurance data!

Re: Is there any way to implement one-class SVM for anomaly detection ...

Re: How to visualize and evaluate the results from the score node in E...

Is there any way to implement one-class SVM for anomaly detection in E...

How to visualize and evaluate the results from the score node in EM 14...

How to create and organize SAS EM 14.1 workflow for academic compariso...

Re: Proc tabulate with percent that do not add up to 100%?

Re: Proc tabulate with percent that do not add up to 100%?

Proc tabulate with percent that do not add up to 100%?

Re: Proc freq with colpercent but without percent

Re: Need help with GLMs and predictive modeling for car insurance data...

Re: Store variable names and labels in data set and assign labels auto...

Re: How to create and organize SAS EM 14.1 workflow for academic compa...

Re: Need help with GLMs and predictive modeling for car insurance data...

Re: Is there any way to implement one-class SVM for anomaly detection ...

Re: How to visualize and evaluate the results from the score node in E...

Re: Store variable names and labels in data set and assign labels auto...

Re: Proc freq vs proc tabulate, different results for colpctn?

Re: Proc tabulate with percent that do not add up to 100%?

Re: Proc freq with colpercent but without percent