Building models with SAS Enterprise Miner, SAS Factory Miner, SAS Visual Data Mining and Machine Learning or just with programming

Data Mining without using EM

Posts: 57

Data Mining without using EM

Called SAS and found out EM is prohibitively expensive for individual users. I wonder if I can produce the same results using SAS/STAT for data mining problems.
SAS Employee
Posts: 31

Re: Data Mining without using EM

Hello Aha,

Please contact me at to discuss EM pricing relative to your budget and application needs. I am here to listen to your concerns at a minimum.

STAT provides a broad range of predictive/classifciation methods along with clustering and many other techniques. EM provide some algorithms like decision trees, gradient boosting, neural networks, memory based reasoning , etc not found in STAT along with the GUI for developing repeatable self documenting analysis very quickly with an emphasis on complete score code generation for deployment. Others from the list may have more practical reasons -- I work at SAS. Anway we hope you are using SAS and STAT is a great product --- EM requires it and includes the SAS code node for embeding STAT procedures into your EM analysis.

Occasional Contributor
Posts: 17

Re: Data Mining without using EM

you can find some popular Data Mining algorithms in STAT, such as KNN, LDA, CDA, various regression models, (regularized) Logistic, LASSO (v9.2), disjoint partition clustering and hierarchical clustering, SVD&PCA, Bagging, etc.

With some tweak, you can also implement algorithms such as AdaBoost, LSI & vector space model in TextMining, the efficiency is not on par with EM, though. consult my blog for more:

You get extra that is not available in STAT, such as Neural Network, CART, etc
Posts: 0

Re: Data Mining without using EM

Do you know if EM can do the Random Forest method for decision trees? I've studied it a little, and it's kind of cool.
Posts: 24

Re: Data Mining without using EM


@OP: I believe that there are many of us who have written routines and sub-routines which can be used to produce similar results to the ones in Enterprise Miner. However, the computational time will become prohibitively high as the number of observations and variables increases. A suggestion is that to check whether your data is big enough to justify the use of EM.

@JQ: I believe you can do it via PROC SURVEYSELECT mixed with recursive partitioning algorithms.

Ask a Question
Discussion stats
  • 4 replies
  • 5 in conversation