BookmarkSubscribeRSS Feed
aha123
Obsidian | Level 7
Called SAS and found out EM is prohibitively expensive for individual users. I wonder if I can produce the same results using SAS/STAT for data mining problems.
4 REPLIES 4
WayneThompson
SAS Employee
Hello Aha,

Please contact me at wayne.thompson@sas.com to discuss EM pricing relative to your budget and application needs. I am here to listen to your concerns at a minimum.

STAT provides a broad range of predictive/classifciation methods along with clustering and many other techniques. EM provide some algorithms like decision trees, gradient boosting, neural networks, memory based reasoning , etc not found in STAT along with the GUI for developing repeatable self documenting analysis very quickly with an emphasis on complete score code generation for deployment. Others from the list may have more practical reasons -- I work at SAS. Anway we hope you are using SAS and STAT is a great product --- EM requires it and includes the SAS code node for embeding STAT procedures into your EM analysis.

Wayne
oloolo
Fluorite | Level 6
you can find some popular Data Mining algorithms in STAT, such as KNN, LDA, CDA, various regression models, (regularized) Logistic, LASSO (v9.2), disjoint partition clustering and hierarchical clustering, SVD&PCA, Bagging, etc.

With some tweak, you can also implement algorithms such as AdaBoost, LSI & vector space model in TextMining, the efficiency is not on par with EM, though. consult my blog for more:
http://www.sas-programming.com

You get extra that is not available in STAT, such as Neural Network, CART, etc
deleted_user
Not applicable
Do you know if EM can do the Random Forest method for decision trees? I've studied it a little, and it's kind of cool.
goladin
Calcite | Level 5
Hi,

@OP: I believe that there are many of us who have written routines and sub-routines which can be used to produce similar results to the ones in Enterprise Miner. However, the computational time will become prohibitively high as the number of observations and variables increases. A suggestion is that to check whether your data is big enough to justify the use of EM.

@JQ: I believe you can do it via PROC SURVEYSELECT mixed with recursive partitioning algorithms.

Regards,
Murphy

SAS Innovate 2025: Save the Date

 SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!

Save the date!

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 4 replies
  • 1722 views
  • 0 likes
  • 5 in conversation