Good afternoon everyone. I’m very excited to share details about our updated version of SAS Visual Data Mining and Machine Learning on SAS Viya.
Before I dive into the details, it’s important to understand the thinking behind this revolutionary release. As the need for machine learning as skyrocketed, so has the need to access methods from multiple entry points. Organizations are often made up of a diverse set of individuals with varying backgrounds in computer science, statistics, machine learning, and business. Accompanying these backgrounds is the myriad of analytical ‘tools’ that you need to solve modern business problems. Examples may include SAS, which includes the SAS language and our graphical users interfaces. You may have a background in Python, R, Java or Lua. You may even be an application developer who wants to build applications from the ground-up using APIs. No matter your skillset, you should be able to use your language and interface of choice.
Welcome SAS Visual Data Mining and Machine Learning on SAS Viya! At its core, SAS Viya is built upon a common in-memory analytic framework, using ‘actions’. These actions are atomic analytic activities, such as selecting variables, building models, generating results, and outputting score code. The actions or components are modular by design. We have exposed these actions via SAS Procedures, Python, Java, Lua and RESTful APIs (R will be released soon). No matter the language or interrace, you WILL get the same answers for the same actions, whether you use a procedure or a python call into SAS Viya. Start your analysis in SAS, and then continue it in Python, all using the same in-memory data – no duplication.
We have also exposed these analytic actions with the Visual Analytics framework. Now you can build a two-layer Neural Network using LBFGS and compare it to a Gradient Boosting model with 500 trees, all within an interactive environment.
There’s so much more to this offering. We’ve enhanced our capabilities in autotuning so that you intelligently search the hyperparameter space for the best combination of values that addresses the model objective – that is, misclassification, Lift, KS, and so on. We’ve added in capabilities in high-frequency analytics like Robust PCA (RPCA), Moving Window PCA, and the capability to detect outliers using Support Vector Data Description (SVDD).
Feel free to come meet with myself and other folks in R&D at SAS Global Forum this year. You will see this exciting new update on full display, and in many whitepapers to follow.
P.S. the complete set of analytics supported in this release are as follows:
Data Wrangling |
Modeling |
Binning |
Logistic Regression |
Cardinality |
Linear Regression |
Imputation |
Generalized Linear Models |
Transformations |
Nonlinear Regression |
Transpose |
Ordinary Least Squares Regression |
SQL |
Partial Least Squares Regression |
Sampling |
Quantile Regression |
Variable Selection |
Decision Trees |
Principal Components Analysis (PCA) |
Forest |
K-Means Clustering |
Gradient Boosting |
Moving Window PCA |
Neural Network |
Robust PCA |
Support Vector Machines |
|
Factorization Machines |
|
Network / Community Detection |
|
Text Mining |
|
Support Vector Data Description |
Great post, Jonathan.
Thanks,
Doug
Join us for SAS Innovate April 16-19 at the Aria in Las Vegas. Bring the team and save big with our group pricing for a limited time only.
Pre-conference courses and tutorials are filling up fast and are always a sellout. Register today to reserve your seat.
Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.
Find more tutorials on the SAS Users YouTube channel.