05-22-2024
MelodieRush
SAS Employee
Member since
03-27-2012
- 169 Posts
- 324 Likes Given
- 31 Solutions
- 96 Likes Received
-
Latest posts by MelodieRush
Subject Views Posted 1178 01-10-2023 01:49 PM 2996 12-13-2022 02:55 PM 4179 06-16-2022 04:12 PM 3012 10-24-2021 12:18 PM 1180 07-29-2021 02:51 PM 889 06-17-2021 04:05 PM 719 06-17-2021 04:00 PM 4645 06-17-2021 03:58 PM 1040 06-03-2021 10:01 AM 1108 06-02-2021 05:24 PM -
Activity Feed for MelodieRush
- Got a Like for Re: unlock diagram in SAS E-Miner. 11-13-2024 11:09 PM
- Liked Enforce Responsible AI Best Practices: Trustworthy AI Life Cycle Workflow Available for SophiaRowland. 01-23-2024 01:35 PM
- Liked 2024 Customer Awards: Merck & Co., Inc - Innovative Problem Solver for yogesh2778. 01-22-2024 10:58 AM
- Liked SAS Studio Custom Steps - Coding Made Easier Using the Column Selector Control for GemmaR. 11-27-2023 10:18 AM
- Liked 3rd Place Winner - 2023 Customer Awards: ICON - Curious Thinker for DethWench. 06-14-2023 11:09 AM
- Liked 1st Place Winner - 2023 Customer Awards: Gilead Sciences - Community Uplift Award for sachinsontakke. 06-14-2023 11:09 AM
- Liked 2023 Customer Awards: Parexel - Curious Thinker for Bal23. 06-14-2023 11:08 AM
- Liked 2023 Customer Awards: Georgia Pacific - Curious Thinker for slcoyne86. 06-14-2023 11:07 AM
- Liked 3rd Place Winner - 2023 Customer Awards: Optum Serve - Innovative Problem Solver for bradylowe. 06-14-2023 11:07 AM
- Liked 2nd Place Winner - 2023 Customer Awards: Taurex Drill Bits - Rookie of the Year for DefaultDevers. 06-12-2023 03:38 PM
- Liked 1st Place Winner - 2023 Customer Awards: Blue Shield of California - Visual Storyteller for daisy_sumo. 06-12-2023 03:38 PM
- Liked 2023 Customer Awards: CNO Financial Group, Inc. - Innovative Problem Solver for Lquandt. 06-12-2023 02:43 PM
- Liked 2023 Customer Awards: USAA - Innovative Problem Solver for ntornabene. 06-12-2023 02:41 PM
- Liked 2023 Customer Awards: T-Mobile USA, Inc - Innovative Problem Solver for littleleaf2. 06-12-2023 02:40 PM
- Posted Getting Started with SAS® Visual Data Mining & Machine Learning Q&A, Slides, & On-Demand Recording on Ask the Expert. 01-10-2023 01:49 PM
- Got a Like for Free Webinar: Getting Started With SAS® Visual Data Mining and Machine Learning. 12-14-2022 03:00 PM
- Got a Like for Free Webinar: Getting Started With SAS® Visual Data Mining and Machine Learning. 12-13-2022 11:58 PM
- Posted Free Webinar: Getting Started With SAS® Visual Data Mining and Machine Learning on SAS Data Science. 12-13-2022 02:55 PM
- Posted How Do I Use Open Source with SAS® Viya®? Q&A, Slides, and On-Demand Recording on Ask the Expert. 06-16-2022 04:12 PM
- Posted Re: SAS EM: does clustering node have elbow method to select the optimal # of clusters? on SAS Data Science. 10-24-2021 12:18 PM
-
Posts I Liked
Subject Likes Author Latest Post 5 64 4 205 346 -
My Liked Posts
Subject Likes Posted 1 02-23-2017 05:21 PM 2 12-13-2022 02:55 PM 3 06-01-2021 10:53 AM 1 10-08-2020 10:05 AM 1 05-26-2020 10:56 AM -
My Library Contributions
Subject Likes Author Latest Post 0 0 2 2 5
10-23-2020
02:45 PM
1 Like
Did you miss the Ask the Expert session on missing data and how to resolve missing values in SAS? Not to worry, you can catch it on-demand at your leisure.
Watch the webinar
Watch this webinar to hear SAS expert Melodie Rush define missing values, why and when they occur and how to manage them. She will discuss functions, procedures and how products like SAS® Enterprise Guide®, SAS® Enterprise Miner™, SAS Studio and SAS® Viya® deal with missing values. During this webinar, you will learn:
The definition of a missing value.
Why missing values happen.
How to manage missing values in SAS.
Please leave a comment on this post and tell us how you handle missing values or how you implemented something new you learned during this webinar. It’s great to learn from fellow SAS users!
Here are the questions from the Q&A segment held at the end of the webinar. The slides from the webinar are attached.
What is the best way to impute categorical data?
Use a mode or a unique category. If using color, use mode and fill in everyone with that color. If doing gender for example, you could make a third category “unknown” for the missing values.
Why would I need imputation indicators?
If you get asked if it’s a real value, you need to know if it is or if you imputed it. That’s important to know so you can evaluate your imputations. It’s also important in predictive modeling. It can help you be more accurate in your predictions.
Can you run PROC STDIZE with a BY statement to apply different missing values to different BY-groups?
Yes: https://support.sas.com/documentation/onlinedoc/stat/132/stdize.pdf
For HPIMPUTE, if you use 'random' then how can you repeat results if the method is random? Problem if repeat the procedure and the imputed values will be different?
It looks like you can specify, or it generates a SEED value: https://go.documentation.sas.com/?cdcId=pgmsascdc&cdcVersion=9.4_3.5&docsetId=prochp&docsetTarget=prochp_hpimpute_overview.htm&locale=en
What is seed in proc MI, which value shall we use?
The seed allows you to replicate the answer you get. PROC MI is randomly imputing those values so if you don’t put a seed in there you won’t be able to replicate the results. If you run the code later, it will give you the same answer. It will allow you to start at the same place. I usually use 1234.
Any positive # - to provide repeatable results across runs.
SEED=number
Specifies a positive integer to start the pseudo-random number generator. The default is a value generated from reading the time of day from the computer’s clock. However, in order to duplicate the results under identical situations, you must use the same value of the seed explicitly in subsequent runs of the MI procedure.
The seed information is displayed in the "Model Information" table so that the results can be reproduced by specifying this seed with the SEED= option. You need to specify the same seed number in the future to reproduce the results.
Is there an easy way in the DATA step to create an indicator column that denotes whether a column was imputed (link you can in EM)?
You can use the missing function. If you have a lot of columns you can use an array to run through all the columns quickly and create a bunch of indicators. Some people create an indicator for the whole row or will create one for each column.
You can post this question out on the SAS Support Community to get other opinions.
SAS Programming - https://communities.sas.com/t5/SAS-Procedures/Creating-an-indicator-variable-based-on-missing-variable-values/td-p/138115
Viya Model Studio - https://communities.sas.com/t5/SAS-Communities-Library/Asked-amp-Answered-How-to-create-missing-value-indicator/ta-p/541156
An Example on creating missing value indicator variables - https://stats.idre.ucla.edu/sas/seminars/multiple-imputation-in-sas/mi_new_1/
For imputation, do you have any guidelines on whether to choose mean, mode, median, etc...?
It depends on your situation. You should try several options and see what works best for your data. It depends on what your outcome is. If it’s creating a predictive model you can try several options and see which one gives you the best prediction.
PROC fastclus as missing replacement method
Good point. In from documentation IMPUTE requests imputation of missing values after the final assignment of observations to clusters. If an observation that is assigned (or would have been assigned) to a cluster has a missing value for variables used in the cluster analysis, the missing value is replaced by the corresponding value in the cluster seed to which the observation is assigned (or would have been assigned). If the observation cannot be assigned to a cluster, missing value replacement depends on whether the NOMISS option is specified. If NOMISS is not specified, missing values are replaced by the mean of all observations in the DATA= data set having a value for that variable. If NOMISS is specified, missing values are replaced by the mean of only observations used in the analysis. (A weighted mean is used if a variable is specified in the WEIGHT statement.) For information about cluster assignment see the section OUT= Data Set. If you specify the IMPUTE option, the imputed values are not used in computing cluster statistics. If you also request an OUT= data set, it contains the imputed values.
Which method is best?
Tough question. It depends. Some of the methods can be used interchangeably. Some methods rely on assumptions - so we need to validate. To use say mean or median, it depends upon your knowledge of the data and its distribution.
Any imputation method relies on some heavy assumptions. Could you touch on some of the pitfalls of using imputation for missing data?
You must be careful. There are pitfalls if delete the row or if you don’t get it correct. Most conservative is to use mean or mode, but if you have a lot missing, that could lead to bad results. If most of data is missing, you should ask if you should use that variable at all. Any data problem is unique. You may need to try several options and see what works best for your data.
What features does SAS have to address cleaning up data in terms of reading different formats of inputs?
Please post this question to the SAS Programming Support Community and you’ll be able to get a lot of good opinions and examples.
In what circumstances would you implement replacement of indicators for missing values?
From a business context does a missing value have a "value" for an explanatory or predictive model.
What method do you recommend for longitudinal missing data? Multiple imputation? What do you think about refreshment samples?
Longitudinal data is related to other data points so when you start imputing you need to use specialized imputation methods. I don’t think multiple imputations will serve you well because it won’t consider the time feature of longitudinal data.
What are your thoughts on time series forecasting and missing dates due to COVID Business shut down? What are the best practices for the missing dates?
SAS/ETS and other time series analysis has data imputation methods built in. COVID provides many challenges. We may need to explore the time series. If the series resumes after COVID ends (hopefully), we may need to define an event.
Do you have a threshold of overall missing data for a variable that you have where you would 'throw the variable out' due to low/lack of coverage instead of using imputation?
I use 49%. If I’m imputing over half the data you want to think about it unless you know why they are missing.
What would be preferable to input with the mean or the median? What is the effect of extreme data points when using some imputing method?
If you have extremes maybe median would be preferable as it isn't biased as much as means.
How do I start learning SAS Viya?
We have some free training resources available right now: https://www.sas.com/en_us/training/offers/free-training.html
We also have a free trial available: https://www.sas.com/en_us/software/viya.html
We also have a Viya training path: https://support.sas.com/training/us/paths/viya.html
Are there any rules, or methods that can be used that define how much missing data is acceptable? Are there any common tools used to measure how much missing data can be affecting the overall data?
50% or more is often an acceptable percentage. Other experts will say 20-30%.
Here are a couple of research articles on the topic
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3701793/
https://academic.oup.com/jpepsy/article/39/2/151/883719
What procedures do you have for predictive analysis for MSRP or mortgage data?
Lots in SAS/STAT, Enterprise Miner (nodes), or Visual Data Mining and Machine Learning (regression methods, decision trees, GLM models, gradient boosting depending upon what products you have access to.
So best way to evaluate imputation method is to test the predictions against a training dataset?
I would use a training data set to create the model and then use a test or validation data set for testing. I score off the validation data set and use this so neural networks and decision trees don’t over fit. Use the test data set (hold out data that has not been used) to create the model and prove that the model is expendable to new data coming in.
In EM or VDMML try several options related to imputation methods in the model - what are the results? Do they make sense?
Sometimes I have '0' and Missing values where the '0' is also wrong entry or r I have a char like 'not applicable' which I would consider as a missing value. How do I handle that in SAS EG/SAS Code?
If I understand correctly, there is a separate step for data replacement. To replace one value for another like "0" with "." , you can use SAS code or Query Builder in EG.
Is there a way to let SAS automatically find subsets of your data where multiple values are not missing like the subset of customers with credit card? They of course have credit card number and type not missing.
Use the Missing Data Pattern Task available in SAS Studio. This will give you the ability to select the columns you are interested in and you can use Group By if you need to create subgroups.
Recommended Resources
Working with Missing Values Documentation
Managing Missing Data Using SAS® Enterprise Guide®
Video: SAS® Enterprise Miner® Tip: Imputing Missing Values
Multiple Imputation of Missing Data Using SAS®
Want more tips? Be sure to subscribe to the Ask the Expert board to receive follow up Q&A, slides and recordings from other SAS Ask the Expert webinars.
... View more
10-08-2020
10:05 AM
1 Like
Hi SAS Data Management Community,
I’m presenting a live “Ask the Expert” webinar on October 20, 1 PM – 2 PM ET entitled Why Do I Have Missing Data and How Do I Fix it? Join me for this webinar where you will learn what to do when you have missing values in your data. I’ll define missing values, why and when they occur and how to manage them. I’ll discuss functions, procedures and how products like SAS ® Enterprise Guide ® , SAS ® Enterprise Miner ™ , SAS Studio and SAS ® Viya ® deal with missing values. At the end of the webinar, I’ll answer your questions during the Q&A session.
Join Me for this Session on Handling Missing Values in SAS
You will learn:
The definition of a missing value.
Why missing values happen.
How to manage missing values in SAS.
Click here to register for the webinar.
Want more tips? Be sure to subscribe to the Ask the Expert board to receive follow up Q&A, slides and recordings from this and other SAS Ask the Expert webinars.
Can't join the live event? You can view this and other Ask the Expert sessions on-demand here.
... View more
10-08-2020
10:02 AM
Hi SAS Data Mining and Machine Learning Community,
I’m presenting a live “Ask the Expert” webinar on October 20, 1 PM – 2 PM ET entitled Why Do I Have Missing Data and How Do I Fix it? Join me for this webinar where you will learn what to do when you have missing values in your data. I’ll define missing values, why and when they occur and how to manage them. I’ll discuss functions, procedures and how products like SAS ® Enterprise Guide ® , SAS ® Enterprise Miner ™ , SAS Studio and SAS ® Viya ® deal with missing values. At the end of the webinar, I’ll answer your questions during the Q&A session.
You will learn:
The definition of a missing value.
Why missing values happen.
How to manage missing values in SAS.
Click here to register for the webinar.
Want more tips? Be sure to subscribe to the Ask the Expert board to receive follow up Q&A, slides and recordings from this and other SAS Ask the Expert webinars.
Can't join the live event? You can view this and other Ask the Expert sessions on-demand here.
... View more
09-17-2020
08:40 AM
What type of model are you trying to export PMML code for? Not all modeling types are supported in PMML.
... View more
08-25-2020
06:59 PM
2 Likes
Did you miss the Ask the Expert session on integrating SAS Viya and open source? Not to worry, you can catch it on-demand at your leisure.
Watch the webinar
SAS delivers an open analytics platform, built on the latest cloud technology and accessible from the interface or in the coding language of your choice, giving you the freedom to experiment and create. Combining the power of SAS with open source technologies, you can unify disparate tools and analytic assets into a streamlined, collaborative environment – fostering productivity, business agility and tangible results. This webinar will help you learn the many ways SAS ® Viya ® integrates with open source.
During this webinar, you will learn how to:
Access the power of SAS using your existing skills, like SAS, open source or other programming skills.
Use Python or R in the analytical flow of pipelines.
Use Python or R through the SWAT package.
Here are the questions from the Q&A segment held at the end of the webinar. The slides from the webinar are attached.
Which nodes does the Compute node represent i.e. CAS Controller or the MicroServices or the SPRE node?
CAS controller
What does CAS stand for?
Cloud Analytics Server and that’s the engine that SAS Viya runs on.
Is there integration with SAS 9 and open source?
Yes, there is integration with SAS 9. You can find more on that on GitHub.
What version of R will work with SAS Viya?
SAS Viya will work with any version of R if your pointers are set up to point to the right version.
What version of Python will work with SAS Viya?
SAS Viya will work with any version of Python.
Can I use both Python and R in the same Jupyter Notebook?
No, you would use one or the other in the Jupyter Notebook. They are specific to the language you want to use.
Where do I download the SWAT Package?
All the packages are out on GitHub.
Can we do these things in the Viya for Learners environment as well?
Yes, Integration is available in the Viya for Learners environment. More information on Viya for Learners can be found here.
What version of Viya are you using in the demonstration?
Viya 3.5
Viya can run Python or R on CAS. Can Scala be input into the CAS also?
To clarify - a data scientist can, within a single Jupyter Notebook session (for example), run Python or R packages natively through Python or R AND run CAS Actions using Python or R syntax so, for example, in one session you can run a Python GB model in Python and a CAS GB model using CAS actions and compare the two. CAS actions are distributed in-memory.
What SAS Viya version can integrate with open source?
All SAS Viya versions support open source integration, but there are further enhancements and capabilities added with each release.
Is it possible to do autotuning in an open source coding like in a pipeline supervised learning options?
Yes. On GitHub there is a data science notebook that has examples of doing autotuning within Jupyter Notebooks.
Can you please give more details on pickle models, exported from Python. Can I use those pickle files in SAS pipelines?
There are examples of pickle models on GitHub.
How do the various integrations of open source handle open source version dependencies?
SAS Viya is agnostic to versions of open source, but you want to be careful because the versions are different so it would be a good idea to standardize on a version within your company.
Does SAS offer Python and/or R instruction training and certification?
There are specific classes on R/Python in the context of working with CAS. You can find more info on those classes here.
What would be a good resource to use Python within Model Builder, including importing packages in the Viya instance?
Here are a few resources that you may find useful:
•GitHub Open Source Code Node Resources
•Executing Open Source Code in SAS Visual Data Mining and Machine Learning Pipelines (video)
•Open Source Code Node Documentation
•Open Source Code Node Example
For Dynamic Data streaming (transaction data in particular), which python package should I use?
There are several packages on GitHub that work with SAS Event Stream Processing solution.
How are Python execution environment managed for each open source code node? Can different nodes use different sets of requirements?
Yes, management is dependent on your company requirements. Every run of the Open Source Code node is a separate invocation of the Python distribution on the Compute Server.
Can open source code node import private Python packages?
You can import any packages that are installed on the Python distribution that is required on the Compute Server for Open Source Code node to work.
Recommended Resources
SAS® Viya® Enablement
Programming for SAS® Viya®
An Introduction to SAS® Viya® 3.5 Programming
Want more tips? Be sure to subscribe to the Ask the Expert board to receive follow up Q&A, slides and recordings from other SAS Ask the Expert webinars.
... View more
Labels:
08-03-2020
01:32 PM
Hi Data Mining Community,
I’m presenting a live “Ask the Expert” webinar on August 18, 11 AM – Noon ET. SAS delivers an open analytics platform, built on the latest cloud technology and accessible from the interface or in the coding language of your choice, giving you the freedom to experiment and create. Combining the power of SAS with open source technologies, you can unify disparate tools and analytic assets into a streamlined, collaborative environment – fostering productivity, business agility and tangible results. Join me as I discuss the many ways SAS Viya integrates with open source.
You will learn how to:
Access the power of SAS using your existing skills, like SAS, open source or other programming skills.
Use Python or R in the analytical flow of pipelines.
Use Python or R through the SWAT package.
Click here to register for the webinar.
Want more tips? Be sure to subscribe to the Ask the Expert board to receive follow up Q&A, slides and recordings from this and other SAS Ask the Expert webinars.
Can't join the live event? You can view this and other Ask the Expert sessions on-demand here
... View more
06-18-2020
09:24 AM
Aligned Box Criterion is available in the HP Cluster node in SAS Enterprise Miner. It will determine the optimum number of clusters. Here's a video that talks about using this option, along with using CCC and gap methods https://www.youtube.com/watch?v=NZpNTkfT47c
... View more
06-02-2020
10:30 AM
Slides and Q&A for this session have been added here.
... View more
06-02-2020
10:27 AM
1 Like
Did you miss the Ask the Expert session on How Do I Get Started with SAS Visual Data Mining and Machine Learning? Not to worry, you can catch it on-demand at your leisure.
Watch the webinar
This session provides information on how SAS Visual Data Mining and Machine Learning is designed for the data scientist, statistician and advanced business analyst.
Whether you want to program or point and click, SAS Visual Data Mining and Machine Learning provides innovative algorithms and fast, in-memory processing.
This session covers its capabilities with an accompanying demonstration showing the components of SAS Visual Data Mining and Machine Learning.
You will learn how to:
Interactively program in a web-based development environment.
Use intelligent automation, including Automatic Feature Engineering node for automatically cleansing, transforming and selecting features for models.
Use natural language generation.
Incorporate embedded support for Python and R languages.
Use highly scalable, distributed in-memory analytical processing.
Here are some highlighted questions from the Q&A segment held at the end of the session for ease of reference.
Does SAS Visual Statistics come with SAS Visual Data Mining and Machine Learning?
As part of the VIYA platform when you license SAS Visual Data Mining and Machine Learning you also have SAS Visual Analytics and SAS Visual Statistics.
Can I add SAS Visual Data Mining and Machine learning to my current SAS install?
SAS Visual Data Mining and Machine Learning is a part of the new VIYA platform. SAS VIYA and SAS 9.4 can be integrated and interact with each other and they are a separate installation and configuration.
Is it possible to download this data?
This data is available with SAS Enterprise Miner as a sample dataset. You can find it in the SAMPSIO library.
Is autotuning available in the Visual Interface?
In the current version of SAS VDMML 8.5 autotuning is available through both the visual and programming in SAS VIYA or Jupyter Notebooks or through the task in SAS Studio.
Is there integration with R?
Yes, Python, Java, R and Lua are supported. You can call SAS VDMML using your preferred language in our world-class, governed environment. And using REST APIs, you can add the power of SAS Analytics to your custom applications.
Is this part of SAS VIYA?
Yes. SAS Visual Data Mining and Machine Learning (VDMML) is one of the SAS Viya products.
How do you get to SAS Drive?
Your SAS Administrator can provide you with the URL or Web address to give you access to SASDrive.
What is the limitation of running machine learning models in Visual Analytics like this, in comparison to running Model Studio. I understand Model Studio is the full-brown ML platform?
The visual interface shown give you access to Machine Learning techniques through a point and click visual environment. Many of the options are available in this interface. Model Studio or pipelines has additional options including the ability to build our entire Machine Learning process flows through point and click and the integration of open source and Natural Language generation. Programming will provide more options including both Procedures and CAS Actions.
What are the measures to compare the models?
You have several options with VDMML. You can compare with Lift, KS, and Misclassification, just to list a few.
Can we build models without VDMML, just using visual statistics and using python?
Absolutely, You can build models with point and click in Visual Statistics or you could open up Jupyter Notebooks and model with python.
What do I use to manage clinical chemistry patient data? Patient had 3 different data results from 1) normal healthcare site; 2) community general hospital site and 3) area ER site that are not consistent.
SAS Viya has several tools for helping you manage data. Within the interface showed you there are two menu options under the Hamburger or Action menu, Manage Data and Prepare Data. Manage Data allows you to bring data (including SAS, database data, or local files like .csv) into the CAS environment. Prepare Data allows you to join or append multiple data sets, transform columns, use custom code, filter, partition and more.
After using the template that automatically estimate hyper parameters, what the indication for the need for further tuning?
I recommend that you examine diagnostic statistics to determine how well the model is performing. Misclassification rate or ROC for example.
How does SAS Enterprise Miner fit into the picture?
Enterprise Miner(EM) is a client/server available with our SAS 9 platform. If you have a newer version of Enterprise Miner you can integrate with Viya Visual Data Mining and Machine Learning if its licensed. You can run Viya models from within EM and you can run batch code from EM and score code from EM in Viya
I have not used Visual Analytics but I have been using Enterprise Miner. What advantages Visual Analytics provide over EM on machine learning?
Visual Data Mining and Machine Learning is provided on the Viya platform which mean it is cloud based and the data is loaded in memory. It is utilizing the newest architecture and technology. VDMML allows easier integration with open source. Also Visual Data Mining and Machine learning, provide more machine learning algorithms and options for those algorithms along with the ability to hypertune the model. When using tools in Viya you have one interface (the browser) to use to access and modify your data and move between your analytic tasks from modeling to visualizing results.
We are using SAS Enterprise Guide, what shall we do if we want to purchase Viya? Is it a different product?
SAS Enterprise Guide can be an interface to the SAS Viya environment. You would need to license, install and configure Viya to work with SAS Enterprise Guide. Please contact your SAS representative to inquire about purchasing a license for SAS Viya. We can make sure to pass your name to the Sales representative if you'd like.
What is ML with Explanation?
Explanation provides natural language to explain the results of your analysis. An example showed were the insights shown when doing pipeline comparison.
Is this SAS Visual Dating Mining and Machine Learning useful for data cleaning?
VDMML is part of the Viya platform which has components for data cleaning. VDMML itself can do things like imputation and feature transformation, but data management can be done with SAS Studio and Prepare Data.
How about build models module? I checked our environment and we don’t have 'build models' whereas we do have 'manage models'.
It appears you don't have VDMML licensed in your environment.
Is it possible to get access to SAS visual analytics to learn basic machine learning techniques for about two weeks?
Use this link to start a free trial. Orange button at the top right: https://www.sas.com/en_us/software/visual-data-mining-machine-learning.html
Documentation for Bayesian Networks?
Link to documentation for Bayesian Networks
Can VIYA do ensembles?
Yes, Visual Data Mining and Machine Learning can ensemble your models. Ensemble is one of the nodes available in pipelines.
Is there Python integration in SAS visual analytics? Yes. Viya in general has Python integration. You can integrate with Python in Pipelines, SWAT package in Jupyter Notebooks, and Proc FCMP in SAS Studio.
Does python embedded in SAS Viya or does it have to be install where the SAS engine is?
Python needs to be installed with SAS.
Do you have documentation on VDMML?
https://support.sas.com/en/software/visual-data-mining-and-machine-learning-support.html
... View more
Labels:
05-28-2020
04:48 PM
Slides and Q&A will be posted next week once we have complied all the Q&A.
... View more
05-26-2020
10:56 AM
1 Like
Hi Data Mining and Machine Learning Community,
I’m presenting a live “Ask the Expert” webinar on May 27, 11:00 – 12:00 p.m. ET How Do I Get Started with SAS ® Visual Data Mining and Machine Learning? Please join me.
About the webinar:
SAS Visual Data Mining and Machine Learning is designed for the data scientist, statistician and advanced business analyst. Whether you want to program or point and click, SAS Visual Data Mining and Machine Learning provides innovative algorithms and fast, in-memory processing. This session covers its capabilities and has an accompanying demonstration that provides a view into the components of SAS Visual Data Mining and Machine Learning
What you will learn
Interactive programming in a web-based development environment
Intelligent automation including Automatic Feature Engineering node for automatically cleansing, transforming, and selecting features for models.
Natural language generation
Embedded support for Python & R languages
Deep learning with Python (DLPy)
SAS® procedures (PROCs) & CAS actions
Highly scalable, distributed in-memory analytical processing
Are there any other specific questions you’d like covered? Let me know by responding to this thread.
Register Now to join me for this webinar
Want more tips? Be sure to subscribe to the Ask the Expert board to receive follow up Q/A, slides and recordings from other SAS Ask the Expert webinars. To subscribe, select Subscribe from the Options drop down button above the articles.
Can't join the live event? You can view this and other Ask the Experts sessions on-demand here.
... View more
09-25-2019
05:18 PM
Q&A has been updated and slides uploaded here
... View more
09-13-2019
01:31 PM
Hi Data Mining Community,
I’m presenting a live “Ask the Expert” webinar on September 20, 1 - 2 p.m. ET on Getting Started with SAS Visual Statistics. I hope you’ll join me.
SAS Visual Statistics is designed for statisticians and business analysts. You can quickly create predictive models through the visual point-and-click interface or through programming using SAS Studio or other programming interfaces. This session covers the what, why, and how for using SAS Visual Statistics 8.4.
I plan to:
Interactively create and refine descriptive and predictive models – for insights fast and bright as lightning.
If you’re a business analyst, statistician or data scientist who explores data or builds models, this webinar is for you.
Learn about SAS Visual Statistics and its capabilities (including its seamless integration with SAS Visual Analytics), including how to:
Apply statistical modeling techniques to explore data visually.
Create models, including decision trees and logistic regression.
Compare models by looking at different criteria
Show clustering and apply to create segmented models
Are there any other specific questions you’d like covered? Let me know by responding to this thread.
Register Now to join me for this webinar
... View more
08-16-2019
10:18 AM
CAS Actions are the tools used to interact with data on the CAS Server as part of the Viya platform. CAS actions are similar to traditional SAS procedures and in fact are the underlying units for SAS Procedures in Viya. CAS Actions act and behave more like the methods and options used in Open Source. In Viya you can choose in many instance to use either the SAS Procedure or the CAS Actionset. For example to create a Random Forest model I could use PROC FOREST or the CAS Actionset decisionTree with the CAS action forestTrain. Both will give me the same results. A SAS programmer may feel more comfortable using the PROCs where as an Open Source Programmer may feel more comfortable with the CAS Actions.
... View more
08-06-2019
02:22 PM
The Q&A and slides for this session on August 1, 2019 have been added to an article you can find here.
... View more