About BrianGaines

BrianGaines · ‎11-17-2022

@acordes, thank you for the kind feedback, I'm really glad that the article was useful for you!

BrianGaines · ‎10-08-2022

Hi @Nietzsche, Which version of SAS Studio are you using? Newer versions of SAS Studio on SAS Viya do have a dark theme. If you click your username in the upper-right corner to access the Application options --> Settings --> General --> Theme --> Choose a theme --> select either "Ignite" or "Dark" as the theme. I am not sure if this was available in earlier versions of SAS Studio (such as the 3.x series) but I will try to check. Best, -Brian

BrianGaines · ‎04-28-2022

@vteitler, To modify the PROC IMPORT code, first open the Import Wizard to import the file again, but do not run the code. Instead, click the Edit button to open the generated code in a new editor where you can edit it. After the code opens in the new editor, use the * symbol to comment out the GETNAMES=YES option. Now run the code, and it should import the data set successfully. You can then use the data set with the Summary Statistics task. Did that fix the issue?
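For reference, the edited code might look roughly like the sketch below. It assumes the wizard generated a FILENAME statement and a PROC IMPORT step with DBMS=XLSX; the file path is hypothetical, so keep whatever path your own generated code shows. The PROC CONTENTS step at the end is an addition, not part of the wizard output: it lists each column's type, which is useful because the Analysis Variables role accepts only numeric columns.

```sas
/* Hypothetical path: keep the path from your own generated code. */
filename reffile '/home/your-user-id/MIS445FRED-realGDP.xlsx';

proc import datafile=reffile
        dbms=xlsx
        out=work.import
        replace;
    * getnames=yes;
    /* The leading asterisk turns the statement above into a comment. */
    /* To switch name reading off explicitly, the statement is GETNAMES=NO; */
run;

/* Added check: list each column and its type (Num or Char). */
proc contents data=work.import;
run;
```

If the columns still show up as Char, the summary statistics task will not accept them as analysis variables.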

BrianGaines · ‎04-28-2022

Yes, one option is that you could delete rows 1-10 in the underlying MIS445FRED-realGDP.xlsx data set and re-import it. Another option is to modify the PROC IMPORT code to disable the GETNAMES option. If I do that, then the data set does import successfully, because you want the SAS data set to look like the following, in which there are two columns and the data values are in row 1:

BrianGaines · ‎04-28-2022

The Analysis Variables role in the task is only for numeric columns, but from your screenshot, it looks like WORK.IMPORT contains only 2 columns ("B" & "FRED Graph Observations"), and those are both character columns. What does the underlying data file, MIS445FRED-realGDP.xlsx, look like? In the Libraries section, if you double click on IMPORT under WORK to view the data set, does the data set look like what you would expect?

BrianGaines · ‎04-28-2022

Hi @vteitler, In the navigation pane on the left-hand side within SAS Studio, if you go to Libraries --> My Libraries --> Work, do you see the IMPORT data set there, as in the following screenshot?

BrianGaines · ‎04-28-2022

Hi @vteitler, Could you please include the code you are using to import the data, and the error message you receive when you run the task? Thanks, -Brian

BrianGaines · ‎04-25-2022

Hi @ZS2, Can you please provide a screenshot of your issue? Also, please see my solutions on a couple of other threads that might be related:
https://communities.sas.com/t5/SAS-Software-for-Learning/OnDemand-SAS-Studio-LOG-tab-missing/m-p/752628#M80
https://communities.sas.com/t5/SAS-Studio/Code-Editor-Does-Not-Display/m-p/630444#M8890
Does that fix your issue? If not, please let us know and we can try to help. Best, -Brian

BrianGaines · ‎03-24-2022

In a companion blog, "Accuracy vs. Interpretability? With Generalized Additive Models (GAMs), You Can Have Both," we provide an overview of generalized additive models (GAMs) and their beneficial features. GAMs are appealing because they strike a nice balance between flexibility and performance while maintaining a high degree of interpretability. This article provides the complete step-by-step instructions to reproduce the data analysis in the companion blog, to show you how easy it is to use Model Studio to train a GAM and compare it to other machine learning models.

To do that, let’s revisit an example from Lamm & Cai (2020) that trains a GAM to predict the probability that a mortgage applicant will default on a loan. See Example 2 in Lamm & Cai (2020) for more information about the example and the Hmeq data set. For a snapshot of this example, watch the demo video.

Data & Project Setup

First, let’s run the following code within SAS® Studio in SAS® Viya® to start a new session with a SAS® Cloud Analytic Services (CAS) server, assign the Mycas libref to the Casuser caslib, and create the data set used by Lamm & Cai (2020) as the in-memory table HmeqLC in the Casuser library. Here, the DATA step code differs from Lamm & Cai (2020) in that it renames the processed data set to distinguish it from the original Hmeq data set. It also includes the PROMOTE= data set option to create the CAS table with global scope so that it is accessible from Model Studio.

    cas;
    libname mycas cas caslib="casuser";
    data mycas.hmeqLC(promote=yes);
       set sampsio.hmeq;
       if cmiss(of _all_) then delete;
       if CLAge > 1000 then delete;
       part = ranbin(1,1,0.2);
    run;

Next, let’s switch to Model Studio, part of SAS® Visual Data Mining and Machine Learning, and use the GAM node[1] to train the GAM. To do this, access the Applications menu in the upper-left corner and select Build Models (Figure 1).

Figure 1. Open Model Studio via the Build Models selection in the Applications menu.
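Before creating the project, it may be worth a quick check from SAS Studio that the table loaded as intended. The sketch below is an optional addition, not part of the original analysis. It assumes the CAS session and the Mycas libref from the setup code are still active, and it tabulates the part and BAD variables: because part is drawn with a success probability of 0.2, roughly 20% of the rows should have part=1.

```sas
/* Assumes the CAS session and the mycas libref from the setup code are still active. */

/* One-way frequency tables for the partition indicator and the target. */
/* part=1 should cover roughly 20% of the rows, part=0 roughly 80%.     */
proc freq data=mycas.hmeqLC;
    tables part bad;
run;
```

If the step fails because the table cannot be found, rerun the setup code before switching to Model Studio.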
Within Model Studio, select New project and create a Data Mining and Machine Learning project with the HmeqLC data table (Figure 2).

Figure 2. Create a new Model Studio project with the preprocessed data set, HmeqLC.

In the Data tab, select the variable BAD and assign it to the role of Target (Figure 3).

Figure 3. Set the variable BAD as the target.

Next, select the part variable, assign it to the role of Partition, select Map Partition Levels, and map Training Level to 0 and Test Level to 1 (Figure 4).

Figure 4. Map the levels to the partition variable to match Lamm & Cai (2020).

Train a GAM with the GAM Node

Now you are ready to switch to the Pipelines tab to create a model building pipeline. To add a GAM node to your pipeline, right-click the Data node → Add child node → Supervised Learning → GAM (Figure 5).

Figure 5. Steps to add the GAM node to your pipeline.

The GAM node enables you to train a GAM without the need to write any code. The node contains a variety of options to customize your analysis, such as the interval target probability distribution[2], the target link function, effects options, and options related to the chosen selection method (Figure 6).

Figure 6. The GAM node includes many options to customize your model.

For example, you can use the following steps to train a model similar to the one in Lamm & Cai (2020):

1. Expand the Effects Options group.
2. Click the button next to the Bivariate Splines group to include bivariate splines in the model.[3]
3. Select Use all observations to construct spline basis functions.
4. Expand the Boosting Options group.
5. Set Learning rate to 0.2.
6. Set Early stopping stagnation to 10 and Early stopping tolerance to 0.0005.
7. At the top of the GAM node, click to run the node.

Results

After you use the node to train the model, right-click the GAM node and select Results to view the model’s results.
For the effects in the final model, the results include smoothing component plots for the spline terms and parameter estimates for the parametric terms. For example, the results include a smoothing component plot for the Spline(Debtinc) term (Figure 7). You can see that the probability of default is generally higher for an applicant who has a high debt-to-income ratio and the relationship is nonlinear.

Figure 7. The probability of default is generally higher for higher debt-to-income ratios.

Given the importance of the smoothing component plots, the GAM node includes options that govern the display of the plots. For example, you can select Use a common vertical axis to use the same y-axis range for each univariate spline plot. This facilitates comparisons of the magnitudes of the estimated spline effects on the predicted target.

In addition, the ability to interpret the parametric effects like you would with a logistic regression model is another way in which the GAM’s results are relatively easy to understand. For example, an applicant with more delinquent credit lines has a higher predicted probability of default, on average (Figure 8).

Figure 8. More delinquent credit lines typically correspond to a higher probability of default.

Model Comparison

Now that you have used the GAM node to train a GAM, let’s train a few other models for comparison, which you can easily do in Model Studio. Repeat the previous steps to add a Supervised Learning node to your pipeline to add the Gradient Boosting, Logistic Regression, Neural Network, and SVM nodes, and then click Run pipeline to train and compare all these models. Right-click the Model Comparison node to view the Results. If you sort the algorithms by Misclassification Rate, you see that the GAM ranks second (Figure 9).

Figure 9. The GAM outperforms some of the more complex models but maintains interpretability.
Even though the GAM has a slightly higher misclassification rate, it is more interpretable than the champion gradient boosting model and still outperforms other complex models such as SVM and neural network.

As you can see, Model Studio enables you to easily train a GAM alongside other common machine learning models and to compare model performance. This is done in just a few clicks and keystrokes to make analytics more accessible to a broader range of people, empowering them to use data to improve decision making.

Acknowledgments

The author is grateful to Wendy Czika, Michael Lamm, Weijie Cai, and Brett Wujek for their help and feedback during the development of the GAM node. Thanks also to Wendy Czika, Anna Brown, and Ed Huddleston for their helpful comments on an early draft.

Additional Resources

In addition to the resources linked in the article, the following resources provide further information about training GAMs with SAS:

- Introducing the GAMSELECT Procedure for Generalized Additive Model Selection (SAS Global Forum 2020 presentation)
- The GAMSELECT Procedure documentation
- The GAMMOD Procedure documentation

[1] Available in SAS Viya Stable 2020.1.1 (December 2020) and Long-Term Support 2021.1 (May 2021).
[2] A binary target variable, such as BAD, always uses the binary distribution.
[3] This option adds bivariate splines for all pairwise combinations of the interval inputs, whereas the GAM in Lamm & Cai (2020) includes only a few bivariate spline terms.

BrianGaines · ‎03-02-2022

Hi @EvelynLau, Thanks for sending that screenshot. Based on that, you are currently using SAS Viya 3.5. The version numbers that I referenced in my previous post are for Viya 4, so you are not using the latest version and your version does not have the one-hot encoding method in the Transformations node. How do you access Viya? Is it through Viya for Learners, or another way (such as through an employer)? Thanks, -Brian

BrianGaines · ‎02-26-2022

Hi @EvelynLau, Do you know which version of SAS Viya you are using? The one-hot encoding transformation method for class inputs was added to the Transformations node in SAS Viya Stable 2021.1.2 (June 2021) and Long-Term Support 2021.2 (November 2021). However, if you do not have access to those versions (or newer), then the article in the SAS Community Library that @sbxkoenk referenced is the recommended workaround. Best, -Brian

BrianGaines · ‎07-07-2021

Hi @yahuipeng, Awesome to hear! I'm sorry that you had to deal with such a frustrating issue but I'm glad it's resolved now. Please post in this community again whenever you have additional questions. Best, -Brian

BrianGaines · ‎07-07-2021

Hi @yahuipeng, I am not completely sure if this is the issue that you are facing, but last year another user reported a similar issue with the CODE tab. The issue was that the CODE tab was accidentally split into its own vertical window within the main work area, and then the section containing the CODE tab was minimized. Please see my reply on that thread for instructions to see if that is in fact the issue: https://communities.sas.com/t5/SAS-Studio/Code-Editor-Does-Not-Display/m-p/630444#M8890 Does that fix the issue? Best, -Brian PS I have no idea why my text is showing up in ALL CAPS once I post it, it does not appear that way in the editor. I'm not yelling at you. 🙂

BrianGaines · ‎07-06-2021

Hi @andreas_zaras, I'm glad to hear that it's working now! Thanks for letting me know. Also, you are technically promoting the data table to have global scope (so it is accessible across different CAS sessions, which enables you to use it in different Viya applications) instead of session scope (accessible only in that specific CAS session), but the CAS library itself is not being promoted. Just wanted to let you know for your own knowledge because I know that it took me a little while to wrap my head around this sort of stuff when I first started to use Viya. Best -Brian
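To make the scope distinction concrete, here is a minimal sketch that creates one session-scope table and one global-scope table in the same caslib. The table names class_session and class_global are hypothetical, SASHELP.CLASS is used only as a small stand-in data set, and the sketch assumes that no global table named class_global already exists in Casuser.

```sas
/* Start a CAS session and point the mycas libref at the Casuser caslib. */
cas;
libname mycas cas caslib="casuser";

/* Session scope (the default): the table is visible only in this CAS session. */
data mycas.class_session;
    set sashelp.class;
run;

/* Global scope: PROMOTE=YES makes the table visible to other CAS sessions, */
/* and therefore to other Viya applications such as Model Studio.           */
/* Assumption: no global table named class_global exists yet.               */
data mycas.class_global(promote=yes);
    set sashelp.class;
run;

/* List the tables in the caslib to review what was created. */
proc casutil;
    list tables incaslib="casuser";
quit;
```

In both cases the caslib itself is unchanged; only the scope of the individual table differs.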

BrianGaines · ‎07-05-2021

Hi @andreas_zaras, The key is that you need to "promote" the data table so that it has "global" scope, which enables you to access it from different SAS Viya Applications such as Model Studio. Please see my answer to this in another thread: https://communities.sas.com/t5/SAS-Studio/Data-does-not-appear-in-in-SAS-Viya/m-p/737535#M9944 Does that solve your issue? If not, please let me know. Thanks, -Brian

Online Status	Offline
Date Last Visited	‎02-29-2024 08:38 PM

Re: How to Develop SAS Code to Train a Deep Learning Model for Image C...

Re: Is it possible to change to dark theme in SAS Studio?

Re: running the summary statistics task on imported data

Re: running the summary statistics task on imported data

Re: running the summary statistics task on imported data

Re: running the summary statistics task on imported data

Re: running the summary statistics task on imported data

Re: OnDemand for Academics SAS studio LOG is not showing

How to Train Generalized Additive Models (GAMs) in Model Studio

Re: One-hot encoding transformation method for class inputs is "missin...

Re: Macro error while performing logistic regression of each independe...

Re: Model Retraining- Is it using the same parameters from your model?

Free Webinar: Getting Started With SAS® Visual Data Mining and Machine...

Webinar on July 28th, 11 AM – Noon ET entitled Getting Started with SA...

Re: How does SAS Handle errors when using the proc python function. Tr...

Re: One-hot encoding transformation method for class inputs is "missin...

Re: Saspy- python package install error in notebook

Re: Copy and paste SAS Studio code to Word Document

Re: OnDemand SAS Studio: LOG tab missing

Re: OnDemand SAS Studio: LOG tab missing

How to Train Generalized Additive Models (GAMs) in Model Studio

How to Develop SAS Code to Train a Deep Learning Model for Image Class...

Writing Custom SAS® Studio Tasks That Use CAS Actions for Advanced Ana...

Re: SAS Viya for Learners - Create data sets throgh datalines

Re: SAS Viya for Learners - Create data sets throgh datalines

SAS Global Forum 2020

SAS Global Forum 2019