01-03-2023
RadhikhaMyneni
SAS Employee
Member since
04-09-2015
- 33 Posts
- 3 Likes Given
- 4 Solutions
- 29 Likes Received
Activity Feed for RadhikhaMyneni
- Liked LightGBM in SAS Model Studio for chmedi. 01-02-2023 02:55 PM
- Liked Working with Dates in Automated Machine Learning Pipeline in Model Studio for AnujaNagpal. 11-10-2022 04:17 PM
- Posted Re: Integrating Python in SAS Entreprise Miner on SAS Data Science. 09-15-2022 08:41 AM
- Posted Re: Integrating Python in SAS Entreprise Miner on SAS Data Science. 09-14-2022 04:35 PM
- Posted Re: Integration Open Source whit SAS in Model Manager and Model Studio on SAS Data Science. 08-25-2022 12:49 PM
- Got a Like for Re: VDDML - Python Open Source Node and Plots. 06-15-2021 05:38 AM
- Posted Re: VDDML - Python Open Source Node and Plots on SAS Data Science. 06-14-2021 11:59 AM
- Posted Re: Unable to run Open Source Code (#Python) in SAS VDMML on SAS Data Science. 03-26-2021 03:11 PM
- Liked MLPA web application - accelerating the model lifecycle through APIs for peciro. 01-19-2021 08:42 AM
- Got a Like for Re: Encode Y in Open Source Code Node. 12-10-2020 11:30 AM
- Posted Re: Encode Y in Open Source Code Node on SAS Data Science. 11-17-2020 08:46 AM
- Posted Re: using python virtual environments in Model Studio\VDMML on SAS Data Science. 08-27-2020 11:28 AM
- Posted Re: Asked & Answered:How to change data source or data partition for an existing project in Mode on SAS Communities Library. 04-13-2020 09:22 AM
- Posted Re: Configuration venv Python for SASModelStudio on SAS Data Science. 03-31-2020 06:15 PM
- Got a Like for Re: NEW Automation and Interpretability - Visual Data Mining and Machine Learning 8.5. 03-01-2020 02:56 PM
- Posted Re: NEW Automation and Interpretability - Visual Data Mining and Machine Learning 8.5 on SAS Data Science. 02-25-2020 09:57 AM
- Got a Like for Re: NEW Automation and Interpretability - Visual Data Mining and Machine Learning 8.5. 02-21-2020 03:07 PM
- Posted Re: NEW Automation and Interpretability - Visual Data Mining and Machine Learning 8.5 on SAS Data Science. 02-21-2020 02:09 PM
- Tagged Asked & Answered: How can I pass data from an Open Source Code node to next node in SAS Model Studio on SAS Communities Library. 12-18-2019 11:34 AM
- Posted Asked & Answered: How can I pass data from an Open Source Code node to next node in SAS Model Studio on SAS Communities Library. 12-18-2019 11:33 AM
09-15-2022
08:41 AM
Hi Filbert,
Run the javac command from the directory that contains the src directory. In the example, that is C:\SGF2015\OpenSrcIntegration.
To edit the file as an administrator, do not double-click it to open it in the default editor. Instead, right-click the file in File Explorer and see whether you can select Run as Administrator. If that does not work, right-click Windows Command Prompt, select Run as Administrator, change to the directory that contains the file, and open it with Notepad. sasv9.cfg is a text file, so once you have access you should be able to edit it with Notepad.
Radhikha
09-14-2022
04:35 PM
Hi Filbert,
The Java source file(s) (SASJavaExec.java) should be in the src/dev directory. When you compile with the "javac src/dev/* -d bin" command, the compiled output goes into the bin directory.
For your second question, since it is a personal desktop, you can probably open the sasv9.cfg file as an administrator. On Windows you typically do this by right-clicking the file and selecting Run as Administrator.
Hope this helps,
Radhikha
08-25-2022
12:49 PM
Here is a blog post on building and deploying a machine learning pipeline using SAS and Python. It covers registering a Python model built in the Open Source Code node (in Visual Machine Learning) with Model Manager.
06-14-2021
11:59 AM
1 Like
Hi Andreas,
For the plot to show up in the node results, you need to: (1) save it in jpg, png or gif format, (2) name it with the rpt_ prefix, for example rpt_treeplot.png or rpt_treeplot.jpg, and (3) save it in the dm_nodedir folder (dm_nodedir is a variable available in the node editor that points to a temporary working folder).
Here is some sample Python code that does this:
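import matplotlib.pyplot as plt

# Plot model residuals (pred holds the model predictions for dm_inputdf)
plt.scatter(pred, pred - dm_inputdf[dm_dec_target])
plt.axhline(y=0, color='r')
plt.title('Residual plot')
plt.ylabel('Residual')
# Save with the rpt_ prefix in dm_nodedir so the plot appears in the node results
plt.savefig(dm_nodedir + '/rpt_residuals.png')
plt.close()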
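The naming rules above can also be wrapped in a small helper. This is only a sketch: `report_image_path` is a hypothetical name, not something the node provides, and a temporary directory stands in for dm_nodedir so that the snippet runs outside Model Studio.

```python
import os
import tempfile

# Image formats the node results can display (jpg/jpeg, png, gif).
ALLOWED_EXTENSIONS = {".jpg", ".jpeg", ".png", ".gif"}


def report_image_path(node_dir, name):
    """Return node_dir/rpt_<name> after checking the image extension.

    Hypothetical helper: adds the rpt_ prefix when it is missing.
    """
    ext = os.path.splitext(name)[1].lower()
    if ext not in ALLOWED_EXTENSIONS:
        raise ValueError(f"unsupported image format: {ext!r}")
    if not name.startswith("rpt_"):
        name = "rpt_" + name
    return os.path.join(node_dir, name)


# Stand-in for dm_nodedir, which only exists inside the node editor.
demo_nodedir = tempfile.mkdtemp()
demo_path = report_image_path(demo_nodedir, "residuals.png")
print(os.path.basename(demo_path))
```

Inside the node, the same helper would be called as plt.savefig(report_image_path(dm_nodedir, 'residuals.png')).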
Hope this helps,
Radhikha
03-26-2021
03:11 PM
When an error occurs in the Open Source Code node, the generic message Encountered error code 1 when executing PYTHON program is highlighted in the log. The detailed error messages that help pinpoint the problem, that is, the messages coming from Python itself, are displayed above this generic message; scroll up to view them.
Additional debugging questions are documented here:
https://go.documentation.sas.com/?cdcId=vdmmlcdc&cdcVersion=8.3&docsetId=vdmmlref&docsetTarget=n09n0yjpv48gddn0z5gv364f95qv.htm&locale=en
11-17-2020
08:46 AM
1 Like
Hi,
When you say it failed, can you tell whether the failure was in Python or afterwards in the Open Source Code node? Did you place the node in the Supervised Lane?
Also, can you post the failure messages from the log? When an error occurs in the Open Source Code node, the generic message Encountered error code 1 when executing Python program is highlighted in the log. The detailed error messages that help pinpoint the problem are displayed above this generic message, and you can scroll up to view them. You can also search for the first occurrence of the executeProcess string in the log to find the start of these detailed error messages.
I would expect your use case to work, because the only two columns needed in the dm_scoreddf data frame, which the node expects you to create (if it is in the Supervised Lane), are the posterior probabilities. I am assuming here that you have a binary or nominal target. It should not matter that you label-encoded the target and used it in the Python code.
Radhikha
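The log search described in this reply can be sketched in Python. This is a rough illustration that assumes the log has been saved as plain text; `extract_error_details` is a hypothetical helper, and the sample log lines are made-up placeholders rather than real Model Studio output.

```python
GENERIC_ERROR = "Encountered error code 1 when executing"
START_MARKER = "executeProcess"


def extract_error_details(log_text):
    """Return the lines from the first START_MARKER line up to,
    but not including, the generic error line."""
    lines = log_text.splitlines()
    start = next((i for i, line in enumerate(lines) if START_MARKER in line), None)
    if start is None:
        return []
    end = next(
        (i for i in range(start, len(lines)) if GENERIC_ERROR in lines[i]),
        len(lines),
    )
    return lines[start:end]


# Placeholder log text for demonstration only.
sample_log = "\n".join([
    "(placeholder line written before the node runs)",
    "executeProcess (placeholder for the real log line)",
    "NameError: name 'foo' is not defined",
    "Encountered error code 1 when executing PYTHON program",
])
for line in extract_error_details(sample_log):
    print(line)
```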
08-27-2020
11:28 AM
Currently the Open Source Code node in Model Studio does not have a way to invoke a specific virtual environment in the Python install, but I do agree that it would be useful and will try to look into adding it in a future release.
When the node executes, the configured Python executable is used to make a call similar to the code below, where <fileToRun> is constructed on the fly, mostly from the code in the node editor.
python <fileToRun>
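As an illustration only (this is not the node's actual implementation), the same pattern of writing code to a file and passing it to a configured interpreter can be sketched with the standard library. `run_generated_code` is a hypothetical name, and `sys.executable` stands in for the configured Python executable.

```python
import os
import subprocess
import sys
import tempfile


def run_generated_code(python_executable, code):
    """Write code to a temporary file and run it as `python <fileToRun>`."""
    with tempfile.NamedTemporaryFile("w", suffix=".py", delete=False) as handle:
        handle.write(code)
        file_to_run = handle.name
    try:
        return subprocess.run(
            [python_executable, file_to_run],
            capture_output=True,
            text=True,
        )
    finally:
        # Remove the generated file whether or not the run succeeded.
        os.remove(file_to_run)


# sys.executable is an assumption standing in for the configured executable.
result = run_generated_code(sys.executable, "import sys\nprint(sys.prefix)\n")
print(result.returncode, result.stdout.strip())
```

In a call like this the interpreter path is the only thing that selects the Python environment, which is why the configured executable matters.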
04-13-2020
09:22 AM
Hi Prajna,
You can always turn off data partitioning at project creation time. When creating a project, click the Advanced button in the "New Project" window, select the "Partition Data" tab and clear the "Create partition variable" check box. Note that you will not be able to change this setting after the Data node in any one of the pipelines has been run.
Radhikha
03-31-2020
06:15 PM
Hello,
Currently there is no way to invoke a Python virtual environment from the Open Source Code (OSC) node, because the node just calls the Python executable it finds (mostly in PATH) and invokes its base environment.
You can update the PATH by modifying the sas-compsrv file under the /opt/sas/viya/config/etc/sysconfig/compsrv/default directory and adding the following line:
export PATH=path_to_your_python_bin_directory:${PATH}
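To see why prepending a directory to PATH changes which interpreter is picked up, here is a small sketch using the standard library. `resolve_python` is a hypothetical helper; it only mimics a first-match PATH lookup and does not read the sas-compsrv file.

```python
import os
import shutil


def resolve_python(bin_dir=None, base_path=None, name="python"):
    """Return the first executable called `name` on the search path,
    optionally with bin_dir prepended (like the export line above)."""
    if base_path is None:
        base_path = os.environ.get("PATH", "")
    if bin_dir is None:
        search_path = base_path
    else:
        search_path = bin_dir + os.pathsep + base_path
    return shutil.which(name, path=search_path)


# Shows which executable the current PATH resolves to (may be None).
print("Resolved without an extra directory:", resolve_python())
```

Calling resolve_python("path_to_your_python_bin_directory") would report the interpreter in that directory if one exists there, since earlier PATH entries win.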
Radhikha
02-25-2020
09:57 AM
1 Like
That is correct, this capability tries different flow architectures with various model types and also optimizes for the best hyperparameters for each model type. It then picks the top 5 (5 by default but that number is configurable) models and draws those flows as a pipeline in Model Studio.
Radhikha
02-21-2020
02:09 PM
1 Like
I would say RPM in EM is similar to the basic, intermediate and the advanced templates that are already shipped with Model Studio.
What Jonathan is referring to here is a new capability added to Model Studio as part of the VDMML 8.5 release that implements the autoML (automated machine learning) initiative. The pipeline is built after trying different combinations of data-preparation, modeling and hyperparameter tuning steps for the specified input data. How many combinations it tries depends on how much time you allow through the maxModelingTime parameter.
Radhikha
12-18-2019
11:33 AM
2 Likes
Question
How can I pass data from an Open Source Code node to next node in SAS Model Studio?
Answer
Use the “Use output data in child nodes” property, available in SAS Visual Data Mining and Machine Learning 8.5 or later.
When this property is selected, you can write code in Python or R and should make the output data available in the dm_scoreddf data frame (if the “Generate data frame” property is selected) or in the node_scored.csv file. While executing, the node saves this output data and uses it as input to the subsequent child node(s). Even though data can be passed through the Open Source Code node with this new property, capabilities such as download score code, download score API, register models, publish models or score holdout data will not be enabled for open source models, because there is no underlying SAS score code for them.
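A minimal sketch of the CSV route, using only the standard library, could look like this. The location of node_scored.csv is an assumption (the node's working folder is used here, with a temporary directory standing in for it), and both `write_scored_csv` and the column names are hypothetical.

```python
import csv
import os
import tempfile


def write_scored_csv(node_dir, rows, fieldnames):
    """Write rows to node_scored.csv inside node_dir and return its path.

    Assumption: the node reads the file from its working folder.
    """
    path = os.path.join(node_dir, "node_scored.csv")
    with open(path, "w", newline="") as handle:
        writer = csv.DictWriter(handle, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(rows)
    return path


# Hypothetical output data; a temporary folder stands in for the node folder.
demo_dir = tempfile.mkdtemp()
demo_rows = [{"id": 1, "new_feature": 0.25}, {"id": 2, "new_feature": 0.75}]
scored_path = write_scored_csv(demo_dir, demo_rows, ["id", "new_feature"])
print(os.path.basename(scored_path))
```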
An example of passing data created in Open Source Code node in Data Mining Preprocessing lane to subsequent modeling node (Forest and Gradient Boosting) can be found on GitHub at sas-viya-dmml-pipelines project.
Example on GitHub
04-24-2019
10:15 AM
2 Likes
Question
Is it possible to create indicator variables (for each input) for missing values without actually imputing in Model Studio?
Answer
Yes, add an Imputation node to the pipeline, change the following properties and run the node.
Under Class Inputs, change Default method to none
Under Interval Inputs, change Default method to none
Under Indicators, (1) select Unique indicators check-box, (2) set Indicator subject to Missing variables and (3) set Indicator role to Input.
You will see that there is a new variable (with M_ prefix) for every input that has a missing value in the training data. This new variable takes 0 or 1 values to indicate missingness (0=not missing, 1=missing).
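To make the resulting variables concrete, here is a plain-Python illustration of what such indicators look like. It is not how Model Studio implements the Imputation node; `add_missing_indicators` and the sample inputs are hypothetical, and None stands in for a missing value.

```python
def add_missing_indicators(rows, inputs):
    """Return copies of rows with an M_<input> column (0 = not missing,
    1 = missing) for every input that has at least one missing value.
    The original values are left as they are (no imputation)."""
    with_missing = [
        name for name in inputs
        if any(row.get(name) is None for row in rows)
    ]
    result = []
    for row in rows:
        new_row = dict(row)
        for name in with_missing:
            new_row["M_" + name] = 1 if row.get(name) is None else 0
        result.append(new_row)
    return result


# Hypothetical training data: income is missing in the first row.
training = [
    {"age": 34, "income": None},
    {"age": 51, "income": 72000},
]
for row in add_missing_indicators(training, ["age", "income"]):
    print(row)
```

Only income gets an indicator here, because age has no missing values in the sample data.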
04-03-2019
08:44 AM
5 Likes
Question
Can the R ggplot package be used in the Open Source Code node for visualization purposes?
Answer
Yes, you can use the Open Source Code node in Model Studio to create plots with the ggplot2 package in R. Other Python or R visualization packages can also be used. The plots need to be saved as image files with the rpt_ prefix and a .png, .jpeg or .gif file extension. They are displayed in the Results after the Open Source Code node executes successfully.
Multiple visualizations using ggplot2 and rpart.plot packages are shown in this GitHub example.
Example on GitHub
Note that the above example requires the rpart, rpart.plot and ggplot2 packages to be pre-installed in R on the Compute Server.
A special thanks to @MelodieRush for sharing a similar example using SAS Enterprise Miner that led to this work.
03-20-2019
08:58 AM
5 Likes
Question
For an existing project, would it be possible to:
Reload the input table with additional records?
Reload the input table with additional variables?
Load the input table with a different name?
Change the way data is partitioned after any pipeline is executed?
Answer
The answer to all four of the above questions is YES!
To change the data source or data partition for an existing project:
Open the existing Project and go to the Data tab.
Click on the “Data sources” icon on the left.
Click on the “Replace data source” icon on the top left. This lets you select a new data source, whether it is a table with additional records, a table with additional variables or a table with a new name.
Additionally, if you want to change how the data is partitioned, follow the above steps and select the same or a new table. Then go to the “Partition Data” tab in Project settings (top right icon) and modify it accordingly.
Note that the target variable role in a Model Studio project cannot be modified after executing any node or pipeline in it.