02-21-2024
jlh368
Fluorite | Level 6
Member since
10-25-2016
- 9 Posts
- 4 Likes Given
- 0 Solutions
- 0 Likes Received
-
Latest posts by jlh368
Subject Views Posted 866 06-02-2021 08:09 AM 926 06-01-2021 04:40 PM 855 11-28-2017 04:02 PM 1485 08-02-2017 05:07 PM 22316 08-02-2017 04:10 PM 1256 07-27-2017 09:00 AM 1609 07-26-2017 02:19 PM 1670 07-12-2017 07:53 AM 1732 07-06-2017 04:56 PM -
Activity Feed for jlh368
- Posted Re: SQL Passthrough creates character vars of length 32767 on SAS Programming. 06-02-2021 08:09 AM
- Posted SQL Passthrough creates character vars of length 32767 on SAS Programming. 06-01-2021 04:40 PM
- Liked Tip: How to Apply Path Analysis in SAS® Enterprise Miner™ to Gain Insight into Sequential Behaviors for rayIII. 03-01-2018 03:06 PM
- Posted Re: unknown error when opening new project - Sas enterprise miner on SAS Data Science. 11-28-2017 04:02 PM
- Posted Re: Oversample and Score classification example on SAS Data Science. 08-02-2017 05:07 PM
- Posted Re: Tip: How to model a rare target using an oversample approach in SAS® Enterprise Miner™ on SAS Communities Library. 08-02-2017 04:10 PM
- Liked Tip: How to model a rare target using an oversample approach in SAS® Enterprise Miner™ for M_Maldonado. 08-02-2017 04:10 PM
- Liked 10 SAS Enterprise Miner shortcuts you’ll want to keep handy for AnnaBrown. 07-31-2017 11:18 AM
- Posted SAS Enterprise Miner 14.1 Ensemble model with Decision Node on SAS Data Science. 07-27-2017 09:00 AM
- Liked Re: ensemble models for WendyCzika. 07-27-2017 08:33 AM
- Posted Oversample and Score classification example on SAS Data Science. 07-26-2017 02:19 PM
- Posted Re: Enterprise miner - WARNING: Physical file does not exist on SAS Data Science. 07-12-2017 07:53 AM
- Posted Enterprise miner - WARNING: Physical file does not exist on SAS Data Science. 07-06-2017 04:56 PM
-
Posts I Liked
Subject Likes Author Latest Post 1 4 7 1
06-02-2021
08:09 AM
Thanks that is a good direction to question. The SQL column is varchar(max). It is used to store multiple types of data with an associated key field. It looks like the field that is being returned was created as an integer when it should have been a character field of length 10. Instead it was stored as an integer hence the max length standard integer being returned. Thanks!
... View more
06-01-2021
04:40 PM
Hello,
Support on an external server for the SQL driver ended. We are using in our ODBC connection with the driver SQL Server. We recreated the connection with ODBC Driver 17 for SQL Server. This reestablished our connection but the data returned for some character vars is returned at a length of 32767. I can cast the vars in the pass thru to get smaller numbers cast(var as varchar(9)) as var_name1. This seems like an extra step. What am I missing? Should I use the SAS/Access driver?
... View more
11-28-2017
04:02 PM
Hi, Are you able to run the nodes in the other projects or just view them? I had a similar issue and caused by an EM license update that I hadn't applied. Larry
... View more
08-02-2017
05:07 PM
I took a deeper dive into the example listed above and I realize there are many inputs that affect the score percentages. The change I had questioned below, the scoring percentages being closer to the original data set percentages, was the effect of the sample proportion. I adjusted the data partition percentages from Train/validate 50/50 to 70/30 and noticed the change in the model. This change, in turn, affected the scoring proportions. I also did see the updated prior probabilities in the SAS score code node. In short, it was doing what it was supposed to do, and I learned a bit. Any suggestions on topics to follow up on from here?
... View more
08-02-2017
04:10 PM
Hi, the diagram attached differs from the instructions listed above. The Decision node in the diagram under Train shows the following. Decisions set to Property (Custom in the example above) and Matrix is set to inverse priors. Following along, I switched this to custom and checked the decision matrix. This is set to "Do you want to use decisions?" Yes and has decision weights 1.428/0, 0/3.33. These would correspond to the weights of the original data set and not the first sample.
... View more
07-27-2017
09:00 AM
Hi all, I have a large data set that I have sampled (oversampled) and have a few models(Decision Tree, regression, neural) in a workflow. In addition, there is an ensemble model of the decision tree and regression. I have these 4 models going to a model comparison node. My question is where to put the decision node(s) to adjust the priors. In the Sas EM 14.1 Help for the Ensemble Node there it states "When you create a process flow diagram that contains an Ensemble node, do not specify prior probabilities in the diagram before the modeling nodes. In order to obtain the correct fit statistics for the combined unadjusted posterior probabilities, follow the Ensemble node in your diagram with a Decisions node, and use the Decisions node to specify probabilities." Should I use multiple Decision Nodes, one for each model before the model comparison node or just one after the model comparison node? Should the priors be adjusted for the model assessment or for the scoring? Thanks for the help Larry
... View more
07-26-2017
02:19 PM
Enterprise miner 14.1 Hello, I am following this example https://communities.sas.com/t5/SAS-Communities-Library/Tip-How-to-model-a-rare-target-using-an-oversample-approach-in/ta-p/223599?nobounce to familiarize myself with Oversampling. As an additional learning, I connected a score node to the model comparison node. My thought is to copy the original data set and the first sample and score this data set. So, I added set a copy of the original German Credit with a role of score and copied the first sample node (same seed, same sample size, and same event percent .05/.95) and ran the workflow. Class Variable Summary Statistics Data Role=SCORE Output Type=CLASSIFICATION Numeric Formatted Frequency Variable Value Value Count Percent I_good_bad . BAD 204 34 I_good_bad . GOOD 396 66 Data Role=SCORE Output Type=MODELDECISION Numeric Formatted Frequency Variable Value Value Count Percent D_good_bad . BAD 226 37.6667 D_good_bad . GOOD 374 62.3333 I had expected the results to be closer to the sample proportions (Bad .05 vs Good . 95), but the results appear close to the original data set. When I look at the score code, I see the original data set's posterior probabilities with no adjustment. Label P_good_badgood='Predicted: good_bad=good'; P_good_badgood = 0.7; Label P_good_badbad='Predicted: good_bad=bad'; P_good_badbad = 0.3; Am I just approaching this problem incorrectly? Have I made an error or just an error in understanding? I've attached a copy of my workflow, I renamed it .jpg. If you drop this you should be able to import into EM. Thanks!
... View more
07-06-2017
04:56 PM
Enterprise miner - WARNING: Physical file does not exist Hi, While running a diagram in EM 14.1 installed on my machine locally, the machine crashed and rebooted. After reboot, I am not able to view results in the diagram that was running. I tried to update the dataset and I see a warning in the log WARNING: Physical file does not exist, C:\Users\xxx123\AppData\Local\Temp\SAS Temporary Files\_TD8712_xxx123_\Prc2\DiagramOpenSessionResponse.xml. The file referenced does exist on that path. I am able to run other diagrams in the same project. I have tried the following - removing and readding the data sets - this shows the same warning I am not able to update the path. My guess is it's time to recreate, but one last chance before I restart. Thanks!
... View more