About SNLFAM1

SNLFAM1 · ‎09-09-2016

We receive data shipped to us as large, complex XML files with an XML schema included. Using SAS XML Mapper 9.4 and the schema, I used the "Auto Generate" process to create an XML Map (with auto-generated keys) from the XML schema. This resulted in the XML being divided into over 300 tables.

I ran PROC COPY to pull all the tables into SAS... and it took about 2 days to finish. I then took the same XML file and XML Map and ran a DATA step for one table; that step took a little more than 5 minutes. Based on the log, it appears that SAS reads through the entire XML file again for each table.

Is there a way to output to multiple SAS datasets with only one read through of the XML?

--Shaun

Code run on SAS 9.4 UNIX server:

    filename DataWare "/sas/data/LARGE.xml";
    filename SXLEMAP "/sas/data/XML Maps/LARGE_AUTO.map";
    libname DataWare xmlv2 xmlmap=SXLEMAP access=READONLY;
    libname tempo "%sysfunc(getoption(WORK))";
    proc copy in=DataWare out=tempo;
    run;
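This is not a single-pass read, but if the log is right that every table costs one full parse of the XML file, one partial workaround is to copy only the tables that are actually needed. The sketch below assumes the DataWare and tempo librefs from the code above are already assigned; the three table names are hypothetical placeholders, not names from the real XML Map.

```sas
/* Assumes the DataWare and tempo librefs from the post are already assigned. */
/* HEADER, ACCOUNT and TRANSACTION are hypothetical table names.              */
options fullstimer;   /* write more detailed timing for each step to the log */

proc copy in=DataWare out=tempo;
    select HEADER ACCOUNT TRANSACTION;   /* copy only these members */
run;
```

Under that assumption the run time should scale with the number of tables listed in the SELECT statement rather than with all 300+ tables in the map.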

SNLFAM1 · ‎08-21-2014

Thank you Jaap. As usual, your comments were helpful. I will be especially mindful of the limit on the number of parallel processes.

I have come up with a partial solution by taking the following steps:

Step 1: Turn on "Allow parallel execution on the same server" at the project level.
  A) Use the EG menu to go to File > Project Properties
  B) Select the section "Code Submission"
  C) Check the box next to "Allow parallel execution on the same server"
  D) Save the changes

Step 2: Store the "master" work session path somewhere fixed to be read in by other parallel processes
  A) Create a "program" node in EG that reads in the WORK directory path and writes it out to a 'fixed' location
    a. I used the following code to create a user- and project-unique location for storing the path as a libname statement in a .sas file to be read in with a %include:

        %let projectdir=%sysfunc(compress(&_clientprojectpath,,kn));
        data _null_;
          FILE "/sas/data/g_research/&sysuserid/paths/&projectdir..sas";
          a="libname workshr '%sysfunc(pathname(work))';";
          PUT a;
        run;

    b. Caution: You want to be sure that this file containing the path isn’t accidentally overwritten in the middle of your process flow. To accomplish this, I used automatic/system macro variables generated by EG to create something that is unique to this project, but common to all SAS workspace sessions spawned by this project. An alternative to using macros for the path is to simply have a fixed path, but this runs a higher risk of being overwritten by another project using the same path. I am trying to make this as portable as possible so that I can use it in many different EG projects.
  B) Change this program node so that it does not "Allow parallel execution on the same server"
    a. Go to the properties of the program node
    b. Select the section named “Code Submission”
    c. Click the bullet for the option “Customize code submission options”
    d. Make sure "Allow parallel execution on the same server" is NOT checked
    e. Save the changes
  C) Run this program node before running the rest of your project
    a. Note: Because this program node has "Allow parallel execution on the same server" turned off, it runs in the original ("master") workspace session, so the path it writes is that session's WORK directory.
    b. To automate things a little bit, I created an Autoexec process flow with this code node in it (Good info on Autoexec process flows: You asked for it: the Autoexec process flow - The SAS Dummy)

Step 3: Use this path to store any datasets that are used by other branches/nodes of your EG project
  A) Read in the path from the fixed location
    a. I used the following statements to call the code I had saved previously:

        %let projectdir=%sysfunc(compress(&_clientprojectpath,,kn));
        %include "/sas/data/g_research/&sysuserid/paths/&projectdir..sas";

    b. To automate things a bit more, I put these two lines of code in the following two option locations in EG:
        “Insert custom SAS code before task and query code”
        “Insert custom SAS code before submitted code”
       Note: This will cause the libname to be assigned prior to every task or node in the process flow. A little bit of overkill, but the code is small and the assignment is temporary.
  B) Assign a library to this path
    a. By using a %include referencing code that contains a libname statement, I have already accomplished this. If another method is used, you may need to have this be a separate statement that is run for every task and node in the EG process flow.
  C) Use the library to read and write any datasets that need to be in common.
    a. I tested this with many program nodes that all looked like the following:

        data workshr.temp2;
          set sashelp.cars;
        run;

    b. I also tested this with a Query Builder task, which also worked.
       Note: To automate the libname assignment somewhat, I chose the “workshr” library as the first “Default library for Output Data” option in EG, which seems to apply to all tasks, but not programs.

It seems a little contrived, but I chose this method so that I have it set up and available to any EG project that I run. It also doesn’t require any changes for a project that does not use the "Allow parallel execution on the same server" option (i.e. unchecked).

Some drawbacks that I would like to find solutions to:

1. I created a “/paths/” folder for all of the library paths to be stored in. This “/paths/” directory would have to be added to the home directory of any user who uses my project. This can be fixed with a more common or shared path… but that would mean more possibilities of corrupting that file while a process flow is running.
2. Anyone else who used my project would have to add a %include statement to their “Insert Custom SAS Code…” options unless I put the %include in the program nodes themselves.
3. Turning the option "Allow parallel execution on the same server" on and off for the same project causes some weird behavior that looks like errors, but seems to still run correctly.
4. I believe that this only works for datasets, and that options and macro variables that are assigned in any of these tasks or program nodes would not be available in any other task or program node. Does anyone know how to change the storage location of these other values?
5. If you are not careful, you could have locking issues or precedence issues where you try to read a dataset that has not been created yet.
6. In order to apply this to existing EG projects, you would have to change the libraries for nearly every step to the shared work library. Has anyone come up with a good way to redirect the WORK library in the middle of running code?
7. I am sure there are others…. But I can’t think of them right now.

Any suggestions on how I could improve this?
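On drawback 4, one possible approach (a sketch, not something tested in this thread) is to park selected macro variable values in a dataset in the shared library and reload them in each parallel session. It assumes the workshr libref is already assigned as in Step 3; the macro variable names RUN_DATE and REGION and the dataset name _shared_macros are hypothetical. SASHELP.VMACRO stores long values in 200-character chunks, so filtering on offset = 0 keeps only the first 200 characters of each value.

```sas
/* In the session that sets the values (variable names are hypothetical) */
%let run_date = 2014-08-21;
%let region   = WEST;

/* Save the chosen global macro variables to the shared library */
data workshr._shared_macros;
    set sashelp.vmacro;
    where scope = 'GLOBAL'
      and name in ('RUN_DATE', 'REGION')
      and offset = 0;              /* first 200 characters of each value only */
    keep name value;
run;

/* In any parallel session, after workshr has been assigned */
data _null_;
    set workshr._shared_macros;
    call symputx(name, value, 'G');   /* recreate each one as a global macro variable */
run;
```

The same precedence caveat as drawback 5 applies: the reload step only sees values that were saved before it runs. This sketch does not cover system options.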

SNLFAM1 · ‎08-19-2014

I have been experimenting with the "Allow parallel execution on the same server" option in Enterprise Guide 5.1 (What was the first version that contained this option?).

It seems to me that there should be a way to automatically generate and store the path of the WORK directory that is created when EG connects to the workspace server the first time, so that you can call it automatically in code that is running in parallel. Is this possible? Would I then (using prompts?) be able to insert this stored path in a libname statement so that certain parallel tasks and programs can read and write to the same work directory?

I would like all of this to be done in an automatic and repeatable way. I have been trying to use the "Insert SAS Code.." and "Submit SAS Code.." options in EG to make this the automatic, default method of submitting process flows. I have even considered altering the default output library (for tasks only?) so that all of this happens without having to alter existing EG process flows too much.

Any thoughts or suggestions on a way that I could take full advantage of the capabilities of "Allow Parallel Execution on the Same Server" but store the data all in the same temporary location? Any good papers or documentation on this option that I have yet to find?

Notes:
- We do not have SAS/CONNECT, which would seem to resolve this issue somewhat.
- We also do not have a grid-enabled environment, but that may be coming in the future.
- I also do not want all of these tables to be stored in a "permanent" location, which is why I want to send the data to a "shared" WORK directory.
- I am well aware of the potential consequences of allowing users to run MANY SAS workspace sessions with the click of a button, but our user base is small and I am confident that with proper training and monitoring we can take advantage of these features without undue stress on our server.
- We are moving to EG 6.1 soon, in case that version treats this differently.

** It seems that this would be a useful feature (with proper role permissions) to build into EG as out-of-the-box functionality, so that future versions of EG could use this option seamlessly in the way I describe. Any word of this feature being worked on by developers? And since I am wishing, any chance EG could write all of this out to a stored process that takes advantage of parallel processing using multiple "sessions"?

SNLFAM1 · ‎04-11-2014

Hi Chris, Thank you for this tool! It is a time (life) saver. I am curious why there is not a feature to search an EG project from within EG? Not across projects, but within a single project. Edit>>Find works well within a single object, but EG projects are often multiple objects in multiple process flows. It would be nice to have an "entire process flow FIND" or even an "entire EG project FIND". Does this exist, or do you know if they are working on this? Shaun L.

SNLFAM1 · ‎03-04-2014

Is there any way to use Sybase options (e.g. set showplan on) or Sybase stored procedures through SAS? Can you do this with a metadata-defined libname to the database? Is it possible using the pass-through facility for Sybase? Can I get the results/log back in my SAS log, or in a SAS dataset, or in an external file?

I have looked at the "SASTRACE" option (SAS/ACCESS(R) 9.3 for Relational Databases: Reference, Second Edition). That gives information about the 'translated' query that gets passed to Sybase and has a lot of good time statistics. I have also looked at the Sybase-specific information (SAS/ACCESS Interface to Sybase), but that didn't seem to have anything. I did notice the DBCONINIT libname option, which may be part of what I am looking for, but I cannot tell from the documentation (DBCONINIT= Libname Option). The SQL Pass-Through EXECUTE statement (EXECUTE Statement) also looks promising, but I have not been able to get it to work the way I would like.

I am performing a lot of testing for our Sybase DBAs to confirm that the SAS interface is working the same as their ISQL sessions, and I would like to be able to hand them log/results that show items like the Sybase index that was used in the processing. There may be an easier way to do this, but right now I am having to infringe on the DBAs' time for these tests, and they do not have much/any to spare.

Thanks for any help!!

Shaun L.
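For reference, the combination of SASTRACE and the pass-through EXECUTE statement mentioned above would look roughly like the sketch below. The server, database, credentials, table and column names are all hypothetical placeholders. Whether the showplan text actually comes back to the SAS log is exactly the open question in this post, so the sketch only shows the statements being sent, not a confirmed way to capture the plan.

```sas
/* Trace the SQL that SAS sends to the DBMS and write it to the SAS log */
options sastrace=',,,d' sastraceloc=saslog nostsuffix;

proc sql;
    /* Connection values are hypothetical placeholders */
    connect to sybase (server=MYSERVER database=mydb
                       user=myuser password="XXXXXXXX");

    /* Send a Sybase session option through explicit pass-through */
    execute (set showplan on) by sybase;

    /* Run a query on the same connection and bring the rows back to SAS */
    create table work.result as
        select * from connection to sybase
            (select col1 from mytable where col1 = 1);

    /* Return code and message from the most recent pass-through statement */
    %put SQLXRC=&sqlxrc SQLXMSG=&sqlxmsg;

    disconnect from sybase;
quit;
```

Because the EXECUTE statement and the query share one connection, the session option should at least be in effect for that query on the Sybase side.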

SNLFAM1 · ‎02-21-2013

Thanks Cynthia. I have been in contact with SAS support about this issue. I am not sure if it is because Microsoft Office 2013 is "not supported" by SAS at this time or if it is something else, but getting any information about this has been very difficult. SAS support did tell me all of the information included in my initial question, but that is about it. I am hoping that someone else is in the same boat as me and has some wisdom I might glean, and I would be happy to return the favor as I am being asked to begin testing AMO 5.1 in 64-bit Office 2013 next week. Thanks again for your response!

SNLFAM1 · ‎02-12-2013

Has anyone else had experience with the new Microsoft Office 2013 and its compatibility with AMO 5.1 and the PCFiles Libname engine in SAS 9.3M2? My organization is planning to move to Office 2013 in the next few months, and I have been tasked with testing to see what works and what doesn't. My understanding is that AMO 6.1 will eventually be released and supported for Office 2013, but that 5.1 is not. Anyone else in a similar situation? Is there any reason (other than that it is not supported) that we could/should not use AMO 5.1 and the PCFiles Libname engine in 9.3M2 with Office 2013?


Import XML File to Multiple SAS Datasets Without Re-Reading XML

Re: Allow Parallel Execution on the Same Server?

Allow Parallel Execution on the Same Server?

Re: How to search SAS Enterprise Guide files

Sybase Options and Stored Procedures through SAS

Re: Microsoft Office 2013 and it compatibility with AMO 5.1 and PCFile...

Microsoft Office 2013 and it compatibility with AMO 5.1 and PCFiles Li...

Re: how do I check the length of a variable

Macro variable _ClientProcessFlowLabel

Enterprise Guide: Improve Auto Arrange

Support for data step debugger

Provide the ability to SELECT * in a query builder
