About Kurt_Bremser

Kurt_Bremser · ‎06-17-2014

How should the final code look like?

Kurt_Bremser · ‎06-17-2014

You can use the output statement to create a table with the results, then assign a format, and then use proc print or similar to display the results.

Kurt_Bremser · ‎06-17-2014

100,000 rows with 200 colums means that the resulting .html page would be at least 100,000*200*30 bytes (including all the HTML tags). That's 600M! Imagine transmitting that to your browser, and displaying it. There's a reason why SAS bucks before that hurdle. If you want to have users access to such big chunks of data, create a stored process that does the subsetting and writes a file for download. The drill through capability in WRS is meant for something that can meaningfully be inspected with eyeballs Mk I.

Kurt_Bremser · ‎06-17-2014

You may find it here: TS790 -Diagnosing SAS Enterprise Guide Connectivity Problems

Kurt_Bremser · ‎06-17-2014

8591 looks like a workspace server port, not the metadata server port, which is 8561 per default. Do you have access to the metadata repository? Which client are you using?

Kurt_Bremser · ‎06-05-2014

If you can get the sort done "on the fly", that is probably the simplest solution. Then the merge just reads sequentially through the tables and writes the target,so it all depends on the sequential I/O throughput of your SAS server.

Kurt_Bremser · ‎06-05-2014

Thanks all for the help, I now have something reasonable in place. For further information: when I set up the 9.2 environment, I decided to let SASApp stay in its pristine state, and instead created several different Application Server environments for our "customers". SASApp basically acts as a blueprint, but is not used otherwise. I then transferred the old 9.1.3 configuration into one of the "customer" trees. I have now converted this tree so that it follows the conventions of SASApp (including "inheritance" of configs) and gotten the logging to work in (mostly) "Info" level. Now I should be able to identify accesses to certain tables and also see user's errors directly on the server.

Kurt_Bremser · ‎06-05-2014

Using PROC FORMAT should not make for lengthy code. Just write a macro where you specifiy which variables to use, and necessary parameters like length, and then transform your input data into a valid cntlin file. After that, you only have to write one line to generate the format. Converting a lookup table into a format is one of the most efficient ways to avoid another long-running join, the other is using a hash object.

Kurt_Bremser · ‎06-05-2014

Thank you. So I can safely use $HOME/somewhere as the path to store the files.

Kurt_Bremser · ‎06-05-2014

But be aware, if you work with large tables, that a single proc SQL with multiple joins will create a very large utility file and produce a lot of I/O overhead. Writing more sort/merge steps saves disk space and execution time. Up to an order of magnitude, in my experience. YMMV. You see, the tables are not suddenly sorted by magic if you use SQL like this. SAS still needs to do the logical equivalent of a proc sort, and during sql this is done less efficiently. As long as everything fits into memory, this is not an issue, but as soon as the tables outgrow your available memory, the process may slow down to the "drying of paint".

Kurt_Bremser · ‎06-04-2014

Try this: proc sort data=b /* this is your original oracle data set */ (where = (var1<= date and var2>date and var3>date) out=dataset1 ; by var1; run; proc sort data=dataset2 (keep=var1 var2 var3) out=data2x ; by var1; run; data x; merge dataset1 (in=a) data2x ; by var1; if a; run; Compare this method and the SQL method using options fullstimer; Also watch the disks while the jobs are running; you may be surprised by the disk usage(s). I remember when I first came across a piece of code done by a SAS consultant that had > 100 lines. I quickly saw that I could do the same in one create table with ~ 10 lines in PROC SQL, so why bother with all that code? Then I had to wait 5 hours for my SQL to finish, while his code took about 20 minutes to produce the same result. With less than half the disk space.

Kurt_Bremser · ‎06-04-2014

Can you give me a quick overview which processes write these log files? My goal is to have each user's logs in their own home directory tree, so I need to be concerned with the correct permissions. If the workspace server process (that runs under the user's own login) itself writes the log, it won't be a problem.

Kurt_Bremser · ‎06-04-2014

You may find an answer here:

Kurt_Bremser · ‎06-04-2014

Get in touch with your database admin. That is most probably an error in the setup of your client/instance.

Kurt_Bremser · ‎06-04-2014

Once you get to REAL data sets (50 million rows is in this range), you need to take care of your storage infrastructure. a) make it FAST, using high-rpm disks or SSDs for the work area. If you are concerned about failsafes, use RAID1 (simple mirrors). If being failsafe is not a big thing, use striping b) separate your UTILLOC physically from the work/data location, and make sure these disks do nothing else. UTILLOC is where the intermediate file is stored during PROC SORT Then look at this: a) use a combination of proc sort and data steps to do the merge. PROC SQL is a resource hog of the nth order when it comes to large joins. Real life experience here has shown that SQL gets progressively slower when several processes are running, much more than the sort/merge steps. Up to a point where the server becomes unresponsive, which is very rare with an AIX system(!). b) indexes usually don't help (much), because in addition to the data, SAS needs to read the index, causing even more I/O. Indexes are very good if you need to access a small subset of data. c) identify which sort criteria will be needed most, and have your data sets already sorted correctly when you store them. That way users (including yourself) do not need to sort and can read the big datasets sequentially. d) when you do a data step merge, you need space for (just) the source files and the target files. With SQL, you also need space for the utiilty file, which will grow to a size equal of all the source files together. During the sorts preceding the merge, you only need extra space for the file being sorted, the temp file will be in UTILLOC

Online Status	Offline
Date Last Visited	Sunday