We use SAS 9.4 on a Linux platform. All Studies share the same MS SQL DB Tables and I only want to overwrite the data for the Study currently being processed. So, I wrote a macro that:
Everything works beautifully. However, since we cannot create an MS SQL Server table from a SAS dataset using explicit pass-through SQL, I have to create the temporary DB table using implicit pass-through.
PROBLEM: Creating the temporary DB table using implicit pass-through takes over 2 hours to load 5 million records. Is there a better way of doing this? Below is my macro.
Any help much appreciated 🙂
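For context, the slow step is the implicit pass-through load itself, i.e. writing the SAS dataset to the DBMS through a SAS/ACCESS LIBNAME engine rather than via explicit CONNECT TO ... EXECUTE. A rough sketch of that kind of load (the actual macro is not shown here, so the libref, DSN, and table names are placeholders) looks like this:
libname str odbc dsn=sqlsrv_dsn schema=dbo;   /* connection details are placeholders */

proc sql;
   /* implicit pass-through: with default settings SAS ships rows to the DBMS one at a time */
   create table str.TEMP as
      select *
      from work.final;
quit;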
In addition to @Reeza's suggestion, adding the options INSERTBUFF= and DBCOMMIT= to your STR LIBNAME statement can make a HUGE difference in data loading performance. The default values for these options are not very efficient in my experience. You will probably need to experiment with different values to get the best load times; I suggest you try values like 1,000, 5,000, 10,000 and so on, but test on a subset of your 5 million rows to speed up the testing process.
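For example, something along these lines (the DSN and schema are placeholders; only the two tuning options matter here):
libname str odbc dsn=sqlsrv_dsn schema=dbo
        dbcommit=10000     /* commit after every 10,000 rows rather than the default */
        insertbuff=10000;  /* send 10,000 rows per insert rather than one at a time */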
Rather than a PROC SQL SELECT, try a PROC DATASETS COPY.
proc datasets lib=atr;
   * copy FINAL over to the ATR library;
   copy in=work out=atr;
      select final;
   * rename the copied data set to TEMP;
   change final=TEMP;
run;
quit;
The SQL step reads and inserts one row at a time, whereas COPY moves the data in bulk.
If you have the correct SAS/ACCESS license, you can also try PROC DBLOAD.
https://documentation.sas.com/doc/en/pgmsascdc/9.4_3.5/acreldb/n0o4e1x3ggqlxan12sokc2lbgxbh.htm
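PROC DBLOAD is the older SAS/ACCESS loading procedure; as a rough, untested sketch for an ODBC target (the DSN is a placeholder and the exact statements vary by engine, so check the documentation linked above), it would look something like this:
proc dbload dbms=odbc data=work.final;
   dsn=sqlsrv_dsn;    /* ODBC data source name - placeholder */
   table=TEMP;        /* DBMS table to create and load */
   commit=10000;      /* commit interval, analogous to DBCOMMIT= */
   limit=0;           /* 0 = load all observations */
   load;              /* request the load */
run;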
Thanks! I tested it on a million records and it provides a slight improvement over PROC SQL if paired with DBCOMMIT=10000 INSERTBUFF=10000 as suggested in the other reply.
NOTE: PROCEDURE DATASETS used (Total process time):
real time 8:56.77
cpu time 18.00 seconds
NOTE: PROCEDURE SQL used (Total process time):
real time 9:15.28
cpu time 17.61 seconds
Thanks! I tried all three values on a million records and it appears DBCOMMIT=10000 INSERTBUFF=10000 provides the most improvement.
@kc_dn - Woohoo! That's a massive improvement! Can't complain about loading a million rows in 9 minutes. Bulk loading might improve things further but if you are happy with the current load times I wouldn't bother.
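For completeness, bulk loading is typically requested with the BULKLOAD=YES data set option; whether it is available depends on the SAS/ACCESS engine and drivers in use, so this is only an untested sketch:
/* BULKLOAD= availability depends on the engine (the dedicated      */
/* SAS/ACCESS Interface to Microsoft SQL Server supports it);       */
/* libref and table names are placeholders.                         */
data str.TEMP (bulkload=yes);
   set work.final;
run;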