BookmarkSubscribeRSS Feed
Alexsander
Fluorite | Level 6

Hi All,
     I have performance issue with my report code.
My code takes around 69 minutes to complete the report. One “Proc report” procedure takes 66 minutes to complete the step.
The data has 104 columns and 168,745records, the physical size of my data is 450 MBs.

Below is my proc report code and fulltimer options log:

Code:

Proc report Data=SMALL_MIS_FINAL center style(header)=[background=cx113388 color=cxffffff]
style(summary)=[font_style=roman font_weight=bold] missing split= '~' spanrows;
Column column1 to column_103 row_count;
Define row_count / "Row Count" computed noprint;
Compute row_count;
obs+1;
row_count=obs;
If int(row_count/2)=row_count/2 Then Do;
CALL DEFINE (_row_,'STYLE','STYLE=[BACKGROUNDCOLOR=CXEFEFEF]');
End;
Endcomp;
Run;

Log:

NOTE: There were 168745 observations read from the data set WORK.SMALL_MIS_FINAL.
NOTE: At least one W.D format was too small for the number to be printed. The decimal may be shifted by the "BEST" format.
NOTE: PROCEDURE REPORT used (Total process time):
      real time           1:06:14.57
      user cpu time       1:06:22.96
      system cpu time     7.24 seconds
      memory              860845.95k
      OS Memory           903724.00k
      Timestamp           09/16/2014 06:10:33 PM
      Page Faults                       0
      Page Reclaims                     68733
      Page Swaps                        0
      Voluntary Context Switches        9113
      Involuntary Context Switches      13645
      Block Input Operations            976
      Block Output Operations           1554480

Please let me know how to reduce cpu/ real time in this step.

Thanking you in advance.
Regards,
Alex

7 REPLIES 7
RW9
Diamond | Level 26 RW9
Diamond | Level 26

Hi,

Some thoughts;

What ODS destination are you using?  Do you send to a file?

I am not sure what you are attempting to ouput without test output as you appear to be computing OBS each record which is a big straing on resources.

Also the in statement with the function calls may take a while as has to happen each time.

I would suggest you look at what you want out and sort you data out first, e.g. put calculations in a datastep prior to the report procedure, and just have a simple proc report.  If you provide test data and desired output I could suggest further.

Edit: Maybe something like:

data want;
  set sashelp.cars;
  if mod(_n_,2)=0 then skip=1;
run;
proc report data=want;
  column make model type skip=skipn row_count;
  define row_count / "row count" computed noprint;
  define skipn / analysis noprint;

  compute row_count;
    if skipn=1 then call define (_row_,'style','style=[backgroundcolor=cxefefef]');
  endcomp;
run;

Alexsander
Fluorite | Level 6

Hi RW9,

     Thank You for your thoughts on this.

We are using Excel as ODS destination. we create excel file as output file (infact through XML file).

I have removed the compute statment in my proc report code. but still it takes more than 60 minutes to create excel file for 168,000 records.

Please let me know if you have any other questions/ suggestion on this.

Again Thank you for your valuable time!.

Regards,

Alex

RW9
Diamond | Level 26 RW9
Diamond | Level 26

Well, I wouldn't recommend exporting that much data to Excel.  Firstly only the most recent releases of Excel will even handle that kind of data, and you won't be able to use it in any meaningful way in Excel.  As for creating the file, well its possible that it does take that long.  Remember the ods system is running through a tagset conditionally outputting a lot more text per "cell" as XML is very verbose.  So your 168000 records, depending on column count could run into hundreds of mb of XML.

Really, go back and asses what you are trying to do with the data, if its transfer of data from one person to another use CSV not XML.  If someone wants to review 168,000 rows * x number of columns - well best of luck.  If its a datalisting to go in a report, maybe use PDF.

Cynthia_sas
SAS Super FREQ

Hi, you said that you are using "Excel" as the ODS destination, but you do NOT show the ODS code. Are you using ODS CSV? ODS HTML? ODS MSOFFICE2K? ODS TAGSETS.EXCELXP? or the experimental ODS EXCEL in SAS 9.4? When you use many of the ODS methods, you are NOT creating a true, binary Excel file, you are merely creating an ASCII text file that Excel knows how to open and render.

I agree with DATA _NULL_ that you are outputting a lot of rows and it seems like a lot of columns to your output with PROC REPORT and, so it is hard to debug performance issues over the Forums without the complete code and the some data to test. Given the size of your data and the incomplete information about your ODS destination, other factors that could be complicating your running time would be the operating system you're using and the version of SAS you're using, as well as the version of Office you're using to render the file.

To me, all of this points to opening a track with Tech Support, where they can look at ALL of your code (including ODS statements), a sample of your real data, and they can collect the necessary information to really do debugging and see what is causing the file to take so long to be created.

cynthia

Alexsander
Fluorite | Level 6

Hi Cynthia  / Data _null_ /RW9,

     Thank you for your  input on this.

I use ODS TAGSETS.EXCELXP Example "ODS TAGSETS.EXCELXP options(sheet_name='Parameter' absolute_column_width='20,40' width_fudge='1.0' autofit_height ='yes' embedded_titles='yes' orientation='landscape' sheet_interval='PROC' )", using  SAS v9.3 in Linux machine.

The excel/xml file output size is around 300 MBs. The number of column is 103 and number of row is 168,000.

My complete report code takes 69 minutes, out of this, only proc report procedure takes 66 minutes to write it in excel/xml file. the remaining part (Joining different tables, formating etc.) takes only 3 minutes.I will check with SAS tech support.  let me k now if you guys have any other solutions to reduce proc report run time.

Thanking you,

Alex.

data_null__
Jade | Level 19

How big is the XML file you are creating?

How long does it take for EXCEL to load it?

Message was edited by: data _null_

OS2Rules
Obsidian | Level 7

You also have not given any specs on the system that you are running on.  Obviously

running on a z/80 is not a fast as running on a z/OS system.

I create a 550 Excel spreadsheets from about 980K records and it still takes me about

30 minutes on a 8 way server.

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 7 replies
  • 2049 views
  • 0 likes
  • 5 in conversation