BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
thewan
Quartz | Level 8

I would like to parallelize inside a sas proc. I want to run:

proc lifereg data=main;
 by SampleNo;
 model statement etc.
run;

Where each sample will execute at the same time. Is that something that is possible? Thank you.

 

I went through this paper. http://www.lexjansen.com/wuss/2018/15_Final_Paper_PDF.pdf One of its main takeaways is that any proc or data steps that don't rely on one another can be run at the same time.

 

1 ACCEPTED SOLUTION

Accepted Solutions
sbxkoenk
SAS Super FREQ

Hello @thewan ,

 

A by-statement is very efficient! It's probably not processing your samples in parallel (as there's only one SAS session in 'the game') but it's very efficient. Much more efficient than looping over your samples.

If you are talking about parallel processing, you should mention your SAS version (SAS 9.4 Mx or SAS VIYA 3.5+).
Submit 

%PUT &=sysvlong4;

to find out.

Anyway I think you can use MP Connect (Multi-Process Connect) if you have SAS/Connect.

 

It works very well and I use it a lot.
Here's some info:

  1. Running SAS programs in parallel using SAS/CONNECT®
    By Leonid Batkhan on SAS Users January 13, 2021
    https://blogs.sas.com/content/sgf/2021/01/13/running-sas-programs-in-parallel-using-sas-connect/
  2. Base SAS + SAS/CONNECT - A simple method to generate load on any number of licensed cores
    Posted 04-08-2021 05:46 AM | by SimonWilliams
    https://communities.sas.com/t5/SAS-Communities-Library/Base-SAS-SAS-CONNECT-A-simple-method-to-gener...

If you have six samples you launch 6 concurrent SAS sessions, each one dealing with one sample.
Like here:

options sascmd="sas";

/* Process 1 */
signon  task1;
rsubmit task1 wait=no;
 PROC LIFEREG DATA=;
 where SampleNumber="1";
 MODEL ...;
 RUN;
endrsubmit;

/* Process 2 */
signon  task2;
rsubmit task2 wait=no;
 PROC LIFEREG DATA=;
 where SampleNumber="2";
 MODEL ...;
 RUN;
endrsubmit;

/* ... ... ... */

/* Process 6 */
signon  task6;
rsubmit task6 wait=no;
 PROC LIFEREG DATA=;
 where SampleNumber="6";
 MODEL ...;
 RUN;
endrsubmit;

waitfor _all_;
signoff _all_;
/* end of program */


Kind regards,
Koen

 

View solution in original post

3 REPLIES 3
sbxkoenk
SAS Super FREQ

Hello @thewan ,

 

A by-statement is very efficient! It's probably not processing your samples in parallel (as there's only one SAS session in 'the game') but it's very efficient. Much more efficient than looping over your samples.

If you are talking about parallel processing, you should mention your SAS version (SAS 9.4 Mx or SAS VIYA 3.5+).
Submit 

%PUT &=sysvlong4;

to find out.

Anyway I think you can use MP Connect (Multi-Process Connect) if you have SAS/Connect.

 

It works very well and I use it a lot.
Here's some info:

  1. Running SAS programs in parallel using SAS/CONNECT®
    By Leonid Batkhan on SAS Users January 13, 2021
    https://blogs.sas.com/content/sgf/2021/01/13/running-sas-programs-in-parallel-using-sas-connect/
  2. Base SAS + SAS/CONNECT - A simple method to generate load on any number of licensed cores
    Posted 04-08-2021 05:46 AM | by SimonWilliams
    https://communities.sas.com/t5/SAS-Communities-Library/Base-SAS-SAS-CONNECT-A-simple-method-to-gener...

If you have six samples you launch 6 concurrent SAS sessions, each one dealing with one sample.
Like here:

options sascmd="sas";

/* Process 1 */
signon  task1;
rsubmit task1 wait=no;
 PROC LIFEREG DATA=;
 where SampleNumber="1";
 MODEL ...;
 RUN;
endrsubmit;

/* Process 2 */
signon  task2;
rsubmit task2 wait=no;
 PROC LIFEREG DATA=;
 where SampleNumber="2";
 MODEL ...;
 RUN;
endrsubmit;

/* ... ... ... */

/* Process 6 */
signon  task6;
rsubmit task6 wait=no;
 PROC LIFEREG DATA=;
 where SampleNumber="6";
 MODEL ...;
 RUN;
endrsubmit;

waitfor _all_;
signoff _all_;
/* end of program */


Kind regards,
Koen

 

thewan
Quartz | Level 8

Thank you, your example code was what I needed!

ChrisNZ
Tourmaline | Level 20

The list of multithreaded procedures is available here:

https://support.sas.com/rnd/scalability/procs/

If you can't use one of these, you can maybe:

1. Ask SAS R&D to add proc lifereg to the list

2. Try to program the logic you need using proc DS2

3. Break the table into subsets, and multithread using MPConnect as shown by @sbxkoenk 
    You could also multithread using call system() but you lose any ability to synchronise the jobs 

 

Note that:

- You first need to ensure that you bottleneck is the CPU. If it's the disk, you'll only make things worse by multithreading.

- There are overheads when multithreading and the gains may not be as expected

 

call dosubl() is still limited to sequential processing sadly. 

 

hackathon24-white-horiz.png

The 2025 SAS Hackathon has begun!

It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.

Latest Updates

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 3 replies
  • 1566 views
  • 4 likes
  • 3 in conversation