BookmarkSubscribeRSS Feed
bernhard
Calcite | Level 5

Hi all

I am using this paper http://support.sas.com/resources/papers/proceedings10/109-2010.pdf to parallelize a data step (which is essentially a do loop simulating something).

I wonder what the best approach is to generate random numbers for the multiple threads. Right now I am initizalizing with different seeds "CALL STREAMINIT(&seed.); " where seed is different for each thread/task.

However this is not optimal since the different paths of the Pseudo-RNG could "coincide" after some time (if I am unlucky). Is there any way to improve this situation? Is there any documentation available on the Mersenne Twister configuration in SAS? I am only relying on  RAND in my code.

Thanks

Bernhard

3 REPLIES 3
Rick_SAS
SAS Super FREQ

Yes, in theory you could get unlucky.  In practice I suspect that the probability of getting two streams that are not independent are extremely small. I haven't run the analysis myself, but other people have done what you are proposing and have not reported any dependencies.

If you want to see the literature in this area, search for articles by Piere L'Ecuyer and colleagues on PRNGs that have  provably independent streams.

bernhard
Calcite | Level 5

Thank you Rick.

hackathon24-white-horiz.png

2025 SAS Hackathon: There is still time!

Good news: We've extended SAS Hackathon registration until Sept. 12, so you still have time to be part of our biggest event yet – our five-year anniversary!

Register Now

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 3 replies
  • 1243 views
  • 3 likes
  • 3 in conversation