getting the same result but spend a shorter time

PROC Star

Re: getting the same result but spend a shorter time

Posted 08-11-2021 12:32 PM (393 views) | In reply to Alexxxxxxx

The best answer may depend on whether there is a known, sorted order to have2.

Here is an approach to consider. I'm not sure if it's feasible, because I'm not sure how you want to handle duplicate entries. But there should be a few posters on this thread who will take the idea and run with it. (Sorry, I can't spend enough time to look up the details.)

Rather than creating a hash table, create an informat. The advantage: The hash table has to process the 300M records for each run (i.e., for each year). But an informat can be created once, and permanently saved. It also forces you to clean out any duplicates from HAVE2.

Once that is done, you would be appending to just a smaller (300K records) data set and the processing should be swift.

April 27 – 30 | Gaylord Texan | Grapevine, Texas

Registration is open

Walk in ready to learn. Walk out ready to deliver. This is the data and AI conference you can't afford to miss.
Register now and lock in 2025 pricing—just $495!

Discussion stats

15 replies
‎08-10-2021 11:18 PM
3505 views
10 likes
7 in conversation

Re: getting the same result but spend a shorter time

Registration is open

SAS Training: Just a Click Away