I was reading a May 10th, 2023 SAS-L1 email digest about issues with running mergers with very large data sets (or tables). An email was sent about an individual having long run times with mergers and hash joins and wanted to see how to speed up mergers taking 6 hours or more and HASH joins taking 8 Hours.
From my experience, there are many issues over the years where Data Mergers and/or SQL joins were taking too long usually; sometimes 2 or more days of sorting and merging 2 files. My correction for the 2-day run using USER FORMATS resulted in a 10-minute merge. A common issue in processing epidemiology and consumer data analysis. Using the USER FORMAT reads the larger file without having to sort it and creates a smaller large file that can then be sorted in quicker time.
I have posted a LinkedIn post that showed some examples of this issue and added a few references on PROC FORMAT:
Jonas V. Bilenas