Dealing with SAS data sets/tables of such sizes requires Architectural and Coding special considerations and approaches. From System Architecture/Infrastructure point of view I/O subsystem and it's sustained throughput is something you'll need to pay special attention to. Not sure what's your level of influence within your Organization, and at what stage of the project you are in, but if you want to ensure efficient utilization and acceptable run times, then you need to get the right Storage specifications in place. The correct Ratio of GB of RAM per CPU/CPU Core (4|6|8 GB of RAM per CPU/CPU Core) has an impact on the performance and response times. From SAS Storage Engine point of view SAS Base Engine data set: Typically suitable for tables under 20 GB in size (Some people may disagree with this!). Each data set is a single binary file, therefore you have to look at it from Processing (Reading, Updating), Backing-up, and Restoring point of views. SAS SPD Engine data set: Best suited for processing large tables and reducing response times. It requires deeper technical skills to setup and use. Check out the following link for additional details SAS® Scalable Performance Data Engine, There are links to When to use, System Requirements, and other useful links. From SAS Coding point of view I think you have already got some wonderful and insightful responses already. There is not much I can add here. Good luck, Ahmed
... View more