Hi.
I am using SAS University Edition in a Windows 10 environment with VMware Workstation Player 12. I am doing logistic regression and propensity score matching. My data consist of 1.5 million observations and 163 variables. The size of the data file is about 450 MB. The program consists of several macros and 250 lines. When I run the program, it stops and a message pops up.
This may be too big for SAS UE. It was originally intended for learning purposes only, and there are restrictions built into the software. I would contact SAS Tech Support directly about this and ask whether there is a workaround. Given the limitations of the software, one way around them is to divide the data into smaller portions and process each portion individually, if possible.
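As a rough sketch of the "smaller portions" idea, the FIRSTOBS= and OBS= data set options can read the input in slices. The data set names (WORK.FULL, WORK.PART1, WORK.PART2) and the slice size of 500,000 are assumptions for illustration, not taken from the original program.

```sas
/* Hypothetical names: WORK.FULL stands in for the large input data set. */
/* OBS= is the number of the LAST observation to read, not a row count.  */
data work.part1;
    set work.full(firstobs=1 obs=500000);        /* observations 1 to 500,000 */
run;

data work.part2;
    set work.full(firstobs=500001 obs=1000000);  /* observations 500,001 to 1,000,000 */
run;
```

Whether the analysis can sensibly be run per slice depends on the method; for matching in particular, the results from separate slices may not be equivalent to a run on the full data.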
Need to chime in with @Reeza: this data set is obviously larger than required for self-paced learning purposes. You should be able to use a subset of your input data and still be able to perform your training.
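One possible way to draw such a subset is a simple random sample in a DATA step. This is only a sketch: WORK.FULL and WORK.SAMPLE are hypothetical names, and the 10% rate and the seed are arbitrary choices.

```sas
/* Hypothetical names: WORK.FULL is the large input, WORK.SAMPLE the subset. */
data work.sample;
    set work.full;
    /* Fix the random stream once so the sample is reproducible. */
    if _n_ = 1 then call streaminit(12345);
    /* Subsetting IF: keep each observation with probability 0.10. */
    if rand('uniform') < 0.10;
run;
```

The resulting sample holds roughly, not exactly, 10% of the observations.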
One thing you could do is delete any WORK data set that is no longer needed in your process as soon as it has served its purpose.
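A minimal sketch of that cleanup, using PROC DATASETS. The data set names STEP1 and STEP2 are placeholders for whatever intermediate tables the macros create.

```sas
/* STEP1 and STEP2 are hypothetical intermediate data sets in WORK. */
proc datasets library=work nolist;
    delete step1 step2;   /* free the disk space as soon as they are no longer needed */
quit;
```

Placing a step like this right after the last use of each intermediate table keeps the WORK library from growing through the whole run.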
Although the data file size is "only" 450 MB, this does not mean that a SAS dataset won't be considerably larger.
Let's make a quick calculation:
Assume your variables were all numeric (8 bytes), then a single observation would consume
163 * 8 = 1,304 bytes
Multiply by 1.5 million
1,304 * 1,500,000 = 1,956,000,000 bytes
equates to 1,910,156.25 Kbytes
equates to 1,865.387 Mbytes
equates to 1.8217 GB
Depending on the structure (char variables might be _considerably_ longer), the dataset might be much bigger than that.
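The same estimate can be reproduced in a DATA step, and PROC CONTENTS reports the actual observation length and number of observations for comparison. This is a sketch: WORK.FULL is a hypothetical name for the imported data set, and the estimate ignores header and page overhead.

```sas
/* Rough estimate, assuming all 163 variables are 8-byte numerics. */
data _null_;
    bytes = 163 * 8 * 1500000;
    mb    = bytes / 1024**2;
    gb    = bytes / 1024**3;
    put bytes= comma20. mb= comma12.3 gb= 8.4;   /* written to the SAS log */
run;

/* Actual figures: see "Observation Length" and "Observations" in the output. */
/* WORK.FULL is a hypothetical data set name.                                 */
proc contents data=work.full;
run;
```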
UE is simply not designed for data volumes of this size.
Inspect your data on the Stata side and reduce the number of observations.
Use the compress=yes option to reduce the disk space your data sets consume.
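COMPRESS= can be set either as a system option or as a data set option. A short sketch of both forms follows; the data set names WORK.FULL and WORK.FULL_C are placeholders.

```sas
/* System option: applies to data sets created after this statement. */
options compress=yes;

/* Data set option: applies only to the data set being created.      */
/* WORK.FULL and WORK.FULL_C are hypothetical names.                  */
data work.full_c(compress=yes);
    set work.full;
run;
```

The log should include a note on how much compression changed the size of the data set. How much it helps depends on the data; it tends to help most when there are long character variables with many trailing blanks.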