TSarthaka2018
Calcite | Level 5
Hi, I have about 2 TB of data that requires data prep and other operations, eventually leading to model development using linear regression.

Please help me with all the procedures that you use to process such huge data while saving time.
3 REPLIES
Ksharp
Super User

You could check any PROC whose name starts with HP (the high-performance procedures), such as:
PROC HPGENSELECT
PROC HPLOGISTIC
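
Since the stated goal is linear regression, the HP procedure that matches most directly is probably PROC HPREG. The following is only a minimal sketch: the library `mylib`, the table `big`, and the variables `y`, `x1`, `x2`, and `region` are hypothetical placeholders, and the thread count should be adjusted to the machine.

```sas
/* Sketch only: mylib.big, y, x1, x2 and region are hypothetical names */
proc hpreg data=mylib.big;
   class region;                  /* categorical predictor */
   model y = x1 x2 region;        /* ordinary linear regression */
   selection method=stepwise;     /* optional variable selection */
   performance nthreads=8 details; /* multithreading plus a timing breakdown */
run;
```

The DETAILS option on the PERFORMANCE statement prints where the time was spent, which helps when judging whether the full 2 TB run is feasible.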

 


LinusH
Tourmaline | Level 20
If this is a one-off, I don't think it matters so much.
If this is recurring work that will be executed regularly, try to do the data prep "and other operations" on only the new data.
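
One way to prep only the new data is sketched below, assuming the raw table carries a load-date column. The names `mylib.raw`, `mylib.prepped`, `load_date`, and the cutoff date are all hypothetical, and the cleaning step is a stand-in for the real logic.

```sas
/* Sketch only: table names, load_date and the cutoff are hypothetical */
%let last_run = '01JAN2025'd;     /* date of the previous successful run */

data work.new_prepped;
   /* read only rows that arrived after the last run */
   set mylib.raw(where=(load_date > &last_run));
   /* placeholder for the real data prep */
   if missing(x1) then x1 = 0;
run;

/* add the newly prepped rows to the table that already holds the history */
proc append base=mylib.prepped data=work.new_prepped;
run;
```

With this pattern each run touches only the increment, so the cost scales with the amount of new data and not with the full 2 TB.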
Data never sleeps
SASKiwi
PROC Star

Take a random sample of your data and do all of your data preparation and model development on that. Once you have it working, run it on the full data.
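
A simple random sample can be drawn with PROC SURVEYSELECT. This is a sketch under assumptions: `mylib.big`, `y`, `x1`, and `x2` are hypothetical names, and the 1% rate and the seed are arbitrary choices.

```sas
/* Sketch only: mylib.big, y, x1 and x2 are hypothetical names */
proc surveyselect data=mylib.big out=work.sample
                  method=srs      /* simple random sampling */
                  samprate=0.01   /* keep about 1% of the rows */
                  seed=12345;     /* fixed seed so the sample is repeatable */
run;

/* develop and debug the model on the small sample first */
proc reg data=work.sample;
   model y = x1 x2;
run;
quit;
```

Fixing the seed means the same sample comes back on every run, which makes it easier to tell whether a change in results comes from the code or from the data.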
