deleted_user
Not applicable
Hello All,

Can you tell me the pros and cons of using temporary datasets rather than permanent datasets in DI Studio?

As far as I know, using permanent datasets in batch mode adds storage overhead, but in interactive mode they are helpful in terms of simplifying job development and testing.

It would be great if you could list more details or point me to a link (website) that covers this.

Thanks in advance.
2 REPLIES
sbb
Lapis Lazuli | Level 10
The main practical difference with a temporary dataset is restartability: depending on how the temporary dataset is used, your job may not be restartable from an intermediate step, because the dataset is gone once the session ends. Also, a temporary dataset's disk space is typically allocated from a different DASD storage pool than a permanent dataset's, but you cannot count on that unless you verify it with your Data Storage Management staff. So, depending on how critical restart/recovery is for your batch process, I would say that the "overhead" is a non-issue, and you should focus on whether restartability matters for your SAS batch processing.

Regarding a technical reference or website, you would be best served by discussing the topic/point specifically with your Data Storage Management personnel.

Scott Barry
SBBWorks, Inc.
LinusH
Tourmaline | Level 20
If you have average to complex data loading jobs, with transformations and cleansing, it's best practice to have data "flow" into the warehouse in steps, such as ODS, staging, transformed daily bulk, target detail DW, star-schema DM, information marts, etc. Each step should be kept in permanent data storage. It's also handy to have each job load just one level at a time, which lets you organize scheduling in a simpler way (remember not to let a job refer to data created by a subsequent job in the data flow).

In 9.2, there is a restart feature that allows you to restart a job at a specified point, which could be a temporary table (SAS will make the temporary table temporarily permanent between runs).

If you still feel that you need to save some intermediate data between sessions, you can have them defined as permanent during development, and then change to temporary before going to test/prod.
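
In hand-written SAS code, the same idea could be sketched roughly like this. This is only an illustration, not what DI Studio generates: in DI Studio you would normally change the physical storage of the table in its properties instead. The macro variable `env`, the macro `set_stage_lib`, the libref `stage` and the path are all made-up names for this example.

```sas
/* Hypothetical switch: set to DEV while developing, anything else for test/prod */
%let env = DEV;

%macro set_stage_lib;
  %global stagelib;
  %if &env = DEV %then %do;
    /* Development: keep intermediate tables in a permanent library */
    libname stage "/data/dev/stage";   /* hypothetical path */
    %let stagelib = stage;
  %end;
  %else %do;
    /* Test/prod: intermediate tables go to WORK and vanish at session end */
    %let stagelib = work;
  %end;
%mend set_stage_lib;
%set_stage_lib

/* The intermediate table lands in whichever library was selected above */
data &stagelib..cleaned;
  set sashelp.class;
  where age >= 12;
run;
```

With `env` set to DEV, `stage.cleaned` survives the session so you can inspect it or rerun downstream steps; with any other value the table is created as `work.cleaned` and is deleted when the session ends.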

/Linus
Data never sleeps

