Help using Base SAS procedures

Pros ans cons of using temporary ds rather permanent dataset in di studio

Reply
N/A
Posts: 0

Pros ans cons of using temporary ds rather permanent dataset in di studio

Hello All,

Can you tell me the pros and cons of using temporary dataset rather using permanent dataset in di studio?

As far as I know, in batch mode using permanent datasets will cause overhead of storage area but in interactive mode it would be helpful interms of simplifying the job development and testing.

Would be great if you can listout more details or mention a link(website) which covers the same.

Thanks in advance.
Super Contributor
Super Contributor
Posts: 3,174

Re: Pros ans cons of using temporary ds rather permanent dataset in di studio

Posted in reply to deleted_user
The only difference with a temporary dataset is that your job cannot be restarted, depending on how the temporary dataset is used. Also, typically a temporary dataset's disk space usage is allocated to a different DASD storage pool than would be for a permanent dataset -- but that is not a guaranteed consideration unless you were to verify with your Data Storage Management staff. So, depending on how critical the restart/recovery of your batch process, I would say that the "overhead" is a non-issue and you should be focused on whether it is or is not important to have restartability for your SAS batch processing.

Regarding a technical reference or website, you would be best served by discussing the topic/point specifically with your Data Storage Management personnel.

Scott Barry
SBBWorks, Inc.
Super User
Posts: 5,426

Re: Pros ans cons of using temporary ds rather permanent dataset in di studio

Posted in reply to deleted_user
If you have average/complex data loading jobs, with transformations and cleansing, it's best practice to have data "flow" into the warehouse in steps, such as ODS, staging, transformed daily bulk, target detail DW, star-schema DM, information marts etc. Each step should be in a permanent data storage. It's also handy to keep each job just to load one level at the time, allowing you to organize scheduling in a simplified way (remember not to allow jobs to refer to data created in subsequent job in the data flow).

In 9.2, there is a restart feature which allow you to restart a job at specified point, which could be a temporary table (SAS will make the temporary table temporarily permanent between runs).

If you still feel that you need to save some intermediate data between sessions, you can have them defined as permanent during development, and then change to temporary before going to test/prod.

/Linus
Data never sleeps
Ask a Question
Discussion stats
  • 2 replies
  • 208 views
  • 0 likes
  • 3 in conversation