SAS Data Integration Studio, DataFlux Data Management Studio, SAS/ACCESS, SAS Data Loader for Hadoop and others

job in parallel

Reply
Contributor
Posts: 31

job in parallel

Hi,

I need to develop a SAS Data integration Studio job (call it myjob) that runs some jobs in parallel.

When I drag and drop a jobs (call them job1, job2, etc.) in myjob, the tool automatically create an execution order between job1 and job2.
So I cannot avoid an execution order and forced to run the job in a sequential order.
But I would run the jobs in parallel, or run some tasks in parallel. How can I do it?

Thanks.

Super User
Posts: 5,849

Re: job in parallel

Basically, i defer from using DI Studio's job-in-job functionality unless I want to build parametrized loop jobs.

I think that defining isolated (non-iterative/parametrized) job sequences is best done in your scheduling tool/software (for some like cron/WinTask and LSF) can be defined even in SAS Management Console.

On drawback I see to use job-in-job is the lengthy logs, hard to monitor etc.

 

That said, there is a Fork transformation that might work for this (haven't tried it with jobs, only "normal" transformations).

Data never sleeps
Super User
Posts: 5,849

Re: job in parallel

Edit: just tested the Fork, and it seem you can not map jobs to it, only other transformations.

So I guess my first recommendation is still the most viable...

Data never sleeps
Contributor
Posts: 31

Re: job in parallel

I've created a loop transformation. The job included in the loop uses an Execute Transform, that should call an Oracle stored procedure for each parameter values. Actually, the user written code of this transform is:

 

proc sql;

connect to oracle (user=DWH
                   password=<password>
                   path=GEC);

execute(execute myproc('CLIENTE', 11)) by oracle;

disconnect from oracle;

quit;

i would sobstitute the 11 value with a parameter but SAS doesn't recognize the syntax of the parameter, because "11" is in the Oracle code statement.

 

Super User
Posts: 5,849

Re: job in parallel

This sounds like a slightly different question, and it's best practice to create a new thread, that would make the issue and its potential solution easier to find.

 

That said, what have you tried?

I think a simple macro variable (parameter) evaluation would work.

Data never sleeps
Contributor
Posts: 31

Re: job in parallel

Before correcting the question in my last post, I've commented the user code in the Execute Transform.

 

I've tried to create the job_test:

job_test.png

But the loop runs for 2-3 minutes and ends in error.

 

The job cliente_NETAS_test ends without errors when Loop and Loop End are disabled.

 

The parameter details follow:

 

cliente_NETAS_test.png

loop.png

Contributor
Posts: 31

Re: job in parallel

The errors after runing the job is:

 

job_test.png

Ask a Question
Discussion stats
  • 6 replies
  • 201 views
  • 0 likes
  • 2 in conversation