BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
Rama_V
Obsidian | Level 7

Hi Everyone,

 

In our organisation we have to create match codes for huge to data(million records) for clustering.

 

I can understand you can create match codes and clustering using Data Management Studio and schedule on DM Server.

Currently, the development has been done on SAS code(using DQmatch function) job and scheduled on the server.

 

I can't justify which one is best approach to improve performance on same server. Using SAS code is taking hours to run.

 

So I am just wondering has anyone experience similar situation or have any information around this.

 

Please any assistance would be really appreciated.

 

Thanks,

Rama

 

 

 

 

 

1 ACCEPTED SOLUTION

Accepted Solutions
Patrick
Opal | Level 21

Match code generation is a very resource intensive process. It always uses DataFlux whether you call this functionality now out of SAS or directly out of a DF job so I don't believe going for a DF job will improve performance.

 

What you could do to improve end-to-end runtimes:

1. Set-up parallel jobs each creating match-codes for a sub-set of your source data

2. Design and implement delta processing so don't re-create all match codes every single time but only create match codes for new or changed records.

 

 

View solution in original post

2 REPLIES 2
Patrick
Opal | Level 21

Match code generation is a very resource intensive process. It always uses DataFlux whether you call this functionality now out of SAS or directly out of a DF job so I don't believe going for a DF job will improve performance.

 

What you could do to improve end-to-end runtimes:

1. Set-up parallel jobs each creating match-codes for a sub-set of your source data

2. Design and implement delta processing so don't re-create all match codes every single time but only create match codes for new or changed records.

 

 

Rama_V
Obsidian | Level 7

Thanks Patrick.

 

I really appreciate your inputs.

 

sas-innovate-2024.png

Available on demand!

Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.

 

Register now!

How to connect to databases in SAS Viya

Need to connect to databases in SAS Viya? SAS’ David Ghan shows you two methods – via SAS/ACCESS LIBNAME and SAS Data Connector SASLIBS – in this video.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 735 views
  • 0 likes
  • 2 in conversation