Rama_V
Obsidian | Level 7

Hi Everyone,

 

In our organisation we have to create match codes for huge volumes of data (millions of records) for clustering.

 

I understand that you can create match codes and clusters using Data Management Studio and schedule the job on DM Server.

Currently, the development has been done as a SAS code job (using the DQMATCH function) that is scheduled on the server.

 

I can't determine which approach is best for improving performance on the same server. The SAS code job takes hours to run.

 

So I am wondering whether anyone has experienced a similar situation or has any information on this.

 

Any assistance would be really appreciated.

 

Thanks,

Rama

 

 

 

 

 

1 ACCEPTED SOLUTION

Accepted Solutions
Patrick
Opal | Level 21

Match code generation is a very resource-intensive process. It always uses DataFlux, whether you call the functionality from SAS or directly from a DF job, so I don't believe switching to a DF job will improve performance.

 

What you could do to improve end-to-end runtimes:

1. Set up parallel jobs, each creating match codes for a subset of your source data.

2. Design and implement delta processing so that you don't re-create all match codes on every run, but only create match codes for new or changed records.
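
The two suggestions above can be sketched together in a tool-agnostic way. The following is a minimal illustration in Python, not the actual SAS or DataFlux implementation: the record layout (`id`, `name`), the stored fingerprint dictionary, and `placeholder_match_code` are all hypothetical. In a real job the placeholder would be the existing match-code call (for example the DQMATCH step), and each partition would more likely be a separate scheduled job or SAS session than a thread.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor


def row_fingerprint(record, fields):
    """Hash the fields that feed the match code, to detect changed records."""
    joined = "\x1f".join(str(record[f]) for f in fields)
    return hashlib.sha256(joined.encode("utf-8")).hexdigest()


def select_delta(records, known_fingerprints, key, fields):
    """Keep only records that are new or whose match-relevant fields changed."""
    delta = []
    for rec in records:
        if known_fingerprints.get(rec[key]) != row_fingerprint(rec, fields):
            delta.append(rec)
    return delta


def partition(items, n_parts):
    """Split items into n_parts roughly equal slices (round-robin)."""
    return [items[i::n_parts] for i in range(n_parts)]


def placeholder_match_code(record):
    # HYPOTHETICAL stand-in for the real match-code call; it only
    # upper-cases the name and strips non-alphanumeric characters.
    return "".join(ch for ch in record["name"].upper() if ch.isalnum())


def process_partition(part):
    """Create match codes for one subset of the data."""
    return {rec["id"]: placeholder_match_code(rec) for rec in part}


def run_delta_in_parallel(records, known_fingerprints, n_jobs=4):
    """Match-code only new or changed records, split across n_jobs workers."""
    delta = select_delta(records, known_fingerprints, key="id", fields=["name"])
    results = {}
    with ThreadPoolExecutor(max_workers=n_jobs) as pool:
        for partial in pool.map(process_partition, partition(delta, n_jobs)):
            results.update(partial)
    return results
```

The design point is that the fingerprint covers only the columns that feed the match code, so unrelated column updates do not trigger regeneration. The fingerprints (or an equivalent last-modified timestamp) would need to be stored alongside the match codes after each run.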

 

 

Rama_V
Obsidian | Level 7

Thanks Patrick.

 

I really appreciate your inputs.

 
