BookmarkSubscribeRSS Feed
AlmavivaSAS
Calcite | Level 5

Hi, I'm starting to use DataFlux. I need to find duplicates in table. The documentation suggests to use the following nodes in a data job:
-Match code
-Clustering
-Surviving Record Identification
-Entity Resolution File Output
I don't understand how to configure these nodes and if this way is right to find duplicates and after to delete them.
Can you help me? Thank you

1 REPLY 1
AhmedAl_Attar
Rhodochrosite | Level 12

Hi,

 

Check this SAS Global Forum paper How to Find Your Perfect Match Using SAS® Data Management

It touches on some of the topics you are asking about, beyond this, you'll have to find your way through the Online Docs to get more information about each individual node.

 

Good luck,

Ahmed

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

How to connect to databases in SAS Viya

Need to connect to databases in SAS Viya? SAS’ David Ghan shows you two methods – via SAS/ACCESS LIBNAME and SAS Data Connector SASLIBS – in this video.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 963 views
  • 0 likes
  • 2 in conversation