Hi,
I'm just starting to use DataFlux and I need to find duplicates in a table. The documentation suggests using the following nodes in a data job:
- Match code
- Clustering
- Surviving Record Identification
- Entity Resolution File Output
I don't understand how to configure these nodes, or whether this is the right way to find duplicates and then delete them. Can you help me?
Thank you
It touches on some of the topics you are asking about. Beyond this, you'll have to work through the online documentation to get more information about each individual node.
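If it helps to see the idea behind the node sequence, here is a rough conceptual sketch in Python. It is not DataFlux code and does not use any DataFlux API: the function names, the sample records, and the "keep the most recently updated record" rule are all made up for illustration, and the match code here is a crude normalization, whereas real match codes are fuzzier.

```python
from collections import defaultdict


def match_code(name: str) -> str:
    """Hypothetical stand-in for a match code: uppercase, letters only."""
    return "".join(ch for ch in name.upper() if ch.isalpha())


def cluster(records):
    """Group records that share the same match code (the clustering step)."""
    clusters = defaultdict(list)
    for rec in records:
        clusters[match_code(rec["name"])].append(rec)
    return list(clusters.values())


def survivor(group):
    """Pick one record to keep per cluster.

    Assumed rule for this sketch: keep the most recently updated record.
    """
    return max(group, key=lambda rec: rec["updated"])


# Made-up sample data; records 1 and 2 are the same person typed differently.
records = [
    {"id": 1, "name": "John Smith", "updated": "2020-01-05"},
    {"id": 2, "name": "john  smith.", "updated": "2021-03-10"},
    {"id": 3, "name": "Mary Jones", "updated": "2019-07-22"},
]

deduplicated = [survivor(group) for group in cluster(records)]
print([rec["id"] for rec in deduplicated])  # prints [2, 3]
```

So the match code makes similar values comparable, clustering groups the rows that share a code, and the survivor rule decides which row in each group is kept. The rows that are not survivors are the duplicates you would delete.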
Good luck,
Ahmed