I would like to retain the cluster IDs assigned to customers in previous runs of a data flow in Data Management Studio 2.6.
I handled this in previous versions of DataFlux with the Cluster Update node in dfPower Architect 8.1/8.2. So far I haven't been able to determine whether that node is still an option or whether it has been replaced by a different node.
The Cluster Update node did not guarantee that the cluster IDs of already clustered rows would be preserved. Whether the existing cluster IDs change depends on whether any of the new rows match into multiple existing clusters.
The Cluster Update node has been deprecated; you can use the Cluster node instead.
In your clustering job, make sure the “old” records are passed into the Cluster node in identical order each time, followed by the new records. This guarantees that the “old” records retain their original cluster IDs, as long as the new records being added do not cause existing clusters to be joined together.
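To illustrate why the input order matters, here is a minimal Python sketch of a toy clustering model. It is not the algorithm the Cluster node actually uses; it only assumes that IDs are handed out in input order and that a record bridging two clusters merges them into the lower ID. The function name `assign_cluster_ids`, the match-key strings, and the record IDs are all hypothetical.

```python
def assign_cluster_ids(records):
    """Toy model (an assumption, not the Cluster node's real logic).

    records: list of (record_id, set_of_match_keys) in input order.
    Returns a dict mapping record_id -> cluster ID.
    """
    cluster_of_key = {}  # match key -> cluster ID that currently owns it
    members = {}         # cluster ID -> record IDs in that cluster
    next_id = 1
    for rec_id, keys in records:
        # Existing clusters this record matches into, lowest ID first.
        matched = sorted({cluster_of_key[k] for k in keys if k in cluster_of_key})
        if matched:
            target = matched[0]
            # A record matching several clusters joins them together.
            for other in matched[1:]:
                members[target].extend(members.pop(other))
                for key, cid in list(cluster_of_key.items()):
                    if cid == other:
                        cluster_of_key[key] = target
        else:
            # No match: start a new cluster with the next free ID.
            target = next_id
            next_id += 1
            members[target] = []
        members[target].append(rec_id)
        for key in keys:
            cluster_of_key[key] = target
    return {rec: cid for cid, recs in members.items() for rec in recs}


# Hypothetical "old" records, always passed in this same order.
old = [
    ("A", {"smith|10 main"}),
    ("B", {"smith|10 main"}),
    ("C", {"jones|5 oak"}),
    ("D", {"lee|9 elm"}),
]
baseline = assign_cluster_ids(old)

# New records that match at most one existing cluster each.
safe_new = [("E", {"jones|5 oak"}), ("F", {"wu|1 pine"})]
after_safe = assign_cluster_ids(old + safe_new)

# A new record that matches into two existing clusters at once.
bridging_new = [("G", {"smith|10 main", "jones|5 oak"})]
after_bridge = assign_cluster_ids(old + bridging_new)

# Old records whose cluster ID differs from the baseline run.
changed = {r for r in baseline if after_bridge[r] != baseline[r]}
```

In this toy model the old records keep their IDs when `safe_new` is appended, while the bridging record `G` pulls `C` into the cluster of `A` and `B`, so `C` no longer has its original ID. Reordering `old` between runs would also hand out the IDs differently, which is why the order has to stay identical.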