Old contains let's say the first run using the hash lookup solution you created (thanks again for that one!). So that one created this household grouping household 1 - {1,4} household 2 - {4,5} household 3 - {6,7} In the future I would like to execute this household exercise again, chances are the household ids we used will not be the same as its incremental, so that new run will have something like this. household 2 - {1,2,3} household 4 - {5,6,7} household 5 - {8,9} (new household not existing in the previous run) Next thing I want to do is to match the original result and the new result. using the household ids there as still the "real ids". household 1 - {1,2,3} household 2 - {4} household 3 - {5,6,7} (id 5 will have its household replaced from 2 to 3) household 4 - {8,9} (inserted to the original set, then getting the max + 1 as household id) If you're questioning why 5 got moved to a different household, this is an example that there was no links between 4 and 5, but 5,6,7 had so it moved to a different household. Sorry if my explanation is a bit confusing.
... View more