data merge quality

Reply
N/A
Posts: 1

data merge quality

how does one check the quality of datamerge, when you have merged two large datasets?

thanks!

PROC Star
Posts: 7,360

data merge quality

Carefully!  I typically run the same code on a smaller versions of the datasets, and check the results.  Whenever possible, I also try to run both proc sql and datastep merges on the smaller versions of the datasets and test to ensure that I get the same results.

Super Contributor
Posts: 1,636

data merge quality

the most important things for me:

  1. Check the number of variables and observations in the final dataset to make sure that the numbers are the same as I expected.
  2. Rename variables to avoid overwriting.
  3. If BY variables have to be Unique.
Super User
Super User
Posts: 6,499

data merge quality

One company I know went so far as to require users to perform all merges using their SMERGE macro.  (Safe MERGE?)

The use had to specifiy whether to keep the duplicate variable names from the first or last dataset in the merge.  It would do run-time checks to trap many-to-many merges.

Ask a Question
Discussion stats
  • 3 replies
  • 197 views
  • 0 likes
  • 4 in conversation