I am trying to merge a table that has values in the TIER column with a new table I received that updates some of the TIER fields. To get a quick look at which values changed and which didn't, I wanted a simple table comparing the two, but when I merged the data sets, the number of records increased by 13, and I am trying to figure out why. I also noticed the note in the log saying that more than one data set has repeats of BY values, so that is probably part of the reason. The New_Tiers data set has only two variables: ID and Tier_Update. Here is the log:
data portal_&begin_date._&end_date._b1;
    merge portal_&begin_date._&end_date._b (rename = (tier=tier1) in=in1)
          New_Tiers (rename = (tier_update=tier2) in=in2);
    by ID;
    if in1;
run;
NOTE: MERGE statement has more than one data set with repeats of BY values.
NOTE: There were 1085757 observations read from the data set WORK.PORTAL_090109_103110_B.
NOTE: There were 617 observations read from the data set WORK.NEW_TIERS.
NOTE: The data set WORK.PORTAL_090109_103110_B1 has 1085770 observations and 71 variables.
Any explanation of the repeated BY values and the 13 additional records would be greatly appreciated.
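For reference, here is a check I could run to see which IDs are repeated in each input, as a minimal sketch only. As far as I understand MERGE, each BY group in the output gets as many rows as the larger of the two inputs has for that ID, so any ID that appears more often in New_Tiers than in the portal table would add rows even with `if in1;`. The output table names dup_portal and dup_new_tiers and the column n_rows are made up for illustration.

```sas
/* List IDs that occur more than once in each input data set.      */
/* dup_portal, dup_new_tiers and n_rows are hypothetical names.    */
proc sql;
    /* Repeated IDs in the main portal table */
    create table dup_portal as
    select ID, count(*) as n_rows
    from portal_&begin_date._&end_date._b
    group by ID
    having count(*) > 1;

    /* Repeated IDs in the table of updated tiers */
    create table dup_new_tiers as
    select ID, count(*) as n_rows
    from New_Tiers
    group by ID
    having count(*) > 1;
quit;
```

Comparing n_rows for the IDs that show up in dup_new_tiers against their counts in the portal table should show whether those groups account for the 13 extra records.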
Message was edited by: JonathanWarrick