Hi everyone, I am trying to join tables again using the code below. userlist has 300,000 observations while utilization has about 1,000,000 observations: proc sql;
create table WORK.MERGED as
select t1.*,
t2.usage,
t2.price
from WORK.userlist t1
left join WORK.utilization t2
on t1.ID=t2.clientID;
quit; However, MERGED had around 300,100 observations in the output. I found that there were duplicates in the MERGED table by using another code such as this, to eliminate them. The output did show the 100 dupes: proc sort data=WORK.MERGED nodupkey dupout=dups_checking;
by _all_;
run; Now, this works fine for now. Here are my questions: 1. I made sure that there are no duplicates in both the left and right table. So why is it still producing duplicates? 2. When I ran code block 1 with another set of table, I still had duplicates. When I ran code block 2, it did show the number of duplicates but I am still not getting the same amount of observations in the original table and the output table. Any insights on this problem?
... View more