you probably have some id va.uess with 2 or more rows in both datasets. suppose you have id=99 twice in both datasets - SQL will form the cartesian join of these (4 output records). datastep will advance each by group until the end of the shorter one, and then hold that row until the end of the longer one (2 output records)
... View more