03-08-2013 04:34 AM
I have a general question abour proc compare.
Is the output from proc compare correct when there is no row level unique id associated with the data but I do use a non-unique column as the ID? For example, if I have an Adverse Event dataset where there is no unique id defined but I use subject number as the ID - is the result of using subject number as the ID dependable? I can also manufacture an ID based on subject number,AE text and start date but this is not guarenteed to be unique 100% of the time.
Is there a way to create an ID across both base/compare datasets to resolve this problem?
03-08-2013 07:22 AM
Don't use ID statement. PROC COMPARE will compare record by record as if you said ID _N_; sort of.
If you are doing what I think you are, QCing ADaM data, I would use record level compare. No ID statement.