I need to create a duplicate-record report. I have code for it, but it fails because the data is huge: more than 200 columns and more than 100 million records.

This is what I'm trying to produce:

column_names    duplicate_values
a               No
b               Yes
c               Yes
d               N/A
e               No
f               No
g               No
h               Yes
i               N/A
j               Yes

If a column contains duplicate values, it's "Yes". If it contains no duplicate values at all, it's "No". If the column holds a single value for all records, or is 100% missing, it's "N/A".

As I said, my code does the job, but only when the record count is under a million. This is the step where it runs out of memory:

proc freq data = dup nlevels;
  ods output nlevels = duplevels;
run;

Any help would be appreciated. Thanks!