07-03-2017 04:16 PM
I want to find observations which have same attitudes by TRD_EVENT_DT, TRD_STCK_CD, TRD_EVENT_TM, TRD_PR, TRD_TUROVR variables.
TRD_EVENT_DT = Date
TRD_STCK_CD = Name
TRD_EVENT_TM = Time
TRD_PR = Price
TRD_TUROVR = Turnover
I use code below:
proc sort data=sampledata out=sampledata01; by TRD_EVENT_DT TRD_STCK_CD TRD_EVENT_TM TRD_PR TRD_TUROVR; run; * Using PROC FREQ to find duplicate observations and route them to an output data set with multiple key variables; proc freq data = sampledata01 noprint ; by TRD_EVENT_DT TRD_STCK_CD TRD_EVENT_TM TRD_PR TRD_TUROVR; table TRD_EVENT_DT TRD_STCK_CD TRD_EVENT_TM TRD_PR TRD_TUROVR / out = sampledata_DUP (keep = TRD_EVENT_DT TRD_STCK_CD TRD_EVENT_TM TRD_PR TRD_TUROVR Count where = (Count > 1)) ; run;
Is this code true?
If not, Please tell me which statement is wrong.
Thanks in advance.
07-03-2017 04:23 PM
You should use PROC SORT to actively identify your duplicates.
If youre using PROC FREQ you need to remove the BY and add * between the variables.
This finds duplicates across age and sex.
Proc freq data = SASHELP.class; Table sex*age / out = record_count list; Run;
Need further help from the community? Please ask a new question.