- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
hi,
I am havig the following dataset
Customer Month Type Amount
A1 03JAN2004 PETRO 2000
A1 03JAN2004 PETRO 2000
A1 12JAN2004 JEWELLWERY 1000
A1 12JAN2004 JEWELLWERY 1000
Iam not getting the code for taking the duplicate values out in a separate dataset. Iam using this code
Proc sort data=valid_cards1 nodupkey nodopout=duplicates;
by customer month type amount;
run;
Can anybody help?
Thanks & Regards
Mona
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
NODUPKEY option: Deletes observations with duplicate BY values
DUPOUT= option: Specifies the output data set to which duplicate observations are written.
I think your code has some misspelling and should be as this:
data valid_cards1;
input Customer$ Month:date9. Type$ Amount;
format Month date9.;
datalines;
A1 03JAN2004 PETRO 2000
A1 03JAN2004 PETRO 2000
A1 12JAN2004 JEWELLWERY 1000
A1 12JAN2004 JEWELLWERY 1000
;
Proc sort data=valid_cards1 nodupkey dupout=duplicates;
by customer month type amount;
run;
See the available options in the documentation here
https://support.sas.com/documentation/cdl/en/proc/61895/HTML/default/viewer.htm#a000146878.htm
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
Proc sort data=valid_cards out=valid_cards_clean dupout=duplicates nodupkey;
By customer month type amount;
Format month date9.;
Run;
You didn't used outfile option