01-21-2015 10:53 AM
Hello to everyone!
Does anybody know if there is a way to group rows that are duplicated in a data set? For example, to put them at the end, or mark them with a color, or create another data set with the duplicated rows.
I appreciate any kind of help!
01-21-2015 12:07 PM
Hello Linus! Thank you so much for your reply!
I'm not sure what PROC SORT does. I'm fairly new to SAS Enterprise Guide and only know how to use some of the tasks, so I guess you are talking about programming. If that is the right path to take, could you point me to a good tutorial for beginners, please?
Anyway, the data is just a matrix with 10 columns: some columns hold dates, others strings, and others numbers. I don't know if that is enough information. How should I send sample data?
01-21-2015 12:19 PM
UCLA operates a great site for individuals wanting to learn SAS. Here is the URL: http://www.ats.ucla.edu/stat/sas/default.htm
They have nice tutorials and movies to watch.
SAS also has free training. Check this link: SAS Training Starting Points. Look for "Free Training at Your Fingertips".
01-21-2015 01:25 PM
To answer your question: the SORT procedure (written as PROC SORT in the SAS editor window) sorts the observations in your data set by any number of variables. You mentioned dates, so you could sort the data by a date variable using a BY statement. The NODUPKEY option then keeps only the first record for each date in the sorted data set, and the DUPOUT= option mentioned above copies the removed duplicate records into a new data set. Here is an example:
proc sort data=file dupout=dup nodupkey;
by date;
run;
Note that the BY statement (second line) can list any number of variables in your data set, not just the date. Also, because there is no OUT= option here, the sorted, de-duplicated result replaces the original data set.
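If it helps, here is a minimal sketch of two other ways to separate duplicated rows. All data set and variable names (work.have, work.unique, work.dups, work.sorted, work.single, work.repeated, date, name, amount) are made up for illustration, so replace them with your own. The first step treats a row as a duplicate only when every column matches. The second puts all rows that share a date, including the first copy, into their own data set.

```sas
/* Hypothetical sample data: 5 rows, one exact duplicate row */
data work.have;
  input date :date9. name $ amount;
  format date date9.;
  datalines;
21JAN2015 Ann 10
21JAN2015 Ann 10
22JAN2015 Bob 20
23JAN2015 Cat 30
23JAN2015 Dan 40
;
run;

/* 1. Whole-row duplicates: BY _ALL_ compares every variable.        */
/*    OUT= writes the result to a new data set, leaving work.have    */
/*    untouched; DUPOUT= receives the removed copies.                 */
proc sort data=work.have out=work.unique dupout=work.dups nodupkey;
  by _all_;
run;

/* 2. Keep every row of a repeated date together (first copy too).   */
proc sort data=work.have out=work.sorted;
  by date;
run;

data work.single work.repeated;
  set work.sorted;
  by date;
  /* A date that occurs once is both the first and last in its group */
  if first.date and last.date then output work.single;
  else output work.repeated;
run;
```

With this sample data, work.dups should end up with the one repeated Ann row, and work.repeated with the four rows dated 21JAN2015 and 23JAN2015. In SAS Enterprise Guide you can paste this into a new Program window and run it.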
Hope that helps,