Hi, I am a SAS beginner and I am looking for a good guide or workflow material that I can adapt into a robust data cleaning and management procedure, including documentation. If anyone has any suggestions, I would be very grateful. Thank you.
SAS has a product for data cleaning called SAS Data Quality. Unfortunately, I doubt you would have it available to you and the documentation does not include "how to" guides for particular techniques.
I suggest you start with one use case, say cleaning one column. The first step would be to analyse the data in the column and, if it is a character column, report on its distinct values like this:
/* Count each distinct value of MyColumn, including missing values */
proc freq data = MyData;
    table MyColumn / missing;
run;
Then compare the report with the values you expect that column to have, so that you can identify which values are errors.
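Once you know the expected values, one way to automate that comparison is with a user-defined format. This is only a minimal sketch: the format name $validfmt, the valid values 'A', 'B' and 'C', and the output data set MyErrors are placeholders I made up for illustration, so substitute whatever applies to your column.

```sas
/* Hypothetical list of valid values: replace 'A', 'B', 'C' with your own */
proc format;
    value $validfmt
        'A', 'B', 'C' = 'Valid'
        ' '           = 'Missing'
        other         = 'Invalid';
run;

/* Summarise the column as Valid / Missing / Invalid using the format */
proc freq data = MyData;
    table MyColumn / missing;
    format MyColumn $validfmt.;
run;

/* Keep only the rows whose value is not in the expected list */
data MyErrors;
    set MyData;
    if put(MyColumn, $validfmt.) = 'Invalid';
run;
```

The MyErrors data set then gives you the rows to investigate, and keeping the PROC FORMAT step in your program documents what you treated as valid for that column.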
A book that may be of interest: https://support.sas.com/en/books/reference-books/codys-data-cleaning-techniques-using-sas.html
Some excerpts are available for download, but the full book must be purchased.