I have got a dataset and I am trying to find values in a character variable that might have been a new addition between each month.
I know that a proc freq can list out the values and the frequency for a variable, but can someone please help to provide some code that would output the new values that are different when comparing two months together (as there are over 100 values in the variable, so it would be difficult to manually compare which values are new for the most recent month vs the previous month)?
The screenshot below is just an extract example (there are over 100 values in the variables that I am looking at), but what would be the best way to:
1. Create an output where it shows any new values that are new when comparing month-to-month. In the example below, the output would output "Orange" (and possibly show how many times it appears), as "Orange" appears for a person in May but not in April.
2. Output the differences in the values between the two months (i.e. highlight any colours that do not appear in both, as there are also some variables in my dataset where there will be one colour in one month but not the other, then a different colour in the other month vs the month before)
proc sql;
    select distinct variable from dataset where month=<month value>
    except 
    select distinct variable from dataset where month=<previous month value>
    ;
quit;
Might be one way. The "except" reports values, or combinations of variable values found in the first set but not the second.
It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.
