I have a simple problem that I can solve in R easily but just can't figure out how to solve in SAS.
I have a data set where there are numerous diagnostic results concatenated into one long string for each target being identified by the assay.
I simply want to identify subjects that are positive for more than one target.
Here is an example data set.
data test;
input subject_id $1-8
result $11 - 33;
datalines;
ABCD0001 A POS B POS C NEG D NEG
ABCD0002 A POS B NEG C NEG D NEG
ABCD0003 A NEG B NEG C NEG D NEG
ABCD0004 A POS B NEG C POS D NEG
;
run;
Ideally, I'd write code that would return this data set with a flag for subjects ABCD0001 and ABCD0004 because they are positive for more than one target in the 'result' variable.
Thanks for anyone's help!
Hello @abaker_ca and welcome to the SAS Support Communities!
You can use the COUNT function to count the number of "POS" substrings in variable RESULT:
data want;
set test;
flag=count(result,'POS','i')>1;
run;
The optional 'i' modifier makes the search case-insensitive so that "pos", "Pos", etc. would be counted as well. The numeric variable FLAG will contain the value 1 or 0 if the inequality count(...)>1 is true or false, respectively.
To avoid this kind of scenario :
ABCD0001 APOSA POS BNEGB POS C NEG D NEG
You could try this one :
data test;
input subject_id $ result $20.;
datalines;
ABCD0001 A POS B POS C NEG D NEG
ABCD0002 A POS B NEG C NEG D NEG
ABCD0003 A NEG B NEG C NEG D NEG
ABCD0004 A POS B NEG C POS D NEG
;
run;
data want;
set test;
if findw(result,'POS',' ') ne findw(result,'POS',' ','b') then flag='Y';
run;
It's unlikely they would. For the most part the string in this variable would be a disease target followed by the pos/neg call for the target (e.g. "Flu A POS").
Count the occurrences of ' POS '. You might need to append a space to the string if there is a value that completely fills the variable.
data want;
set test;
n_positive = count(result||' ',' POS ');
run;
Note that COUNT() has an optional third argument you can use to make the string comparisons case insensitive.
n_positive = count(result||' ',' POS ','i');
Registration is now open for SAS Innovate 2025 , our biggest and most exciting global event of the year! Join us in Orlando, FL, May 6-9.
Sign up by Dec. 31 to get the 2024 rate of just $495.
Register now!
SAS' Charu Shankar shares her PROC SQL expertise by showing you how to master the WHERE clause using real winter weather data.
Find more tutorials on the SAS Users YouTube channel.