Hello everyone,
I am having an issue selecting at subset of observations. I want to select based on the condition that each participant for each event, the subset have 3 or more values (>= .02) of a particular variable (VAR).
For example,
PID Event VAR
1923 1 0
1923 1 .02
1923 1 .03
1923 1 .04
1923 1 .002
1923 1 0
2003 1 0
2003 1 .02
2003 1 .06
2003 1 .07
2003 1 .003
2003 1 0
So in this example, both participants would be selected because they have 3 or more observations in VAR that are >= .02. Please note that each participant has multiple events so I need to check each event for each participant.
Any direction would be greatly appreciated!
Thank you!
Hi,
Something along these lines:
data have;
attrib PID Event VAR format=best.;
input pid event var;
datalines;
1923 1 0
1923 1 .02
1923 1 .03
1923 1 .04
1923 1 .002
1923 1 0
;
run;
proc sql;
create table WANT as
select *
from (select PID,EVENT,VAR,case when VAR > 0.02 then "Y" else "N" end as FLAG from HAVE)
group by PID,EVENT
having COUNT(FLAG)>1;
quit;
data have;
input PID Event VAR;
datalines;
1923 1 0
1923 1 .02
1923 1 .03
1923 1 .04
1923 1 .002
1923 1 0
2003 1 0
2003 1 .02
2003 1 .06
2003 1 .07
2003 1 .003
2003 1 0
;
proc sql;
select * from have
group by pid,event
having sum(var>=0.02)>=3;
quit;
Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.