Hi exports,
I have a table which contain 5 dummy variables. How could I drop any variable if number of "1" is less than 10% .
something like:
data tmp;
set df;
array var(i) var1--var5;
IF sum(var)/total le 0.1 THEN drop;
run;
There might be simpler ways, but here is how I would do it:
proc summary nway data=df;
var var1-var5;
output out=_means_ mean=;
run;
proc transpose data=_means_(drop=_type_ _freq_) out=_means_t;
run;
proc sql;
select _name_ into :names separated by ' ' from _means_t
where col1<=0.1;
quit;
data want;
set df(drop=&names);
run;
There might be simpler ways, but here is how I would do it:
proc summary nway data=df;
var var1-var5;
output out=_means_ mean=;
run;
proc transpose data=_means_(drop=_type_ _freq_) out=_means_t;
run;
proc sql;
select _name_ into :names separated by ' ' from _means_t
where col1<=0.1;
quit;
data want;
set df(drop=&names);
run;
This works perfect for me, thanks a lot!
Registration is open! SAS is returning to Vegas for an AI and analytics experience like no other! Whether you're an executive, manager, end user or SAS partner, SAS Innovate is designed for everyone on your team. Register for just $495 by 12/31/2023.
If you are interested in speaking, there is still time to submit a session idea. More details are posted on the website.
Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.
Find more tutorials on the SAS Users YouTube channel.