Hi exports,
I have a table which contain 5 dummy variables. How could I drop any variable if number of "1" is less than 10% .
something like:
data tmp;
set df;
array var(i) var1--var5;
IF sum(var)/total le 0.1 THEN drop;
run;
There might be simpler ways, but here is how I would do it:
proc summary nway data=df;
var var1-var5;
output out=_means_ mean=;
run;
proc transpose data=_means_(drop=_type_ _freq_) out=_means_t;
run;
proc sql;
select _name_ into :names separated by ' ' from _means_t
where col1<=0.1;
quit;
data want;
set df(drop=&names);
run;
There might be simpler ways, but here is how I would do it:
proc summary nway data=df;
var var1-var5;
output out=_means_ mean=;
run;
proc transpose data=_means_(drop=_type_ _freq_) out=_means_t;
run;
proc sql;
select _name_ into :names separated by ' ' from _means_t
where col1<=0.1;
quit;
data want;
set df(drop=&names);
run;
This works perfect for me, thanks a lot!
April 27 – 30 | Gaylord Texan | Grapevine, Texas
Walk in ready to learn. Walk out ready to deliver. This is the data and AI conference you can't afford to miss.
Register now and save with the early bird rate—just $795!
Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.