Hello:
I would like to run a logistic model in the binary outcome (Y). In the multivariates, I only need the condition 1 and 2. Therefore, I use "and" to select all of them. Am I correct? Or should I use "or"? Thanks.
proc logistic data=test;
class Y (ref="1") X1 (ref="2") X2 (ref="2") X3 (ref="2") X4 (ref="2") X5 (ref="2");
model Y = X1 X2 X3 X4 X5;
where X1 in (1,2) and X2 in (1,2) and X3 in (1,2) and X3 in (1,2) and X3 in (1,2);
run;
quit;
Only you really know based on your logic. Make some test data and ensure it behaves the way you expect.
Scenarios to test:
X1, X2, X3, X4, X5 all are either 1/2
Any of X1-X5 is not a 1/2
All of X1-X5 is not a 1/2
You may need to break these out into smaller cases but you'll probably understand your logic better at that point.
Get more insight about your data with:
proc sql;
select
x1, x2, x3, x4, x5,
count(*) as n,
case
when X1 in (1,2) and X2 in (1,2) and X3 in (1,2) and X4 in (1,2) and X5 in (1,2) then "Yes"
else "No" end as included
from test
group by x1, x2, x3, x4, x5;
quit;
Don’t miss the livestream kicking off May 7. It’s free. It’s easy. And it’s the best seat in the house.
Join us virtually with our complimentary SAS Innovate Digital Pass. Watch live or on-demand in multiple languages, with translations available to help you get the most out of every session.
Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.