Hello all,

I am having trouble creating binary variables from a “select all that apply” survey question so that I can run a logistic regression. The question asks about factors that inhibit health care use, and respondents could choose more than one of 8 possible answer choices (e.g., “Concerned about quality of care,” “Concerned about privacy,” etc.).

I would like to combine certain answer choices and create binary variables to represent them. For example, the first binary variable would represent concerns about quality of care: it would be coded 1 for any response that contains answer choice (1), (2), or both, and 0 for all other responses (i.e., no concerns about quality of care). The second binary variable would represent concerns about privacy: it would be coded 1 for any response that contains (3), (4), or both, and 0 for all other responses (i.e., no concerns about privacy).

My goal is complicated by the fact that some respondents selected choices (1), (2), (3), and/or (4) at once. As an example, here is a snippet of what the raw frequencies look like:

I was able to separate each answer choice into its own variable using the following code:

data &health;
set &health;
array q301_[8] ;
do index=1 to 8;
q301_[index]=0 ne findw(q301,cats(index),',','t');
end;
drop index;
run;

Some of the output was as follows:

However, given that this is “select all that apply”, I’m not sure how to manipulate the data to create binary variables that combine answer choices as described above. Is this possible?
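
For illustration, here is a minimal sketch of one way the combined variables described above might be built from the per-choice indicators. It is a sketch under assumptions, not a definitive solution: the dataset names health_demo and health_flags, the sample rows, and the variable names quality_concern and privacy_concern are all hypothetical, and q301 is assumed to hold comma-separated choice numbers as in the code above.

```sas
/* Hypothetical sample data; q301 holds comma-separated choices (assumed layout) */
data health_demo;
  input id q301 :$20.;
  datalines;
1 1,3
2 2
3 3,4
4 5,8
5 1,2,3,4
6 7
;
run;

data health_flags;
  set health_demo;

  /* Same step as in the post: one 0/1 indicator per answer choice (q301_1-q301_8) */
  array q301_[8];
  do index = 1 to 8;
    q301_[index] = 0 ne findw(q301, cats(index), ',', 't');
  end;

  /* Hypothetical combined flags: 1 if either choice in the group was selected, else 0 */
  quality_concern = (q301_1 = 1 or q301_2 = 1);
  privacy_concern = (q301_3 = 1 or q301_4 = 1);

  drop index;
run;

/* Cross-tabulate the two flags to check the coding */
proc freq data=health_flags;
  tables quality_concern*privacy_concern;
run;
```

In this sketch the two flags are computed independently, so a respondent who selected, say, (1) and (3) gets a 1 on both variables rather than being forced into one group. Note also that a respondent with a blank q301 would be coded 0 on both flags; whether such cases should be set to missing instead is a separate decision.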