About ncnickel

ncnickel · ‎07-19-2011

Hello, I originally posted this in the Data Step section and was advised to post it here in the Statistical Procedures section. I have a set of seven dichotomous indicator variables: Step_1 (1=exposed, 0=not exposed) Step_2 (1=exposed, 0=not exposed) Step_3 (1=exposed, 0=not exposed) Step_4 (1=exposed, 0=not exposed) Step_5 (1=exposed, 0=not exposed) Step_6 (1=exposed, 0=not exposed) Step_7 (1=exposed, 0=not exposed) Each of these Steps is a different hospital policy. Currently, hospitals will only receive recognition for having all Steps in place (it is an all or nothing deal. If you have all Steps, you get credit; if you're missing even just 1 Step, you get 0 credit). Having all Steps in place is associated with improved health outcomes. However, it is a huge barrier to have ALL Steps in place. There is reserach out there to suggest that increased numbers of Steps in place is associated with improved health (e.g., All steps is better than 6 Steps which is better than 5 Steps which is better than 4 Steps, etc...). Some States are using this information to start programs where they will recognize hospitals for each additional 2 Steps they have in place. That is, a hospital will receive 1 star for having 2 Steps in place, 2 stars for having 4 Steps in place, 3 stars for having 6 Steps in place. The problem, though, is we don't know which combinations of 2 Steps to prioritize, meaning are there certain combinations of 2 steps that have a larger impact than other combinations of 2 Steps? Our institute's research question then is, "Which combination of 2 Steps is associated with the greatest improvement in health, which combination of 2 Steps is associated with the 2nd greatest improvement in health, which combination is associated with the 3rd greatest and so forth.?" This informaton is to be used by these State programs so they can tell hospitals which Steps to prioritize (meaning which combinations of 2 Steps give the biggest bang in terms of health improvement). In order to identify which combination has the greatest impact we would like to create indicator variables for each of the different combinations of 2 Steps. And then use these indicator variables to identify the associated health impact. e.g., say combination1 is exposed to Step_1 and Step_2. What is the effect of being exposed to combination 1 (i.e., exposed to BOTH Step_1 AND Step_2) as compared with not being exposed to combination 1 (i.e., exposed to NEITHER Step_1 nor Step_2)? Ideally, we would like to create two types of combination variables: 1) Exposed to both Steps in the combination without regard to exposure to Other Steps meaning you could have Step1=1, Step2=1 and then any exposure status for Step3 through Step7 and 2) Exposed to both Steps in the combination and ONLY exposure to those two Steps meaning you would have Step1=1, Step2=1, and then Step3=0, Step4=0, Step5=0, Step6=0, Step7=0 Is there a way to operationalize this simply in SAS without writing out the code for each and every combination? I came across CALL COMB -type commands and saw how they were used to create observations with different combinations of names, but as a new SAS user I am struggling to see how to extend this to creating new indicator variables out of old indicator variables.

ncnickel · ‎07-19-2011

Unfortunately, a regression model (e.g., proc logistic) really doesn't model the potential outcomes from a causal inference framework (thinking in terms of say a propensity score anlaysis versus a standard regression). We want to create a set of treatment variables: exposed to combination1 vs. not exposed to combination1 exposed to combinatino2 vs. not exposed to combination2 exposed to combination3 vs. not exposed to combination3 to see which has the largest impact in combintation. We can interact Step Exposure in a nested do loop to create combinations for situation (2) but we're still facing a challenge for situation (1).

ncnickel · ‎07-19-2011

Thank you for responding art297. I appreciate your focusing questions "Are you sure you have asked for what you really want to achieve?" and "What are you hoping to actually achieve?" Just a quick background: Each of the Steps I've referenced above are a different hospital-based policy. Currently, hospitals will only receive recognition for having all Steps in place (it is an all or nothing deal. If you have all Steps, you get credit; if you're missing even just 1 Step, you get 0 credit). Having all Steps in place is associated with improved health outcomes. However, it is a huge barrier to have ALL Steps in place. There is reserach out there to suggest that increased numbers of Steps in place is associated with improved health (e.g., All steps is better than 6 Steps which is better than 5 Steps which is better than 4 Steps, etc...). Some States are starting programs where they will recognize hospitals for each additional 2 Steps they have in place. That is, a hosptial will recieve 1 star for having 2 Steps in place, 2 stars for having 4 Steps in place, 3 stars for having 6 Steps in place. The problem, though, is we don't know which combinations of 2 Steps to prioritize, meaning are there certain combinations of 2 steps that have a larger impact than other combinations of 2 Steps? Our research question then is, "Which combination of 2 Steps is associated with the greatest improvement in health, which combination of 2 Steps is associated with the 2nd greatest improvement in health, which combination is associated with the 3rd greatest and so forth.?" This informaton is to be used by these State programs so they can tell hospitals which Steps to prioritize (meaning which combinations of 2 Steps give the biggest bang in terms of health improvement). We're using a potential outcomes framework: so having two steps in place as compared with having zero steps in place. And which combination of 2 gives the greatest improvement over 0 Steps in place. We want the other combinations so that we could then say ok, once you have these 2 Steps in place (say Step 4 and Step 7 have the biggest impact), then you should go for this next combination of 2 (say Step 1 and Step 3). So, we think that what we want is the various combinations of 2 Steps, but we could be wrong.

ncnickel · ‎07-19-2011

I have a set of seven dichotomous indicator variables: Step_1 Step_2 Step_3 Step_4 Step_5 Step_6 Step_7 where each variable is set to 1 if the respondent is exposed to the Step and 0 if the respondent is not exposed to the Step. Exposure is NOT exclusive so that a respondent can be exposed to both Step_1 and Step_2. I want to create indicator variables that are for various combinations of (7, 2) e.g., Step_1 and Step_2; Step_1 and Step_3; Step_1 and Step_4; . . . Step_6; and Step_7. (7, 3) e.g., Step_1, Step_2, and Step_3; Step_1, Step_2, Step_4; ... Step_5, Step_6, and Step_7 (7, 4) (7. 5) (7, 6) I am attempting to create 2 types of indicator variables: 1) the combination is exlusive such that for each combination the data would look like: Step_1=1 AND Step_2=1 AND STEP_3=0 AND STEP_4=0 AND STEP_5=0 AND STEP_6=0 AND STEP_7=0 and 2) the combination is not exclusive so that I just want to see if the respondent is exposed to the 2 steps of interest irregardless of whether or not the respondent is exposed to other Steps: e.g., all that matters is that Step_1=1 AND STEP_2=1... the other STEPs do not matter. I imagine I would need to use the CALL LEXCOMB command or another command similar. However, I am unsure how to execute it to call up the various combinations and generate a new variable for each.

Online Status	Offline
Date Last Visited	‎09-01-2015 07:11 AM

Question about creating indicator variables

Re: Creating indicator variables that are combinations of other indica...

Creating indicator variables that are combinations of other indicators

Creating indicator variables that are combinations of other indicators

Question about creating indicator variables

Re: Creating indicator variables that are combinations of other indica...

Creating indicator variables that are combinations of other indicators

Creating indicator variables that are combinations of other indicators