BookmarkSubscribeRSS Feed
viollete
Calcite | Level 5

Hi!

 

I have 15  variables, each variable has just two values: 0 and 1.

 

I want to creat another variable (values 0 and 1) which would indicate that from these 15 variables at least 6 have values 1 (or at least 7 and so on)?

 

thanks

6 REPLIES 6
LinusH
Tourmaline | Level 20

I don't know what you are going to use this for, but to me it sounds like it would be easier if you transpose your data, then you could just use SQL to sum/count for each id how many '1' you have.

Another benefit is that you can add more "variables" whenever you want, without changing any logic or change the structure of your data.

Data never sleeps
ballardw
Super User

Easiest is the compare the sum.

In a data step:

 

Newvar = (sum (varname1, varname2 ..., varname15) ge 6);

 

If the variables have nice names like rate1 rate2 ... rate15 then use a variable list

Newvar = (sum (varname1 - varname15) ge 6);

 

Or possibly you could just use a variable with the total.

NewVar = sum(varname1 - varname15);

Astounding
PROC Star

The idea of creating a new variable is good.  However, note that the dash is a minus sign here:

 

newvar = sum(var1 - var15);

 

If you want it to represent a list of variables, you have to add "of" before the list:

 

newvar = sum(of var1 - var15);

ballardw
Super User

@Astounding wrote:

The idea of creating a new variable is good.  However, note that the dash is a minus sign here:

 

newvar = sum(var1 - var15);

 

If you want it to represent a list of variables, you have to add "of" before the list:

 

newvar = sum(of var1 - var15);


 

Thanks for the reminder, generally I'm using that with arrays and didn't want to add complexity for this solution.

FreelanceReinh
Jade | Level 19

If your 15 variables have consecutive internal variable numbers, you could proceed like in the following example:

 

data test;
input numberone b c d e f g h i j k l m n nofifteen;
array atleast[15];
do z=1 to 15;
  atleast[z]=(sum(of numberone--nofifteen)>=z);
end;
drop z atleast1-atleast5;
cards;
1 0 0 1 1 0 1 0 0 1 0 1 1 0 1
;

This would create variables ATLEAST6, ATLEAST7, ..., ATLEAST15 containing the desired 0-1-indicators for "at least 6 ones", "at least 7 ones" and so on.

 

 

In order to check if the 15 internal variable numbers are in fact consecutive, run the following step and look where your 15 variables are located in the output:

 

proc contents data=yourdata varnum;
run;

 

 

If it turns out that the list of your 15 variables in the above PROC CONTENTS output is interrupted by other variables, modify the order of the variables by running the following step on your dataset (variable names taken from previous example):

 

data yourdata;
retain numberone b c d e f g h i j k l m n nofifteen;
set yourdata;
run;

This will ensure that the list numberone--nofifteen contains just the relevant 15 variables.

 

Patrick
Opal | Level 21

Nothing others have suggested already. Just posting a full code sample.

data have(drop=_:);
  array var_ {15} 8.;
  do _rows=1 to 20;
    do _i=1 to dim(var_);
      var_{_i}=floor(ranuni(0)*1.5);
    end;
    output;
  end;
run;

data want;
  set have;
  Six_Plus_indicator= ( sum(of var_:) >= 6 );
run;

SAS Innovate 2025: Call for Content

Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 16. Read more here about why you should contribute and what is in it for you!

Submit your idea!

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 6 replies
  • 1335 views
  • 1 like
  • 6 in conversation