BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
tim_bradshaw
Calcite | Level 5

Hi there

 

Fairly new to SAS and would really appreciate help with a particular problem.

 

I have a dataset comprising multiple rows per person.  For each person, I would like to transfer the maximum value from variable 1 (var1) onto all the other rows for that person and into a new variable (newvar).

 

e.g.

 

ID   Var1

1    4 

1    5

1    2

1    2

--

2    6

2    9

2    1

2    0

 

....would become.....

 

ID   Var1   Newvar

1    4         5

1    5         5

1    2         5

1    2         5

--

2    6         9

2    9         9

2    1         9

2    0         9

 

 

Any advice much appreciated!

 

Thank you

 

1 ACCEPTED SOLUTION

Accepted Solutions
Tom
Super User Tom
Super User

This is easy to do in PROC SQL because SAS will automatically remerge summary statistics for you.

 

 

proc sql ;
   create table want as
     select *,max(var1) as newvar
     from have
     group by id
   ;
quit;

You could do it in a data step using a technique known as DOW loops. The data must be sorted by ID .

data want ;
  do until (last.id);
     set have;
     by id;
     newvar=max(newvar,var1);
  end;
  do until (last.id);
     set have;
     by id;
     output;
  end;
run;

 

View solution in original post

5 REPLIES 5
RW9
Diamond | Level 26 RW9
Diamond | Level 26

Hi,

 

This selects all the data, then left joins the max value per ID group.

 

proc sql;
  create table WANT as
  select  A.*,
            B.MAX_VALUE
  from    HAVE A
  left join (select ID,
               max(VAR1) as MAX_VALUE 
               from HAVE 
               group by ID) B
  on       A.ID=B.ID;
quit;
mohamed_zaki
Barite | Level 11
proc sort data=have;
by id descending var1;
run;
data want;
set have;
retain newvar;
by id descending var1;
if first.id then do;
newvar=var1;
end;
run;
Tom
Super User Tom
Super User

This is easy to do in PROC SQL because SAS will automatically remerge summary statistics for you.

 

 

proc sql ;
   create table want as
     select *,max(var1) as newvar
     from have
     group by id
   ;
quit;

You could do it in a data step using a technique known as DOW loops. The data must be sorted by ID .

data want ;
  do until (last.id);
     set have;
     by id;
     newvar=max(newvar,var1);
  end;
  do until (last.id);
     set have;
     by id;
     output;
  end;
run;

 

tim_bradshaw
Calcite | Level 5

Thank you so much for your solutions. Your experise are much appreciated!

 

I meant to say that I have hundreds of millions of observations so an efficient command will be the most useful.  I will try these and see....

 

Thanks again

 

 

data_null__
Jade | Level 19

If you have hundreds of millions of observations then perhaps you should be thinking about big picture.  It will take a long time to read every observation find the max and write out all the data again with max attached. Then what? Will you read all hundreds of millions back in again to do what?  Programs that are adequate for most applications with thousand of observations may not be adequate when you have hundreds of millions of observations.

hackathon24-white-horiz.png

The 2025 SAS Hackathon has begun!

It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.

Latest Updates

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 5 replies
  • 2272 views
  • 2 likes
  • 5 in conversation