BookmarkSubscribeRSS Feed
☑ This topic is solved. Need further help from the community? Please sign in and ask a new question.
jpeitz
Fluorite | Level 6

I have a cross-sectional data set with multiple 'age at diagnosis' variables. i.e. : 

data have;
input seqn cxage1 cxage2 cxage3 ;
1 58 51 .
2 24 35 55 
3 30 . .
4 52 44 .
run;

(I didn't manually input the data so this code might be wrong, it's just to show about what the set would look like if it was entered this way)

 

I would like to create one variable that uses the lowest 'age at diagnosis' whether it be cxage1 cxage2 or cxage3 so that the new dataset would look like: 

data want;
input seqn cxage1 cxage2 cxage3 lowage;
1 58 51 . 51
2 24 35 55 24
3 30 . . 30
4 52 44 . 44
run;

I am a fairly novice user for SAS so I've been searching and I've tried a proc sql statement, do until statement, and first statement and all gave me back binary output which is not what I want. 

 

Really appreciate any help and responses

Thank you

jpeitz

1 ACCEPTED SOLUTION

Accepted Solutions
PaigeMiller
Diamond | Level 26

So, SQL is the wrong tool. (Yes, you could make it work in SQL, but in general SQL still the wrong tool to do statistics with; just because you can do something in SQL doesn't mean you should do that thing in SQL)

 

A much better approach is a DATA step, where you can use the MIN() function. Assuming the variable names all begin with cxage, this will work

 

data want;
    set have;
    lowage=min(of cxage:);
run;

 

--
Paige Miller

View solution in original post

4 REPLIES 4
PaigeMiller
Diamond | Level 26

So, SQL is the wrong tool. (Yes, you could make it work in SQL, but in general SQL still the wrong tool to do statistics with; just because you can do something in SQL doesn't mean you should do that thing in SQL)

 

A much better approach is a DATA step, where you can use the MIN() function. Assuming the variable names all begin with cxage, this will work

 

data want;
    set have;
    lowage=min(of cxage:);
run;

 

--
Paige Miller
ballardw
Super User

This is one way:

data want;
   set have;
   array c(*) cxage: ;
   lowage = min(of c(*));
run; 

The key is the call to the MIN function, minimum of a list of variables. You could list the names directly in the function instead of the array as : lowage = min(cxage1, cxage2, cxage3); The array definition using a shorthand list will use all variables that have names starting with cxage. So if your data ends up with 20 variables with that name then it will adjust to use all of them.

hackathon24-white-horiz.png

The 2025 SAS Hackathon has begun!

It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.

Latest Updates

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 4 replies
  • 1683 views
  • 2 likes
  • 3 in conversation