- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
Dear all,
I have 2 ordinal variables (quintiles).
Now I want to create a new categorical variable which combine all levels of those ordinal ones.
So my new var will have 25 categories.
Using if statement is really not a good solution.
Could anyone help me out with some tricks?
Thank you.
Accepted Solutions
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
@Minhtrang wrote:
Thank you very much, RW9, for your reply.
2 variables quintiles of Na and K are ordinal variables. They were created from Proc rank with the continuous variables of Na, K.
This suggests to me you already have a data set created by 2 proc ranks, say variables Narank and Krank for each observation in the data set. If so, I suspect you want something like:
data want;
set have;
composite_rank=catx('_',narank,krank);
run;
This will create a character variable composite_rank as "1_1", "1_2", ... "5_4", "5_5". The CATX function works without objection enve when narank and krank are numerics.
If you want a numeric composite variable, then something like
composite_rank=10*narank+krank;
Just remember though: the numeric version assumes that krank is never more than 1 digit. If krank reachs 2 digits (say deciles 1 to 10), you have to multiple narank by 100.
A final note: the default of proc rank is to make qunitles 0 to 4 rather than 1 to 5. In that case, the number version of the composite will show only 1 digit when narank=0 - a good reason to make a character composite.
The hash OUTPUT method will overwrite a SAS data set, but not append. That can be costly. Consider voting for Add a HASH object method which would append a hash object to an existing SAS data set
Would enabling PROC SORT to simultaneously output multiple datasets be useful? Then vote for
Allow PROC SORT to output multiple datasets
--------------------------
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
Please reveiw the guidance found underneath the Post button on post a new question:
Without anything to work with I can't really say anything.
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
Could you please present the sample data and the expected categorical output. This will help to get better response.
I believe using the format we could create the categorical variables, but a sample data will help me to confirm.
Jag
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
Quintile of Na |
Quintile of K | NaKgroup |
1 | 1 | 11 |
2 | 2 | 12 |
3 | 3 | 13 |
4 | 4 | 14 |
5 | 5 | 15 |
21 | ||
22 | ||
23 | ||
24 | ||
25 |
Dear all,
My data looks as the table above.
I have 2 variables which are quintiles of Na and K (so each of them has 5 levels: 1st, 2nd,...5th quintile).
Now I want to create new var as NaKgroup which combines those 2 quintiles variables. This new variable will have 25 categories.
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
Again: Please post test data in the form of a datastep.
As such I now need to ask the question, are those variables nunmeric? Also, why do the first two columns only appear in the first five rows?
Something like:
data want; set have; nakgroup=input(cats(put(quit_na,best.),put(quint_k)),best.); run;
Assuming both columns are numeric and you want numeric result.
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
Thank you very much, RW9, for your reply.
2 variables quintiles of Na and K are ordinal variables. They were created from Proc rank with the continuous variables of Na, K.
Sorry I don't have SAS on the current PC so I can't post test data.
I just looked for hints.
I think I'll try your suggested solution with input statement.
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
@Minhtrang wrote:
Thank you very much, RW9, for your reply.
2 variables quintiles of Na and K are ordinal variables. They were created from Proc rank with the continuous variables of Na, K.
This suggests to me you already have a data set created by 2 proc ranks, say variables Narank and Krank for each observation in the data set. If so, I suspect you want something like:
data want;
set have;
composite_rank=catx('_',narank,krank);
run;
This will create a character variable composite_rank as "1_1", "1_2", ... "5_4", "5_5". The CATX function works without objection enve when narank and krank are numerics.
If you want a numeric composite variable, then something like
composite_rank=10*narank+krank;
Just remember though: the numeric version assumes that krank is never more than 1 digit. If krank reachs 2 digits (say deciles 1 to 10), you have to multiple narank by 100.
A final note: the default of proc rank is to make qunitles 0 to 4 rather than 1 to 5. In that case, the number version of the composite will show only 1 digit when narank=0 - a good reason to make a character composite.
The hash OUTPUT method will overwrite a SAS data set, but not append. That can be costly. Consider voting for Add a HASH object method which would append a hash object to an existing SAS data set
Would enabling PROC SORT to simultaneously output multiple datasets be useful? Then vote for
Allow PROC SORT to output multiple datasets
--------------------------
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
Dear mkeintz,
You understood my problem so well.
Categorical variable is my desired outcome.
Thank you so much for your explanation in depth as well as the given optimal solution.
Best,
Trang
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
Cartesian Product.
proc sql;
select cats(na,k) as new
from
(select distinct na from have),
(select distinct k from have)
;
quit;
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
You can try below:
proc sql;
select cat(t1.NA,t2.K) as NAK
from temp t1,
temp t2;
run;