- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
Hello! I'm starting with sas base and I have a question.
I'm not able to find the way to create a new column which categorizes different words form other column in the same dataset.
I need to create a column (named category) with 3 different variables: "Clothes", "Bags" and "Other". So when the main column says t-shirt in the new category colum would say "clothes" if it says shoes the new columns would say other, etc
Clothes would search words like t-shirts, hats, etc
bags would search bagpacks, packs, etc
and other the rest of words
Thank you for the help!
Accepted Solutions
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
Be aware of handling different cases of text like this:
data want;
set have;
if lowcase(main_column) in ('t-shirts', 'hoodies', 'hats') then do;
new_column = 'Clothes';
end;
else if lowcase(main_column) in ('bags', 'backpacks') then do;
new_column = 'Bags';
end;
else do;
new_column = 'Other';
end;
run;
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
Something like this? 🙂
data want;
set have;
if main_column in ('T-Shirt' /*, Insert other types of clothes here separated by , */) then do;
new_column = 'Clothes';
end;
else if main_column in ('Shoes') then do;
new_column = 'Other';
end;
else if main_column in ('Some Bags Name') then do;
new_column = 'Bags';
end;
run;
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
Not sure I understand completely, but you can put as many words inside the parenthesis as you want as below? 🙂
data want;
set have;
if main_column in ('T-Shirt' /*, Insert other types of clothes here separated by , */) then do;
new_column = 'Clothes';
end;
else if main_column in ('Shoes', 'Socks', 'Ties') then do;
new_column = 'Other';
end;
else if main column in ('Some Bags Name') then do;
new_column = 'Bags';
end;
run;
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
I mean, imagine you have 1000 words to put in the parenthesis, is there a way to say this;
T-shirts, hoodies, hats = clothes
bags, backpacks = bags
all the rest of the words = other ?
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
Ah, now I get it 🙂 Sure:
data want;
set have;
if main_column in ('T-shirts', 'hoodies', 'hats') then do;
new_column = 'Clothes';
end;
else if main_column in ('bags', 'backpacks') then do;
new_column = 'Bags';
end;
else do;
new_column = 'Other';
end;
run;
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
Anytime. Please mark my answer as a solution if it solved your problem 🙂 Regards.
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
Be aware of handling different cases of text like this:
data want;
set have;
if lowcase(main_column) in ('t-shirts', 'hoodies', 'hats') then do;
new_column = 'Clothes';
end;
else if lowcase(main_column) in ('bags', 'backpacks') then do;
new_column = 'Bags';
end;
else do;
new_column = 'Other';
end;
run;