Gil_
Quartz | Level 8
Hi, I have a table with a comment field, and I need to extract the keywords Theft or Storm from it.
The data looks like this:

Id. Comment
1A. GL THE BOAT STORM
2A. SUNSHINE
3A. STORM IN THE CITY
4A. THEFT. IN THE JUNGLE
5A. RAINY DAY
6A. THEFT

Desired output:

Id.  Comment                 Output
1A.  GL THE BOAT STORM       Storm
2A.  SUNSHINE
3A.  STORM IN THE CITY       Storm
4A.  THEFT. IN THE JUNGLE    Theft
5A.  RAINY DAY
6A.  THEFT                   Theft

1 ACCEPTED SOLUTION

JoshB
Quartz | Level 8

You can use the find function to do this.

 

data want;
  set have;
  /* The "i" modifier makes FIND ignore case. */
  if find(comment,"storm","i")>0 then output = "Storm";
  else if find(comment,"theft","i")>0 then output = "Theft";
run;

This assumes you will only ever find one or the other. If it is possible to have both, you would have to check for both simultaneously and concatenate your results.
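
If both keywords can appear in one comment, the check described above could be sketched roughly as follows. This is only an illustration, not a tested solution: the flag variables has_storm and has_theft are hypothetical names introduced here, and the step assumes the input data set is called have and the text variable is called comment.

```sas
data want;
  set have;
  length output $12;   /* long enough to hold "Storm,Theft" */

  /* Hypothetical helper flags: 1 if the keyword is found, 0 otherwise. */
  has_storm = find(comment, "storm", "i") > 0;
  has_theft = find(comment, "theft", "i") > 0;

  /* Check for both keywords first, then for each one on its own. */
  if has_storm and has_theft then output = "Storm,Theft";
  else if has_storm then output = "Storm";
  else if has_theft then output = "Theft";

  drop has_storm has_theft;
run;
```

The LENGTH statement is there so that the combined value is not truncated to the length of the first value assigned.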


4 REPLIES
Gil_
Quartz | Level 8
I will test it. It's one or the other ... thanks for the response.
ArtC
Rhodochrosite | Level 12

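The IFC function can also be used to consolidate. In this case I chose to look for either or both.

data have;
   input Id $ @5 Comment $30.;
   datalines;
1A. GL THE BOAT STORM
2A. SUNSHINE
3A. STORM IN THE CITY
4A. THEFT. IN THE JUNGLE
5A. RAINY DAY
6A. THEFT
7A. THEFT IN A STORM
;
run;

data want;
   set have;
   /* INDEX is case-sensitive, so this matches uppercase text only.    */
   /* IFC returns the keyword on a match and a blank otherwise; CATX   */
   /* skips blank arguments and joins the remaining ones with a comma. */
   output=catx(',',ifc(index(comment,'STORM'),'storm',' '),ifc(index(comment,'THEFT'),'theft',' '));
run;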
Astounding
PROC Star

An issue to consider:  do you want the exact word only, or variations such as THEFTS, RAINSTORM, STORMY ...

 

Depending on your answer, you might switch from FIND to FINDW.
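
As a rough illustration of the FINDW suggestion, a whole-word version of the search might look like the sketch below. It assumes the same have data set and comment variable used earlier in the thread, and that words are separated by spaces or periods; both the delimiter list and the "i" modifier are choices made here for the example, not something the original poster specified.

```sas
data want;
  set have;
  length output $5;   /* "Storm" and "Theft" are both 5 characters */

  /* FINDW looks for the keyword as a complete word.               */
  /* Third argument: characters treated as word delimiters         */
  /* (space and period). Fourth argument: "i" ignores case.        */
  if findw(comment, "storm", " .", "i") > 0 then output = "Storm";
  else if findw(comment, "theft", " .", "i") > 0 then output = "Theft";
run;
```

With this approach, a comment containing only THEFTS or RAINSTORM should not be flagged, whereas FIND would flag both because it matches substrings.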
