BookmarkSubscribeRSS Feed
stevenyan0127
Fluorite | Level 6

Hi, I'm having some problem removing special characters in my dataset.

 

For values ">10,000(3/29), how do I just extract "10,000" for those values?

 

I have tried :

initial_dimer=prxchange('s/\(([^\)]+)\)//i', -1, initial_dimer), 

but it just removes the whole thing and leaves it blank.

Screen Shot 2022-09-07 at 4.22.34 PM.png

Any help would be appreciated! Thanks!

3 REPLIES 3
mkeintz
PROC Star

If you just have two standard patterns, namely (where 9 represents any sequence of digits).

  1. >99,999(xxxx)
      and
  2. 999

then in the case of the first pattern you can scan for the 1st "word" starting at position 2, where "word" is a string that terminates at the separator "(".  For the second pattern it's just a straight copy.

 

data have;
  input string $20.;
datalines;
>10,000(3/29)
256
run;

data want;
  set have;
  if string=: '>' then x=scan(substr(string,2),1,'(');
  else x=string;
  put (_all_) (=);
run;

 

Note the 

  =:

comparison operator compares two strings, truncating the longer string to the length of the shorter string.  So the comparison tests whether the string starts with a ">".

--------------------------
The hash OUTPUT method will overwrite a SAS data set, but not append. That can be costly. Consider voting for Add a HASH object method which would append a hash object to an existing SAS data set

Would enabling PROC SORT to simultaneously output multiple datasets be useful? Then vote for
Allow PROC SORT to output multiple datasets

--------------------------
Patrick
Opal | Level 21

and here a regex that should work

data have;
  input string $20.;
datalines;
>10,000(3/29)
256
run;

data want;
  set have;
  string=prxchange('s/^[^\d]*(\d[\d,]*).*$/$1/i', -1, strip(string));
  put string=;
run;
Ksharp
Super User
data have;
  input string $20.;
want=scan(string,1,',.','kd');
datalines;
>10,000(3/29)
256
;
run;

hackathon24-white-horiz.png

The 2025 SAS Hackathon has begun!

It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.

Latest Updates

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 3 replies
  • 1121 views
  • 1 like
  • 4 in conversation