SAS Programming

ginak · Posted 05-12-2020 05:03 PM

Hi there,

I have a dataset with long text fields. I need to replace every instance of a dollar amount, say, "439.33" with "439.33USD"

Here's a sample data set:

data one;
infile cards  ;
length name $200;
input name;
cards;
Ccy="USD">738.56</NS!figlwijfoaiu345Ccy="USD">38.44</ScT@
run;

So what I have is:

What I want is USD to be present after every dollar amount as is here:

I found two different community forums, and know that I will need a combination of

index
1. To find the decimal point that follows any "USD" in the text
substr
tranwrd functions
1. To replace that dollar amount with [dollaramount]USD

There could be multiple instances of Ccy="USD">[dollar amount] in the string. I just want to tack on the USD after the dollar amount, but not every dollar amount is XYZ.AB, some are YZ.AB, some are Z.AB etc. There will always be a decimal point with two decimal places though that follow the Ccy="USD"

Thanks!

ChrisNZ · Posted 05-12-2020 05:47 PM

Is it always USD? Or are there other currencies?

High-Performance SAS Coding - Third Edition

ginak · Posted 05-12-2020 06:21 PM

Hi there, yes it'll always say "USD" (and include the double quotes in the text too). Thanks!

ChrisNZ · Posted 05-12-2020 06:15 PM

This replaces:

at least a digit followed by a dot followed by 2 digits

with:

the group found followed by USD.

data TWO;
 set ONE; 
 NAME2=prxchange('s/(\d+\.\d\d)/\1USD/',-1,NAME);
run;

{edited to avoid losing one character}

High-Performance SAS Coding - Third Edition

ginak · Posted 05-12-2020 06:41 PM

I think this works, thank you! It works with my fake sample data but I'll try it with my actual data tomorrow and let you know if I have any questions 🙂 I never understood the prxchange command and where it comes from. Would love to learn more since it seems so useful

ChrisNZ · Posted 05-12-2020 07:34 PM

The syntax is that of Perl regular expressions.

Tons of tutorials on the web, that's how I learnt.

High-Performance SAS Coding - Third Edition

s_lassen · Posted 05-13-2020 04:38 AM

The solution presented by @ChrisNZ goes part of the way, but I would add a bit to make sure that only the right numbers get selected:

name=prxchange('s/("USD">\d+\.\d\d)(?=<)/\1USD/',-1,name);

This looks for the string "USD"> followed by at least one digit, a period and two digits (that's the stuff in the first paranthesis). The whole thing should be followed by "<", but that string is not included in the match (the second paranthesis, "?=" denotes a look-ahead buffer), the match is replaced with the same plus the text "USD".

SAS Programming

How to replace characters in a string that follow certain characters?

Re: How to replace characters in a string that follow certain characters?

Re: How to replace characters in a string that follow certain characters?

Re: How to replace characters in a string that follow certain characters?

Re: How to replace characters in a string that follow certain characters?

Re: How to replace characters in a string that follow certain characters?

Re: How to replace characters in a string that follow certain characters?

Replace a character in a string

Stripping non-specific characters from string by number

Remove Character in String

How to remove specific character from a string value.

How to convert a character value to numeric in SAS

Follow Us

What is...

SAS Programming

Our biggest data and AI event of the year.

SAS Training: Just a Click Away

Follow Us

What is...