Frequency of words in char var

Ronein · Posted 01-13-2022 01:40 AM

Hello

The data set contain 3 variables: Customer ID, transfer description, amount.

transfer description is a char variable.

Each customer ID can have multiple rows.

I want to do the following :

Find the frequency distribution of separate words from field "transfer description"


data have;
informat transfer_description $5000.;
input ID  transfer_description  amount;
infile datalines delimiter='|';
datalines;
111|Transfer to my friend Joe Kaplan|1000 
111|Salary IBM|2000
111|Salary WIZZ|3980
111|Transfer to my father|500
333|Transfer to my gf|4000
333|Son help|3000
222|Salary IBM|1500
222|Charity|2500
222|Charity|3000
222|transfer to my friend Jula|8000
;
Run;

andreas_lds · Posted 01-13-2022 02:36 AM

Seems to be dealing with the same problem as https://communities.sas.com/t5/SAS-Programming/split-char-into-multiple-columns-and-then-append-the-...

Do you want the frequencies by id?

Kurt_Bremser · Posted 01-13-2022 04:08 AM

data long;
set have;
length desc $20; /* set sufficient length as needed */
do i = 1 to countw(transfer_description," ");
  desc = scan(transfer_description,i," ");
  output;
end;
keep desc;
run;

proc freq data=long;
tables desc;
run;

Maxims of Maximally Efficient SAS Programmers
How to convert datasets to data steps
The macro for direct download as ZIP
How to post code
Please vote for Provide Sequential Search Capability for Hash Objects
How to deal with locked files on UNIX

Frequency of words in char var

Re: Frequency of words in char var

Re: Frequency of words in char var

Frequency of words in char var

Re: Frequency of words in char var

Re: Frequency of words in char var

Registration is open

SAS Training: Just a Click Away