About Rohit_1990

Rohit_1990 · ‎06-05-2020

Hi Samantha, Thanks for your reply but the suggested approach does work for direct match (using index function) but fails to capture variations since variation is within a large string , so can't do a direct spedis or soundex activity for it. Nonetheless thanks again and any further advise is highy valued. Regards

Rohit_1990 · ‎10-04-2019

I am using number strings

ChrisNZ · ‎05-19-2019

> Can you please elaborate more on how to carry out successive matches. Not too sure whats unclear. So the course of action would be: 1. Sort the tables by ZIP 2. Merge on ZIP equality and SUBCITY equal to the start of city (use scan() for the first word if long enough or scan() using the parentheses as delimiter or operator =: , or all these successively) 3. What's hasn't been matched can be retried with other criteria including fuzzy ones, like using the function compged() . The cost get higher but the volumes get smaller. 1. Join the easily found matches using an obvious criterion like ZIP equality and SUBCITY = first word => function scan() 2. Join the unmatched data on a less direct criterion like ZIP equality and SUBCITY = any word => function index() 3. Repeat the process for unmatched data until satisfied: the volume to match goes down as the criterion increases in fuzziness. 4. When finished, append the successive matches. It is a good idea to keep track of the match method so the data includes some sort of match-quality score.

ChrisNZ · ‎05-01-2019

Why don't you keep the street number in your example? What do you do if there is no match?

Rohit_1990 · ‎03-29-2019

Hi, I have no access to sas right now will check it tomorrow in my ofc and post it to you whether it worked on my actual data or not. In the meantime if you can help me with explaination that would be graet help. Thanks a ton 😊

PGStats · ‎03-10-2019

I would simplify to: data want; set have; /* add blank between repeated string of at least two characters per repetition */ str_new=prxchange('s/\b(\w{2,})\1\b/\1 \1/oi', -1, str); run;

Ksharp · ‎03-06-2019

I suggest to split table A into many small tables and do matching . data a; infile datalines truncover; input c1 $char80.; datalines; jack is walking. jack walks. jack is running. Jack runs daily. Jack is jumping He is running over the bridge. He doesn't talk while running ; run; data b; infile datalines truncover; input c2 $20.; datalines; walking walks running runs jump ; run; proc sql; create table want as select * from a left join b on a.c1 contains strip(c2); quit; P.S. tables is stealing from Jenson.

Rohit_1990 · ‎03-04-2019

Thanks a lot !!!!!

tomrvincent · ‎03-01-2019

use the scan function to go word by word, skipping duplicates.

Ksharp · ‎02-23-2019

If I understood right. data x; input C1 C2 $; cards; 1 A 2 A 3 A 4 B 3 B 5 B 1 C 6 C ; run; proc sql; create table temp as select c1,max(c2) as c2,count(*) as n from x group by c1 order by c1; quit; data want; set temp; length lag $ 40; retain lag; lag=lag(c2); if n=1 and _n_ ne 1 then do; if lag>c2 then c2=lag; end; drop n lag; run;

Rohit_1990 · ‎02-21-2019

Hi , Somehow the third part of code forming dataset DD is not getting correctly populated. Column c2 is not getting populated.

kiranv_ · ‎02-16-2019

try encoding options. check the link below. http://support.sas.com/documentation/cdl/en/nlsref/63072/HTML/default/viewer.htm#n0kzmrsdx5evkxn1ihs24h8ljg2b.htm

Ksharp · ‎02-14-2019

OK. How about this one ? data have; input Party1 party2; cards; 1 2 1 7 2 1 2 8 2 9 3 5 3 8 7 3 8 7 9 1 ; run; data want; if _n_=1 then do; if 0 then set have; declare hash h(); h.definekey('party2'); h.definedone(); end; set have; if h.check(key:party1)=0 then delete; else h.replace(); run; proc print;run;

Online Status	Offline
Date Last Visited	‎02-19-2022 07:30 PM

Re: find a probable substring within a string using lookup value

find a probable substring within a string using lookup value

Re: updating from multiple table

Re: updating from multiple table

Re: updating from multiple table

updating from multiple table

Re: INDEX of a substring in as string

Hi Chris thanks for you reply. But I can use scan functio...

Re: INDEX of a substring in as string

Re: INDEX of a substring in as string

Re: Removing string from another string

Re: find a probable substring within a string using lookup value

Re: updating from multiple table

Re: INDEX of a substring in as string

Re: Find position of substring in a string

Re: Fuzzy logic to locate duplicates in a table

Re: Separate words

Re: Extract substribg

Re: seperating number and charcter in a string

Re: prxparse using a column values

Re: Update group value

Re: Update value in table

Re: Issue while reading data from teradata to sas

Re: Relationship analysis issue