<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic How to compare and identify similar strings (i.e., Names) from two datasets? in New SAS User</title>
    <link>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693409#M25006</link>
    <description>&lt;P&gt;Hello Experts, I would be grateful if anyone could help me solve this issue.&lt;/P&gt;
&lt;P&gt;I have the following two datasets.&lt;/P&gt;
&lt;P&gt;DATASET-I has two variables: a city code (ED = Edmonton, TO = Toronto, CA = Calgary) and a biography of a person.&lt;/P&gt;
&lt;P&gt;DATASET-II has three variables: a city code (same coding as DATASET-I), a name, and that person's status.&lt;/P&gt;
&lt;P&gt;DATASET-I&lt;/P&gt;
&lt;TABLE&gt;
&lt;TR&gt;&lt;TH&gt;Code&lt;/TH&gt;&lt;TH&gt;Biography&lt;/TH&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;ED&lt;/TD&gt;&lt;TD&gt;Mr. Joseph Andrew is hardworking person.&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;ED&lt;/TD&gt;&lt;TD&gt;Ms. Angela Blanc is genius.&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;ED&lt;/TD&gt;&lt;TD&gt;Mrs. Sarah Greene was absent.&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;ED&lt;/TD&gt;&lt;TD&gt;Mr. Rubin D. Elvin cleans the room.&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;TO&lt;/TD&gt;&lt;TD&gt;Ms. Luis Fernandez is a teacher.&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;TO&lt;/TD&gt;&lt;TD&gt;Mr. Jose Antonio works from home.&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;TO&lt;/TD&gt;&lt;TD&gt;Mr. Luis Van Bob comes late.&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;CA&lt;/TD&gt;&lt;TD&gt;Ms. Maria Jose travelled last week.&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;CA&lt;/TD&gt;&lt;TD&gt;Mr. Philip Bursa will help you.&lt;/TD&gt;&lt;/TR&gt;
&lt;/TABLE&gt;
&lt;P&gt;DATASET-II&lt;/P&gt;
&lt;TABLE&gt;
&lt;TR&gt;&lt;TH&gt;Code&lt;/TH&gt;&lt;TH&gt;Names&lt;/TH&gt;&lt;TH&gt;Status&lt;/TH&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;ED&lt;/TD&gt;&lt;TD&gt;Blanc Angela&lt;/TD&gt;&lt;TD&gt;Teacher&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;ED&lt;/TD&gt;&lt;TD&gt;Albert Crew&lt;/TD&gt;&lt;TD&gt;Doctor&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;ED&lt;/TD&gt;&lt;TD&gt;Joseph T. Andrew&lt;/TD&gt;&lt;TD&gt;Lawyer&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;TO&lt;/TD&gt;&lt;TD&gt;Luis V. Bob&lt;/TD&gt;&lt;TD&gt;Doctor&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;TO&lt;/TD&gt;&lt;TD&gt;J. Antonio&lt;/TD&gt;&lt;TD&gt;Teacher&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;CA&lt;/TD&gt;&lt;TD&gt;Accent Andiron&lt;/TD&gt;&lt;TD&gt;Worker&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;CA&lt;/TD&gt;&lt;TD&gt;Maria Jose Antonio&lt;/TD&gt;&lt;TD&gt;Student&lt;/TD&gt;&lt;/TR&gt;
&lt;/TABLE&gt;
&lt;P&gt;What I want:&lt;/P&gt;
&lt;P&gt;First, I want to compare the names in DATASET-II with the names that appear in the Biography column of DATASET-I and identify, city by city, the names that are the same or similar in both datasets. Unmatched names should be skipped.&lt;/P&gt;
&lt;P&gt;Second, I want to extract those common names for each city and assign their respective status. For example, the output could look like this:&lt;/P&gt;
&lt;P&gt;DATAOUTPUT&lt;/P&gt;
&lt;TABLE&gt;
&lt;TR&gt;&lt;TH&gt;Code&lt;/TH&gt;&lt;TH&gt;Common Names&lt;/TH&gt;&lt;TH&gt;Status&lt;/TH&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;ED&lt;/TD&gt;&lt;TD&gt;Joseph T. Andrew&lt;/TD&gt;&lt;TD&gt;Lawyer&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;ED&lt;/TD&gt;&lt;TD&gt;Angela Blanc&lt;/TD&gt;&lt;TD&gt;Teacher&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;TO&lt;/TD&gt;&lt;TD&gt;Luis Van Bob&lt;/TD&gt;&lt;TD&gt;Doctor&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;TO&lt;/TD&gt;&lt;TD&gt;Jose Antonio&lt;/TD&gt;&lt;TD&gt;Teacher&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;CA&lt;/TD&gt;&lt;TD&gt;Maria Jose Antonio&lt;/TD&gt;&lt;TD&gt;Student&lt;/TD&gt;&lt;/TR&gt;
&lt;/TABLE&gt;
&lt;P&gt;Can anyone guide me on how to code this in SAS, please?&lt;/P&gt;</description>
    <pubDate>Thu, 22 Oct 2020 07:16:46 GMT</pubDate>
    <dc:creator>anasmalik13</dc:creator>
    <dc:date>2020-10-22T07:16:46Z</dc:date>
    <item>
      <title>How to compare and Identify the similar characters or words such as Names?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693366#M25000</link>
      <description>&lt;P&gt;Hello Experts,&lt;/P&gt;
&lt;P&gt;I have a dataset comprising four variables:&lt;/P&gt;
&lt;P&gt;City Code identifies the city.&lt;/P&gt;
&lt;P&gt;Biography gives details of several individuals working in those cities.&lt;/P&gt;
&lt;P&gt;Names gives the names of individuals whose working status is known and listed in the next column, Status.&lt;/P&gt;
&lt;P&gt;For example, for city AB, I have biographies of four people (whose employment status is unknown) and names of four other people (whose employment status is known). I want to find, city by city, the strings in the Names column that match strings in the Biography column. If a name matches any of the names that appear in the Biography column, the result should be that person's status.&lt;/P&gt;
&lt;P&gt;The output should look like this:&lt;/P&gt;
&lt;TABLE&gt;
&lt;TR&gt;&lt;TH&gt;City&lt;/TH&gt;&lt;TH&gt;Matched Name&lt;/TH&gt;&lt;TH&gt;Status&lt;/TH&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;AB&lt;/TD&gt;&lt;TD&gt;Joseph T. Andrew&lt;/TD&gt;&lt;TD&gt;Lawyer&lt;/TD&gt;&lt;/TR&gt;
&lt;/TABLE&gt;
&lt;P&gt;Given the following data table, can anyone guide me on how to code this in SAS, please?&lt;/P&gt;
&lt;TABLE&gt;
&lt;TR&gt;&lt;TH&gt;City Code&lt;/TH&gt;&lt;TH&gt;Biography&lt;/TH&gt;&lt;TH&gt;Names&lt;/TH&gt;&lt;TH&gt;Status&lt;/TH&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;AB&lt;/TD&gt;&lt;TD&gt;Mr. Joseph Andrew is hardworking person.&lt;/TD&gt;&lt;TD&gt;Blanc Angel&lt;/TD&gt;&lt;TD&gt;Teacher&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;AB&lt;/TD&gt;&lt;TD&gt;Ms. Angela Blanc is genius.&lt;/TD&gt;&lt;TD&gt;Albert Crew&lt;/TD&gt;&lt;TD&gt;Doctor&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;AB&lt;/TD&gt;&lt;TD&gt;Mrs. Sarah Greene was absent.&lt;/TD&gt;&lt;TD&gt;Rubin Ervin&lt;/TD&gt;&lt;TD&gt;Engineer&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;AB&lt;/TD&gt;&lt;TD&gt;Mr. Rubin D. Elvin cleans the room.&lt;/TD&gt;&lt;TD&gt;Joseph T. Andrew&lt;/TD&gt;&lt;TD&gt;Lawyer&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;AC&lt;/TD&gt;&lt;TD&gt;Ms. Luis Fernandez is a teacher.&lt;/TD&gt;&lt;TD&gt;Luis V. Bob&lt;/TD&gt;&lt;TD&gt;Doctor&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;AC&lt;/TD&gt;&lt;TD&gt;Mr. Jose Antonio works from home.&lt;/TD&gt;&lt;TD&gt;J. Antonio&lt;/TD&gt;&lt;TD&gt;Teacher&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;AC&lt;/TD&gt;&lt;TD&gt;Mr. Luis Van Bob comes late.&lt;/TD&gt;&lt;TD&gt;Luis Fernandez&lt;/TD&gt;&lt;TD&gt;Lawyer&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;AD&lt;/TD&gt;&lt;TD&gt;Ms. Maria Jose travelled last week.&lt;/TD&gt;&lt;TD&gt;Accent Andiron&lt;/TD&gt;&lt;TD&gt;Worker&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;AD&lt;/TD&gt;&lt;TD&gt;Mr. Philip Bursa will help you.&lt;/TD&gt;&lt;TD&gt;Maria Jose Antonio&lt;/TD&gt;&lt;TD&gt;Student&lt;/TD&gt;&lt;/TR&gt;
&lt;/TABLE&gt;</description>
      <pubDate>Thu, 22 Oct 2020 05:19:28 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693366#M25000</guid>
      <dc:creator>anasmalik13</dc:creator>
      <dc:date>2020-10-22T05:19:28Z</dc:date>
    </item>
    <item>
      <title>Re: How to compare and Identify the similar characters or words such as Names?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693371#M25001</link>
      <description>&lt;P&gt;For the data that you have posted, what would the results that you want look like?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Jim&lt;/P&gt;</description>
      <pubDate>Thu, 22 Oct 2020 02:50:03 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693371#M25001</guid>
      <dc:creator>jimbarbour</dc:creator>
      <dc:date>2020-10-22T02:50:03Z</dc:date>
    </item>
    <item>
      <title>Re: How to compare and Identify the similar characters or words such as Names?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693376#M25002</link>
      <description>I want to code the employment status variable. To do this, I have biographies of several people whose employment status is unknown, and names of some individuals whose employment status is known. I want to identify the names common to both and assign their respective employment status. A person described in the Bio column may or may not appear in the Names column. If the name is found, then I can say that, for city code AB, Joseph T. Andrew works as a Lawyer.</description>
      <pubDate>Thu, 22 Oct 2020 03:29:36 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693376#M25002</guid>
      <dc:creator>anasmalik13</dc:creator>
      <dc:date>2020-10-22T03:29:36Z</dc:date>
    </item>
    <item>
      <title>Re: How to compare and Identify the similar characters or words such as Names?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693385#M25003</link>
      <description>&lt;P&gt;Please post the data you have in usable form and exactly what you expect as result - again a data step using datalines is the preferred way, but a table will do also. Then explain for each observation in the expected result why that row was added.&lt;/P&gt;</description>
      <pubDate>Thu, 22 Oct 2020 04:51:25 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693385#M25003</guid>
      <dc:creator>andreas_lds</dc:creator>
      <dc:date>2020-10-22T04:51:25Z</dc:date>
    </item>
    <item>
      <title>How to compare and identify similar strings (i.e., Names) from two datasets?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693409#M25006</link>
      <pubDate>Thu, 22 Oct 2020 07:16:46 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693409#M25006</guid>
      <dc:creator>anasmalik13</dc:creator>
      <dc:date>2020-10-22T07:16:46Z</dc:date>
    </item>
    <item>
      <title>Re: How to compare and identify similar strings (i.e., Names) from two datasets?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693412#M25007</link>
      <description>&lt;P&gt;&lt;FONT size="5"&gt;Please don't double post questions!&lt;/FONT&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 22 Oct 2020 07:27:38 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693412#M25007</guid>
      <dc:creator>andreas_lds</dc:creator>
      <dc:date>2020-10-22T07:27:38Z</dc:date>
    </item>
    <item>
      <title>Re: How to compare and identify similar strings (i.e., Names) from two datasets?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693420#M25008</link>
      <description>&lt;P&gt;It will be more difficult to match -- reliably -- something like "Jose Antonio" with "J Antonio".&amp;nbsp; However, I have written some code that will do basic text matching.&amp;nbsp; It still needs a bit of tuning.&amp;nbsp; If you are a good SAS coder, you can adjust the code as you see fit.&amp;nbsp; I'll include the code below.&amp;nbsp; Otherwise, I'll try to spend some more time with it on Thursday or Friday.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Jim&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Results, immediately below.&amp;nbsp; Luis Van Bob is incorrect.&amp;nbsp; That case needs more work.&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="jimbarbour_0-1603353838558.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/50901i1717C7183A15ECE8/image-size/medium?v=v2&amp;amp;px=400" role="button" title="jimbarbour_0-1603353838558.png" alt="jimbarbour_0-1603353838558.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Code:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;DATA	Biographies;
	INPUT	City_Code	&amp;amp;	:	$2.
			Biography	&amp;amp;	:	$256.
			;
DATALINES;
ED                      Mr. Joseph Andrew is hardworking person.          
ED                      Ms. Angela Blanc is genius.                       
ED                      Mrs. Sarah Greene was absent.                     
ED                      Mr. Rubin D. Elvin cleans the room.               
TO                      Ms. Luis Fernandez is a teacher.                  
TO                      Mr. Jose Antonio works from home.                 
TO                      Mr. Luis Van Bob comes late.                      
CA                      Ms. Maria Jose travelled last week.               
CA                      Mr. Philip Bursa will help you.                   
;
RUN;

DATA	Name_And_Status;
	DROP	_:;
	RETAIN	_Max_Names	0;

	INPUT	City_Cd		&amp;amp;	:	$2.
			Name		&amp;amp;	:	$64.
			Status		&amp;amp;	:	$32.
			;

	_Rec_Cnt								+	1;
	Name_Cnt								=	COUNTW(Name, ' ');
	IF	Name_Cnt							&amp;gt;	_Max_Names	THEN
		_Max_Names							=	Name_Cnt;

	CALL	SYMPUTX('Rec_Cnt',		_Rec_Cnt, 	'G');
	CALL	SYMPUTX('Max_Names',	_Max_Names, 'G');

DATALINES;
ED     Blanc Angela            Teacher
ED     Albert Crew             Doctor
ED     Rubin Elvin             Engineer
ED     Joseph T. Andrew        Lawyer
TO     Luis V. Bob             Doctor
TO     J. Antonio              Teacher
TO     Luis Fernandez          Lawyer
CA     Accent Andiron          Worker
CA     Maria Jose Antonio      Student
;
RUN;

DATA	Matched_Employment_Status;
	DROP	_:;
	DROP	City_Cd;
	DROP	Name;
	DROP	Status;
	DROP	Name_Cnt;

	ARRAY	Names		[&amp;amp;Rec_Cnt, &amp;amp;Max_Names]	$32	_TEMPORARY_;
	ARRAY	Statuses	[&amp;amp;Rec_Cnt]				$32	_TEMPORARY_;
	ARRAY	Codes		[&amp;amp;Rec_Cnt]				$2	_TEMPORARY_;
	ARRAY	Match		[&amp;amp;Rec_Cnt]				3	_TEMPORARY_;

	IF	_N_									=	1	THEN
		DO;
			DO	UNTIL	(_End_Of_Names);
				SET	Name_And_Status		END	=	_End_Of_Names;
				_Name						+	1;
				Statuses[_Name]				=	Status;
				Codes	[_Name]				=	City_Cd;
				DO	_i						=	1	TO	Name_Cnt;
					Names[_Name, _i]		=	COMPRESS(SCAN(Name, _i),,'KA');
*					PUTLOG	"&amp;amp;Nte1  "   _Name=   _i=  Names[_Name, _i]=  Statuses[_Name]=  Codes[_Name]=  ;
				END;
			END;
		END;

*	STOP;

	SET	Biographies;

	PUTLOG	"&amp;amp;Nte2  ";
	
	_Match_Cnt								=	0;
	_Token_Cnt								=	COUNTW(Biography, ' ');
	PUTLOG	"&amp;amp;Nte1  "  _N_=  _Token_Cnt=;

	DO	_i									=	1	TO	_Token_Cnt;
		DO	_j								=	1	TO	&amp;amp;Rec_Cnt;
			DO	_k							=	1	TO	&amp;amp;Max_Names;
				_Bio						=	COMPRESS(SCAN(Biography, _i),,'KA');
				IF	City_Code				=	Codes[_j]				AND
					SCAN(Biography, _i)		=	Names[_j, _k]			THEN
					DO;
						_Match_Cnt			+	1;
						Match[_Match_Cnt]	=	_j;
					END;
				PUTLOG	"&amp;amp;Nte2  "  _N_=  _i=  _j=  _k=  _Bio=  Statuses[_j]=  Names[_j, _k]=  _Match_Cnt=;
			END;
			IF	_Match_Cnt					&amp;gt;=	2						THEN
				DO;
					_Match					=	1;
					DO	_l					=	1	TO	(_Match_Cnt - 1);
						IF	Match[_l]		^=	Match[_l + 1]			THEN	
							_Match			=	1;
					END;
					IF	_Match											THEN
						DO;
							Employ_Status	=	Statuses[_j];
							_j				=	&amp;amp;Rec_Cnt;
							_i				=	_Token_Cnt;
						END;
				END;
		END;
	END;
RUN;
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Thu, 22 Oct 2020 08:05:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693420#M25008</guid>
      <dc:creator>jimbarbour</dc:creator>
      <dc:date>2020-10-22T08:05:29Z</dc:date>
    </item>
    <item>
      <title>Re: How to compare and identify similar strings (i.e., Names) from two datasets?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693568#M25034</link>
      <description>&lt;P&gt;OK, a little sleep, and I spotted the deficiency in my program.&amp;nbsp; I now have it matching to the correct people:&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="jimbarbour_0-1603387438586.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/50910iCDAFF4A868646DA7/image-size/medium?v=v2&amp;amp;px=400" role="button" title="jimbarbour_0-1603387438586.png" alt="jimbarbour_0-1603387438586.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;As I say, though, when your Dataset II (the master definition of who has what status) has only a first initial, it's going to be difficult to match against.&amp;nbsp; It will be difficult to distinguish a J. Antonio from a Maria Jose Antonio, although a secondary match on City Code helps tremendously.&amp;nbsp; There is always the danger of a false positive, i.e. associating someone with a Status that is not really theirs.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I set my matching to require two matching words.&amp;nbsp; For example, for the name "Luis Van Bob," you could have up to three matches ("Luis", "Van", and "Bob").&amp;nbsp; There has to be a minimum of two matches for the program to consider it a positive name match.&amp;nbsp; When trying to match, for example, "Luis Fernandez" with "Luis Van Bob", the match count will be 1 because only "Luis" matches.&amp;nbsp; The program does &lt;STRONG&gt;not&lt;/STRONG&gt; consider this to be a name match.&amp;nbsp; However, "Luis V Bob" and "Luis Van Bob" give a match count of 2 ("Luis" and "Bob"), and therefore the program considers this a name match.&amp;nbsp;&lt;/P&gt;
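&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;As a rough illustration of that counting rule, here is a standalone sketch (not the full program).&amp;nbsp; The variable names Master, Bio, and Shared are made up for this example, and it uses FINDW rather than the array lookup in the program below.&amp;nbsp; It counts how many words of one name also appear as whole words in the other:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;DATA	_NULL_;
	LENGTH	Master	Bio	$64	_Word	$32;
	Master	=	'Luis V. Bob';
	Bio		=	'Luis Van Bob';
	Shared	=	0;
	DO	_i	=	1	TO	COUNTW(Master, ' .');
		_Word	=	SCAN(Master, _i, ' .');
		*	FINDW returns the position of a whole-word hit, or 0 if there is none.;
		IF	FINDW(Bio, STRIP(_Word), ' .')	&amp;gt;	0	THEN
			Shared	+	1;
	END;
	PUTLOG	'NOTE:  '	Master=	Bio=	Shared=;
RUN;
&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;For this pair the log should show Shared=2 ("Luis" and "Bob").&amp;nbsp; The initial "V" does not count because it is not a whole word in the other name.&lt;/P&gt;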
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You could still get false positives, but city matching helps a great deal in this regard.&amp;nbsp; For example, Jose Antonio and Maria Jose Antonio could easily be falsely matched, but since Jose Antonio is from Toronto and Maria Jose Antonio is from Calgary, they will not be matched.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;WARNING&lt;/STRONG&gt;:&amp;nbsp; Even matching on both city and name may still give some false positives.&amp;nbsp; For really common names, there absolutely could be two people with the same first and last names in a large city.&amp;nbsp; Name matching is never 100%.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;We can increase the number of matches if we use something like the COMPGED function or the SPEDIS function, but this also increases our chances of false positives.&amp;nbsp; See&amp;nbsp;&lt;A href="https://support.sas.com/resources/papers/proceedings14/1674-2014.pdf" target="_blank" rel="noopener"&gt;https://support.sas.com/resources/papers/proceedings14/1674-2014.pdf&lt;/A&gt;&lt;/P&gt;
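&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;A minimal sketch of what such a fuzzy comparison might look like, using one misspelled pair from the sample data.&amp;nbsp; The variable names and the cutoff of 100 are assumptions for illustration only and would need tuning against real names:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;DATA	_NULL_;
	Name_1	=	'Angela Blanc';
	Name_2	=	'Angela Blank';
	*	Lower scores mean the two strings are more alike, and 0 means identical.;
	Ged		=	COMPGED(Name_1, Name_2);
	Spd		=	SPEDIS(Name_1, Name_2);
	*	The cutoff of 100 is an arbitrary starting point and must be tuned on real data.;
	Fuzzy_Match	=	(Ged	&amp;lt;=	100);
	PUTLOG	'NOTE:  '	Name_1=	Name_2=	Ged=	Spd=	Fuzzy_Match=;
RUN;
&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;A looser cutoff catches more spelling variants but also lets more unrelated names through, which is the trade-off described above.&lt;/P&gt;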
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The SAS code is below.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Jim&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;DATA	Biographies;
	INPUT	City_Code	&amp;amp;	:	$2.
			Biography	&amp;amp;	:	$256.
			;
DATALINES;
ED   Mr. Joseph Andrew is hardworking person.          
ED   Ms. Angela Blanc is genius.                       
ED   Mrs. Sarah Greene was absent.                     
ED   Mr. Rubin D. Elvin cleans the room.               
TO   Ms. Luis Fernandez is a teacher.                  
TO   Mr. Jose Antonio works from home.                 
TO   Mr. Luis Van Bob comes late.                      
CA   Ms. Maria Jose travelled last week.               
CA   Mr. Philip Bursa will help you.                   
;
RUN;

**------------------------------------------------------------------------------**;

DATA	Name_And_Status;
	DROP	_:;
	RETAIN	_Max_Names	0;

	INPUT	City_Cd		&amp;amp;	:	$2.
			Name		&amp;amp;	:	$64.
			Status		&amp;amp;	:	$32.
			;

	_Rec_Cnt								+	1;
	Name_Cnt								=	COUNTW(Name, ' ');
	IF	Name_Cnt							&amp;gt;	_Max_Names	THEN
		_Max_Names							=	Name_Cnt;

	CALL	SYMPUTX('Rec_Cnt',		_Rec_Cnt, 	'G');
	CALL	SYMPUTX('Max_Names',	_Max_Names, 'G');

DATALINES;
ED     Blanc Angela            Teacher
ED     Albert Crew             Doctor
ED     Rubin Elvin             Engineer
ED     Joseph T. Andrew        Lawyer
TO     Luis V. Bob             Doctor
TO     J. Antonio              Teacher
TO     Luis Fernandez          Lawyer
CA     Accent Andiron          Worker
CA     Maria Jose Antonio      Student
;
RUN;

**------------------------------------------------------------------------------**;

DATA	Matched_Employment_Status;
	DROP	_:;
	DROP	City_Cd;
	DROP	Name;
	DROP	Status;
	DROP	Name_Cnt;

	ARRAY	Names		[&amp;amp;Rec_Cnt, &amp;amp;Max_Names]	$32	_TEMPORARY_;
	ARRAY	Statuses	[&amp;amp;Rec_Cnt]				$32	_TEMPORARY_;
	ARRAY	Codes		[&amp;amp;Rec_Cnt]				$2	_TEMPORARY_;
	ARRAY	Match		[&amp;amp;Rec_Cnt]				3	_TEMPORARY_;

	IF	_N_									=	1	THEN
		DO;
			DO	UNTIL	(_End_Of_Names);
				SET	Name_And_Status		END	=	_End_Of_Names;
				_j							+	1;
				Statuses[_j]				=	Status;
				Codes	[_j]				=	City_Cd;
				DO	_k						=	1	TO	Name_Cnt;
					Names[_j, _k]			=	COMPRESS(SCAN(Name, _k),,'KA');
				END;
			END;
		END;

	SET	Biographies;

	CALL	MISSING	(of	Match[*]);
	_Token_Cnt								=	COUNTW(Biography, ' ');

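	DO	_i									=	1	TO	_Token_Cnt;
		*	Strip the token to letters only, once per token, as was done for the stored names.;
		_Bio								=	COMPRESS(SCAN(Biography, _i),,'KA');
		DO	_j								=	1	TO	&amp;amp;Rec_Cnt;
			DO	_k							=	1	TO	&amp;amp;Max_Names;
				*	Skip empty tokens so that a blank never matches an unused name slot.;
				IF	_Bio					^=	' '						AND
					City_Code				=	Codes[_j]				AND
					_Bio					=	Names[_j, _k]			THEN
					DO;
						Match[_j]			+	1;
					END;
			END;
		END;
	END;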

	_Max_Match								=	0;

	DO	_j									=	1	TO	&amp;amp;Rec_Cnt;
		IF	Match[_j]						&amp;gt;=	2						THEN
			DO;
				IF	Match[_j]				&amp;gt;	_Max_Match				THEN
					DO;
						Employ_Status		=	Statuses[_j];
						_Max_Match			=	Match[_j];
					END;
			END;
	END;
RUN;
&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 25 Oct 2020 14:05:34 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693568#M25034</guid>
      <dc:creator>jimbarbour</dc:creator>
      <dc:date>2020-10-25T14:05:34Z</dc:date>
    </item>
    <item>
      <title>Re: How to compare and identify similar strings (i.e., Names) from two datasets?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693661#M25042</link>
      <description>&lt;P&gt;Thank you for the guidance. The code works with the sample data you created, but unfortunately I am stuck on creating the temporary arrays when I apply the solution to my own dataset. I get the following message in the log.&lt;/P&gt;&lt;P&gt;ARRAY Names [&amp;amp;Rec_Cnt, &amp;amp;Max_Names] $32 _TEMPORARY_;&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;-&lt;BR /&gt;&lt;FONT size="2"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;22&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="2"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;76&lt;/FONT&gt;&lt;BR /&gt;&lt;EM&gt;&lt;FONT size="2"&gt;ERROR 22-322: Syntax error, expecting one of the following: an integer constant, *.&lt;/FONT&gt;&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;&lt;FONT size="2"&gt;ERROR 76-322: Syntax error, statement will be ignored.&lt;/FONT&gt;&lt;/EM&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 23 Oct 2020 06:02:10 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693661#M25042</guid>
      <dc:creator>anasmalik13</dc:creator>
      <dc:date>2020-10-23T06:02:10Z</dc:date>
    </item>
    <item>
      <title>Re: How to compare and identify similar strings (i.e., Names) from two datasets?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693921#M25073</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/299976"&gt;@anasmalik13&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Please run the following code:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;%PUT  NOTE:  &amp;amp;=Rec_Cnt  &amp;amp;=Max_Names;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;&lt;EM&gt;Before&lt;/EM&gt; the data step with this code runs:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;ARRAY Names [&amp;amp;Rec_Cnt, &amp;amp;Max_Names] $32 _TEMPORARY_;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Rec_Cnt and Max_Names need to be set to positive integers for the arrays to work.&amp;nbsp; It looks like Rec_Cnt is not being set.&lt;/P&gt;
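&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If it is unclear whether the two macro variables exist at all, a small check along these lines can be run first.&amp;nbsp; This is only a sketch; the macro name Check_Dims is made up for this example:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;%MACRO	Check_Dims;
	%IF	%SYMEXIST(Rec_Cnt)	AND	%SYMEXIST(Max_Names)	%THEN
		%PUT	NOTE:  &amp;amp;=Rec_Cnt  &amp;amp;=Max_Names;
	%ELSE
		%PUT	WARNING:  Rec_Cnt or Max_Names does not exist yet.  Run the Name_And_Status step first.;
%MEND	Check_Dims;

%Check_Dims
&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;Like the %PUT statement above, the macro call belongs outside of any Data step, before the step that declares the arrays.&lt;/P&gt;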
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Please post the log, and please use the following buttons here on the forum for posting the log:&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="jimbarbour_0-1603487315957.png" style="width: 999px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/51015iFB4A0F052F3853F3/image-size/large?v=v2&amp;amp;px=999" role="button" title="jimbarbour_0-1603487315957.png" alt="jimbarbour_0-1603487315957.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Jim&lt;/P&gt;</description>
      <pubDate>Fri, 23 Oct 2020 21:08:58 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693921#M25073</guid>
      <dc:creator>jimbarbour</dc:creator>
      <dc:date>2020-10-23T21:08:58Z</dc:date>
    </item>
    <item>
      <title>Re: How to compare and identify similar strings (i.e., Names) from two datasets?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693930#M25074</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;Thank you again for your prompt support. I am using SAS 9.4, but unfortunately the code is still not working. It looks like I am making a mistake&lt;CODE class=" language-sas"&gt;&amp;nbsp;but I don't know what it is. Please see the following log for reference.
1    DATA    Matched_Employment_Status;
2       DROP    _:;
3       DROP    City_Cd;
4       DROP    Name;
5       DROP    Status;
6       DROP    Name_Cnt;
WARNING: Apparent symbolic reference REC_CNT not resolved.
WARNING: Apparent symbolic reference MAX_NAMES not resolved.
7   %PUT  NOTE:  &amp;amp;=Rec_Cnt  &amp;amp;=Max_Names;
NOTE:  Rec_Cnt  Max_Names
WARNING: Apparent symbolic reference REC_CNT not resolved.
WARNING: Apparent symbolic reference MAX_NAMES not resolved.
8   ARRAY   Names       [&amp;amp;Rec_Cnt, &amp;amp;Max_Names]  $32 _TEMPORARY_;
                          -
                          22
                          76
ERROR 22-322: Syntax error, expecting one of the following: an integer constant, *.

ERROR 76-322: Syntax error, statement will be ignored.

&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 23 Oct 2020 21:18:17 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693930#M25074</guid>
      <dc:creator>anasmalik13</dc:creator>
      <dc:date>2020-10-23T21:18:17Z</dc:date>
    </item>
    <item>
      <title>Re: How to compare and identify similar strings (i.e., Names) from two datasets?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693933#M25076</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/299976"&gt;@anasmalik13&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You have to create those two macro variables when you read in the name and status information.&amp;nbsp; They are created in this Data step:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;DATA	Name_And_Status;
	DROP	_:;
	RETAIN	_Max_Names	0;

	INPUT	City_Cd		&amp;amp;	:	$2.
			Name		&amp;amp;	:	$64.
			Status		&amp;amp;	:	$32.
			;

	_Rec_Cnt								+	1;
	Name_Cnt								=	COUNTW(Name, ' ');
	IF	Name_Cnt							&amp;gt;	_Max_Names	THEN
		_Max_Names							=	Name_Cnt;

	CALL	SYMPUTX('Rec_Cnt',		_Rec_Cnt, 	'G');
	CALL	SYMPUTX('Max_Names',	_Max_Names, 'G');

DATALINES;
ED     Blanc Angela            Teacher
ED     Albert Crew             Doctor
ED     Rubin Elvin             Engineer
ED     Joseph T. Andrew        Lawyer
TO     Luis V. Bob             Doctor
TO     J. Antonio              Teacher
TO     Luis Fernandez          Lawyer
CA     Accent Andiron          Worker
CA     Maria Jose Antonio      Student
;
RUN;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;Here are the two key statements:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;	CALL	SYMPUTX('Rec_Cnt',		_Rec_Cnt, 	'G');
	CALL	SYMPUTX('Max_Names',	_Max_Names, 'G');
&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;Without those two statements, the macro variables will not be created, and the code will not run in the main Data step.&amp;nbsp; The arrays use the values stored in Rec_Cnt and Max_Names to dimension the arrays, and the subsequent SAS code uses them also to control the matching.&amp;nbsp; Without those two macro variables, the code will never work.&lt;/P&gt;
&lt;PRE class="language-sas"&gt;&lt;CODE&gt;ARRAY   Names       [&amp;amp;Rec_Cnt, &amp;amp;Max_Names]  $32 _TEMPORARY_;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Jim&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 23 Oct 2020 21:27:35 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693933#M25076</guid>
      <dc:creator>jimbarbour</dc:creator>
      <dc:date>2020-10-23T21:27:35Z</dc:date>
    </item>
    <item>
      <title>Re: How to compare and identify similar strings (i.e., Names) from two datasets?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693962#M25078</link>
      <description>&lt;P&gt;Thank you for your support. I ran the code; it showed no errors in the log, but it has not produced any results either. It has been processing for two hours now. I have two datasets, with 1.6 million and 100k observations respectively. Is there a more optimized solution?&lt;/P&gt;</description>
      <pubDate>Sun, 25 Oct 2020 08:30:52 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693962#M25078</guid>
      <dc:creator>anasmalik13</dc:creator>
      <dc:date>2020-10-25T08:30:52Z</dc:date>
    </item>
    <item>
      <title>Re: How to compare and identify similar strings (i.e., Names) from two datasets?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693963#M25079</link>
      <description>&lt;P&gt;So your name and status dataset has 100K records? &amp;nbsp;And you have 1.6 million biographies? &amp;nbsp;How many words are in each biography?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;These are &lt;STRONG&gt;&lt;I&gt;very&lt;/I&gt;&lt;/STRONG&gt; large files for this type of matching. &amp;nbsp;I would want to break the files up otherwise it will take forever to run.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I would probably separate all files into individual files by city code. &amp;nbsp;You don't want to match someone in Edmonton to someone in Vancouver. &amp;nbsp;They're not one and the same. &amp;nbsp;After you separate the files into files for each city, run a separate job for each major city. &amp;nbsp;Small cities could be combined. &amp;nbsp;Run a small test file first perhaps, but then run all of the jobs simultaneously (i.e. in parallel). &amp;nbsp;Parallel processing is probably the only way to speed something of this magnitude up.&lt;/P&gt;
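&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;A minimal sketch of the split, assuming the Biographies dataset and the three city codes from the sample data.&amp;nbsp; The output dataset names are made up for this example, and a real file would need one WHEN per city code:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;DATA	Bio_ED	Bio_TO	Bio_CA	Bio_Other;
	SET	Biographies;
	*	Route each biography to the dataset for its city code.;
	SELECT	(City_Code);
		WHEN	('ED')	OUTPUT	Bio_ED;
		WHEN	('TO')	OUTPUT	Bio_TO;
		WHEN	('CA')	OUTPUT	Bio_CA;
		OTHERWISE		OUTPUT	Bio_Other;
	END;
RUN;
&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;The name and status file would be split the same way on City_Cd.&amp;nbsp; Because Rec_Cnt and Max_Names are used to dimension the arrays, they would have to be recomputed for each city's subset before its matching step runs.&lt;/P&gt;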
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Again though, be careful. &amp;nbsp;You are probably going to get multiple people with the exact same name in large cities. &amp;nbsp;Matching by name is never 100%. &amp;nbsp;It's always best to match on a unique ID number whenever possible. &amp;nbsp;You probably already know this, but it bears repeating. &amp;nbsp;Be careful and expect false matches.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Jim&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 24 Oct 2020 04:11:13 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/693963#M25079</guid>
      <dc:creator>jimbarbour</dc:creator>
      <dc:date>2020-10-24T04:11:13Z</dc:date>
    </item>
    <item>
      <title>Re: How to compare and identify similar strings (i.e., Names) from two datasets?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/694089#M25084</link>
      <description>&lt;P&gt;Hey Jim, thank you for being so supportive. I have split the datasets into chunks of at most 2-5k observations each, but SAS still takes at least 15 minutes to process each chunk. I have tried this code on three different datasets, but unfortunately it does not produce the required results. For example, I get the two columns from the biography dataset (city code &amp;amp; biography), but the third column (employment status) is empty because no match was found.&lt;/P&gt;&lt;P&gt;If you are willing to look into the issue, should I share my data?&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 25 Oct 2020 07:41:47 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/694089#M25084</guid>
      <dc:creator>anasmalik13</dc:creator>
      <dc:date>2020-10-25T07:41:47Z</dc:date>
    </item>
    <item>
      <title>Re: How to compare and identify similar strings (i.e., Names) from two datasets?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/694101#M25085</link>
      <description>&lt;P&gt;Hmm.&amp;nbsp; Odd that it's not working.&amp;nbsp; Yes, we should look at the data.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If you could post a dozen or so rows of each type of data -- enough that we should get a few matches, that would be very helpful.&amp;nbsp; I think it's best if you post the data using the same icon as you would for SAS logs.&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="jimbarbour_3-1603635149665.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/51036i02CDDED79972CB92/image-size/medium?v=v2&amp;amp;px=400" role="button" title="jimbarbour_3-1603635149665.png" alt="jimbarbour_3-1603635149665.png" /&gt;&lt;/span&gt;&lt;/P&gt;
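&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;One way to pull a dozen rows from each dataset is sketched below.&amp;nbsp; The dataset names are the ones from my sample code, so substitute your own.&amp;nbsp; The first 12 rows may not contain any matching names, in which case a WHERE clause on a single city code would give a more useful sample:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;PROC	PRINT	DATA	=	Biographies		(OBS	=	12);
RUN;

PROC	PRINT	DATA	=	Name_And_Status	(OBS	=	12);
RUN;
&lt;/CODE&gt;&lt;/PRE&gt;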
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Jim&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 25 Oct 2020 14:13:17 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/694101#M25085</guid>
      <dc:creator>jimbarbour</dc:creator>
      <dc:date>2020-10-25T14:13:17Z</dc:date>
    </item>
    <item>
      <title>Re: How to compare and identify similar strings (i.e., Names) from two datasets?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/694176#M25095</link>
      <description>&lt;P&gt;Thank you Jim for your prompt support and guidance.&lt;/P&gt;</description>
      <pubDate>Mon, 02 Nov 2020 03:28:31 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/694176#M25095</guid>
      <dc:creator>anasmalik13</dc:creator>
      <dc:date>2020-11-02T03:28:31Z</dc:date>
    </item>
    <item>
      <title>Re: How to compare and identify similar strings (i.e., Names) from two datasets?</title>
      <link>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/694181#M25096</link>
      <description>&lt;P&gt;Maybe this sounds too rude, but I have to ask: do you plan to write the code yourself, or do you expect the community to do your job?&lt;/P&gt;</description>
      <pubDate>Mon, 26 Oct 2020 08:19:01 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/How-to-compare-and-Identify-the-similar-characters-or-words-such/m-p/694181#M25096</guid>
      <dc:creator>andreas_lds</dc:creator>
      <dc:date>2020-10-26T08:19:01Z</dc:date>
    </item>
  </channel>
</rss>

