<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Merging two databases with two common variables, one of common variables have inexact characters in SAS Data Management</title>
    <link>https://communities.sas.com/t5/SAS-Data-Management/Merging-two-databases-with-two-common-variables-one-of-common/m-p/426007#M13133</link>
    <description>&lt;P&gt;Your first step will be to get to know your data and identify the data quality problems in detail. From there you can develop the necessary steps to rectify them.&lt;/P&gt;
&lt;P&gt;Run proc contents first to see if you have discrepancies in terms of variable types and variable sizes.&lt;/P&gt;
&lt;P&gt;Then run proc sort with nodupkey on your datasets (use the out= option to create new, intermediate datasets), compare the results, and check whether you have multiple entries for the same business object.&lt;/P&gt;
&lt;P&gt;Now, if you have SAS Data Quality licensed, this tool can do most of this work for you.&lt;/P&gt;</description>
    <pubDate>Tue, 09 Jan 2018 09:35:05 GMT</pubDate>
    <dc:creator>Kurt_Bremser</dc:creator>
    <dc:date>2018-01-09T09:35:05Z</dc:date>
    <item>
      <title>Merging two databases with two common variables, one of common variables have inexact characters</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Merging-two-databases-with-two-common-variables-one-of-common/m-p/425974#M13130</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I want to merge two databases: 1) KLD and 2) Compustat.&lt;/P&gt;&lt;P&gt;They have two common variables: 1) Year and 2) Company Name.&lt;/P&gt;&lt;P&gt;Because my data are panel data, I have to use both common variables to merge.&lt;/P&gt;&lt;P&gt;However, the company names are not exactly the same in the two databases (they may differ in capitalization, abbreviations, spelling errors, etc.).&lt;/P&gt;&lt;P&gt;Could you please let me know how to code this merge?&lt;/P&gt;&lt;P&gt;I am not familiar with SAS, but I have found out that SAS can solve this problem.&lt;/P&gt;&lt;P&gt;If somebody could give me the complete code for this purpose, I would be very thankful.&lt;/P&gt;&lt;P&gt;Kind regards,&lt;/P&gt;&lt;P&gt;FA1&lt;/P&gt;</description>
      <pubDate>Tue, 09 Jan 2018 06:51:14 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Merging-two-databases-with-two-common-variables-one-of-common/m-p/425974#M13130</guid>
      <dc:creator>FA1</dc:creator>
      <dc:date>2018-01-09T06:51:14Z</dc:date>
    </item>
    <item>
      <title>Re: Merging two databases with two common variables, one of common variables have inexact characters</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Merging-two-databases-with-two-common-variables-one-of-common/m-p/425996#M13132</link>
      <description>&lt;P&gt;You will have to fix all those defects to merge the datasets. Depending on the modules you have licensed, this is more or less painful.&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;Convert the company names to uppercase.&lt;/LI&gt;
&lt;LI&gt;Replace abbreviations with one consistent form.&lt;/LI&gt;
&lt;LI&gt;Find and fix spelling errors.&lt;/LI&gt;
&lt;/OL&gt;
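&lt;P&gt;The three steps above could be sketched roughly as follows. This is only a minimal sketch under assumptions: the dataset names kld and compustat, the variables year and company_name, the sample rows, the suffix list, and the distance cutoff of 100 are all made up for illustration. COMPGED only proposes candidate pairs with similar spelling; they still have to be reviewed by hand.&lt;/P&gt;
&lt;PRE&gt;&lt;CODE&gt;
/* Made-up sample rows; replace with your real KLD and Compustat extracts. */
data kld;
  input year 1-4 company_name $ 6-45;
  datalines;
2015 Acme Corp.
2015 Globex Incorporated
;
run;

data compustat;
  input year 1-4 company_name $ 6-45;
  datalines;
2015 ACME CORPORATION
2015 Globexx Inc
;
run;

/* Steps 1 and 2: build a normalized matching key in both datasets. */
data kld_clean comp_clean;
  length company_name $200 name_key $200;
  set kld(in=from_kld keep=year company_name)
      compustat(keep=year company_name);
  /* Uppercase, remove periods and commas, collapse repeated blanks. */
  name_key = compbl(compress(upcase(strip(company_name)), '.,'));
  /* Strip one trailing legal suffix (illustrative list, extend as needed). */
  name_key = prxchange(
    's/\s+(INCORPORATED|INC|CORPORATION|CORP|COMPANY|CO|LIMITED|LTD)\s*$//',
    1, name_key);
  if from_kld then output kld_clean;
  else output comp_clean;
run;

/* Step 3: within each year, list pairs whose keys are close in spelling. */
/* A generalized edit distance of 0 means the keys are identical.        */
proc sql;
  create table candidates as
  select k.year,
         k.company_name as kld_name,
         c.company_name as comp_name,
         compged(strip(k.name_key), strip(c.name_key)) as ged
  from kld_clean as k, comp_clean as c
  where k.year = c.year
    and calculated ged le 100
  order by k.year, kld_name, ged;
quit;
&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;The join compares every KLD name with every Compustat name of the same year, so it can become slow on large extracts. Restricting it to names that did not match exactly keeps the candidate list manageable.&lt;/P&gt;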
&lt;P&gt;Sorry, but data-cleaning is not the most joyful task one can have.&lt;/P&gt;</description>
      <pubDate>Tue, 09 Jan 2018 09:16:48 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Merging-two-databases-with-two-common-variables-one-of-common/m-p/425996#M13132</guid>
      <dc:creator>andreas_lds</dc:creator>
      <dc:date>2018-01-09T09:16:48Z</dc:date>
    </item>
    <item>
      <title>Re: Merging two databases with two common variables, one of common variables have inexact characters</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Merging-two-databases-with-two-common-variables-one-of-common/m-p/426007#M13133</link>
      <description>&lt;P&gt;Your first step will be to get to know your data and identify the data quality problems in detail. From there you can develop the necessary steps to rectify them.&lt;/P&gt;
&lt;P&gt;Run proc contents first to see if you have discrepancies in terms of variable types and variable sizes.&lt;/P&gt;
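&lt;P&gt;As a rough sketch, assuming the two datasets already exist in WORK under the hypothetical names kld and compustat, the metadata can be written to datasets and compared, so that only the common variables with differing type or length are listed:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE&gt;
/* Write the variable metadata of each dataset to a table. */
proc contents data=kld noprint
  out=kld_vars(keep=name type length);
run;

proc contents data=compustat noprint
  out=comp_vars(keep=name type length);
run;

/* In the OUT= dataset, type 1 is numeric and type 2 is character. */
proc sql;
  select upcase(k.name) as variable,
         k.type   as kld_type,
         c.type   as comp_type,
         k.length as kld_length,
         c.length as comp_length
  from kld_vars as k inner join comp_vars as c
    on upcase(k.name) = upcase(c.name)
  where k.type ne c.type or k.length ne c.length;
quit;
&lt;/CODE&gt;&lt;/PRE&gt;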
&lt;P&gt;Then run proc sort with nodupkey on your datasets (use the out= option to create new, intermediate datasets), compare the results, and check whether you have multiple entries for the same business object.&lt;/P&gt;
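&lt;P&gt;A minimal sketch of that duplicate check, again assuming hypothetical datasets kld and compustat with the key variables year and company_name. The dupout= option is an addition to the advice above; it keeps the rejected observations so they can be inspected:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE&gt;
/* Keep one row per key in the OUT= dataset; removed rows go to DUPOUT=. */
proc sort data=kld out=kld_nodup nodupkey dupout=kld_dups;
  by year company_name;
run;

proc sort data=compustat out=comp_nodup nodupkey dupout=comp_dups;
  by year company_name;
run;

/* Any rows printed here are repeated year/company_name combinations. */
proc print data=kld_dups;
run;

proc print data=comp_dups;
run;
&lt;/CODE&gt;&lt;/PRE&gt;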
&lt;P&gt;Now, if you have SAS Data Quality licensed, this tool can do most of this work for you.&lt;/P&gt;</description>
      <pubDate>Tue, 09 Jan 2018 09:35:05 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Merging-two-databases-with-two-common-variables-one-of-common/m-p/426007#M13133</guid>
      <dc:creator>Kurt_Bremser</dc:creator>
      <dc:date>2018-01-09T09:35:05Z</dc:date>
    </item>
    <item>
      <title>Re: Merging two databases with two common variables, one of common variables have inexact characters</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Merging-two-databases-with-two-common-variables-one-of-common/m-p/426131#M13135</link>
      <description>&lt;P&gt;KLD and Compustat also have other common variables: CIK (Central Index Key, issued by the SEC) and stock ticker. The stock ticker for a given company can change over time, so you would have to be aware of synchronization issues. And the database vendors may not have fully populated CIK. But I'd suggest getting those variables for matching purposes.&lt;/P&gt;</description>
      <pubDate>Tue, 09 Jan 2018 15:12:51 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Merging-two-databases-with-two-common-variables-one-of-common/m-p/426131#M13135</guid>
      <dc:creator>mkeintz</dc:creator>
      <dc:date>2018-01-09T15:12:51Z</dc:date>
    </item>
    <item>
      <title>Re: Merging two databases with two common variables, one of common variables have inexact characters</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Merging-two-databases-with-two-common-variables-one-of-common/m-p/428681#M13218</link>
      <description>&lt;P&gt;Dear Kurt_Bremser,&lt;/P&gt;&lt;P&gt;Thank you so much.&lt;/P&gt;&lt;P&gt;Yes, SAS Data Quality Server is the best option for me to solve this problem.&lt;/P&gt;&lt;P&gt;Actually, there are two ways to overcome this problem: 1) a scoring method and 2) fuzzy matching using SAS Data Quality Server. I have attached a PDF that explains these two ways very well. It was very helpful for me.&lt;/P&gt;&lt;P&gt;Kind regards,&lt;/P&gt;&lt;P&gt;FA1&lt;/P&gt;</description>
      <pubDate>Thu, 18 Jan 2018 04:27:30 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Merging-two-databases-with-two-common-variables-one-of-common/m-p/428681#M13218</guid>
      <dc:creator>FA1</dc:creator>
      <dc:date>2018-01-18T04:27:30Z</dc:date>
    </item>
  </channel>
</rss>

