<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: DI Studio: Join w/ Contains in SAS Data Management</title>
    <link>https://communities.sas.com/t5/SAS-Data-Management/DI-Studio-Join-w-Contains-Operator/m-p/255520#M6958</link>
    <description>&lt;P&gt;We are in the process of getting SAS DataFlux set up.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;In the meantime, we're going to manually put together a table of specific searches and create categories for them (something we'd probably do in a QKB). What I'm doing here in DI Studio is sort of an ad hoc utility to help us on our way.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;For this case, I found what I needed. Apparently, in the Join transformation the operand following the CONTAINS operator needs to be within STRIP(). Along with the sounds like operator, I'm able to get some general groupings together that we can manually analyze to determine what categories we may potentially want to create.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks for being on top of responding to questions, Linus!&lt;/P&gt;</description>
    <pubDate>Wed, 09 Mar 2016 15:37:22 GMT</pubDate>
    <dc:creator>jwhite</dc:creator>
    <dc:date>2016-03-09T15:37:22Z</dc:date>
    <item>
      <title>DI Studio: Join w/ Contains Operator</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/DI-Studio-Join-w-Contains-Operator/m-p/255015#M6910</link>
      <description>&lt;P&gt;I've got a dataset that has a bunch of searched terms pulled down from Google Analytics. What I want to do is group these terms to find categories I can start classifying them as.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;For example, the following may all have been searched for:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Breast Cancer&lt;/LI&gt;
&lt;LI&gt;Cancer&lt;/LI&gt;
&lt;LI&gt;Lung Cancer&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;To find these groupings I've created a second copy of this dataset, and am attempting to join them in DI Studio so that rows are joined if 1.search contains 2.search or 1.search =* (sounds like) 2.search.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;It doesn't seem to be working though. I guess I'm expecting output from both tables joined as one like:&lt;/P&gt;
&lt;P&gt;search_1&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; search_2&lt;/P&gt;
&lt;P&gt;Breast Cancer&amp;nbsp; Breast Cancer&lt;/P&gt;
&lt;P&gt;Cancer&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Breast Cancer&lt;/P&gt;
&lt;P&gt;Cancer&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Cancer&lt;/P&gt;
&lt;P&gt;Cancer&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Lung Cancer&lt;/P&gt;
&lt;P&gt;Lung Cancer&amp;nbsp;&amp;nbsp;&amp;nbsp; Lung Cancer&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;This isn't the ideal final format, but it would at least group the terms somewhat together so I wouldn't have to comb through all these records manually...and make mistakes.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I think the sounds like is working a little, but I'm not getting any of these CANCER examples which I would expect. After all, I get them when I do a simple filter using contains ('CANCER') on a given dataset.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Any suggestions?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 07 Mar 2016 20:56:18 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/DI-Studio-Join-w-Contains-Operator/m-p/255015#M6910</guid>
      <dc:creator>jwhite</dc:creator>
      <dc:date>2016-03-07T20:56:18Z</dc:date>
    </item>
    <item>
      <title>Re: DI Studio: Join w/ Contains</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/DI-Studio-Join-w-Contains-Operator/m-p/255069#M6923</link>
      <description>&lt;P&gt;Is it not possible for the second parameter in the CONTAINS statement to be a column? Should this be done via code instead of the DI Studio Join transformation?&lt;/P&gt;</description>
      <pubDate>Mon, 07 Mar 2016 19:30:49 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/DI-Studio-Join-w-Contains-Operator/m-p/255069#M6923</guid>
      <dc:creator>jwhite</dc:creator>
      <dc:date>2016-03-07T19:30:49Z</dc:date>
    </item>
    <item>
      <title>Re: DI Studio: Join w/ Contains</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/DI-Studio-Join-w-Contains-Operator/m-p/255109#M6928</link>
      <description>&lt;P&gt;I'm not sure if I understand what you are trying to do - building diagnosis groups using sounds like or similar algorithms?&lt;/P&gt;
&lt;P&gt;If you get a match table, how will you&amp;nbsp;proceed from there?&lt;/P&gt;
&lt;P&gt;There is software specially designed for fuzzy logic and similar techniques to normalize&amp;nbsp;text strings. As an example, take a look at the SAS DataFlux suite of products.&lt;/P&gt;
&lt;P&gt;Another thought: this approach feels a bit ad hoc and exploratory. DI Studio is the tool if you know what to do, and want to do it regularly, in an automated way. Perhaps you should try some more&amp;nbsp;exploratory tools as a start, and then move to DI Studio when you are closer to a solution.&lt;/P&gt;</description>
      <pubDate>Mon, 07 Mar 2016 21:49:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/DI-Studio-Join-w-Contains-Operator/m-p/255109#M6928</guid>
      <dc:creator>LinusH</dc:creator>
      <dc:date>2016-03-07T21:49:29Z</dc:date>
    </item>
    <item>
      <title>Re: DI Studio: Join w/ Contains</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/DI-Studio-Join-w-Contains-Operator/m-p/255520#M6958</link>
      <description>&lt;P&gt;We are in the process of getting SAS DataFlux set up.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;In the meantime, we're going to manually put together a table of specific searches and create categories for them (something we'd probably do in a QKB). What I'm doing here in DI Studio is sort of an ad hoc utility to help us on our way.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;For this case, I found what I needed. Apparently, in the Join transformation the operand following the CONTAINS operator needs to be within STRIP(). Along with the sounds like operator, I'm able to get some general groupings together that we can manually analyze to determine what categories we may potentially want to create.&lt;/P&gt;
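&lt;P&gt;A minimal sketch of why STRIP() likely makes the difference, assuming the cause is that SAS character columns are fixed-width and padded with trailing blanks, so an unstripped operand only matches when its padding matches too. The thread does not confirm this cause. The sketch is in Python rather than SAS, and the names pad and contains_join and the column width of 20 are hypothetical, chosen for illustration only:&lt;/P&gt;

```python
def pad(value, width=20):
    """Mimic a fixed-width character column by right-padding with blanks."""
    return value.ljust(width)


# Hypothetical sample data: the three search terms from the original question.
terms = [pad(t) for t in ("Breast Cancer", "Cancer", "Lung Cancer")]


def contains_join(rows, strip_operand):
    """Self-join: keep (left, right) pairs where right contains left."""
    pairs = []
    for left in rows:
        # Without stripping, the trailing blanks are part of the operand.
        needle = left.strip() if strip_operand else left
        for right in rows:
            if needle in right:
                pairs.append((left.strip(), right.strip()))
    return pairs


unstripped = contains_join(terms, strip_operand=False)
stripped = contains_join(terms, strip_operand=True)

print(unstripped)  # each term matches only itself
print(stripped)    # "Cancer" also matches "Breast Cancer" and "Lung Cancer"
```

&lt;P&gt;With the operand stripped, the sketch yields the five pairs listed in the original question; with the padding left in place, each term only matches itself.&lt;/P&gt;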
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks for being on top of responding to questions, Linus!&lt;/P&gt;</description>
      <pubDate>Wed, 09 Mar 2016 15:37:22 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/DI-Studio-Join-w-Contains-Operator/m-p/255520#M6958</guid>
      <dc:creator>jwhite</dc:creator>
      <dc:date>2016-03-09T15:37:22Z</dc:date>
    </item>
    <item>
      <title>Re: DI Studio: Join w/ Contains Operator</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/DI-Studio-Join-w-Contains-Operator/m-p/255597#M6959</link>
      <description>Cheers and good luck!</description>
      <pubDate>Wed, 09 Mar 2016 17:56:44 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/DI-Studio-Join-w-Contains-Operator/m-p/255597#M6959</guid>
      <dc:creator>LinusH</dc:creator>
      <dc:date>2016-03-09T17:56:44Z</dc:date>
    </item>
  </channel>
</rss>

