<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Compare similar names in one data in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/663300#M197969</link>
    <description>&lt;P&gt;The more variables, the more reasons to transpose. Wide datasets are design problems in most cases. You have data in structure.&lt;/P&gt;
&lt;P&gt;Even with all variables transposed, you end up with 7 million obs and 3 variables. That's not "large". Large starts at least 2 orders of magnitude above that.&lt;/P&gt;
&lt;P&gt;Look at my final step again and think about it.&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Thu, 18 Jun 2020 19:56:57 GMT</pubDate>
    <dc:creator>Kurt_Bremser</dc:creator>
    <dc:date>2020-06-18T19:56:57Z</dc:date>
    <item>
      <title>Compare similar names in one data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/662927#M197872</link>
      <description />
      <pubDate>Thu, 18 Jun 2020 12:55:14 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/662927#M197872</guid>
      <dc:creator>Emma8</dc:creator>
      <dc:date>2020-06-18T12:55:14Z</dc:date>
    </item>
    <item>
      <title>Re: Compare similar names in one data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/662930#M197874</link>
      <description>&lt;P&gt;You can try something as below&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data want;
set have;
flag_age=(a_age=b_age);
flag_name=(m_name=l_name=c_name);
run;
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Thu, 18 Jun 2020 06:41:25 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/662930#M197874</guid>
      <dc:creator>Jagadishkatam</dc:creator>
      <dc:date>2020-06-18T06:41:25Z</dc:date>
    </item>
    <item>
      <title>Re: Compare similar names in one data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/662946#M197875</link>
      <description>I do not want to write for each variables since there are many—any automated version (for example, search same suffix variables and create the flags)?</description>
      <pubDate>Thu, 18 Jun 2020 06:47:10 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/662946#M197875</guid>
      <dc:creator>Emma8</dc:creator>
      <dc:date>2020-06-18T06:47:10Z</dc:date>
    </item>
    <item>
      <title>Re: Compare similar names in one data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/663011#M197876</link>
      <description>&lt;P&gt;For numeric variables something like this might work, assuming you have number suffixes on your variables:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;Same_Age_Flag = (min(of age1-age10) = max(of age1 - age10));&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Thu, 18 Jun 2020 07:13:34 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/663011#M197876</guid>
      <dc:creator>SASKiwi</dc:creator>
      <dc:date>2020-06-18T07:13:34Z</dc:date>
    </item>
    <item>
      <title>Re: Compare similar names in one data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/663046#M197877</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;try this:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data have;
input a_age b_age m_name $ l_name $ c_name $;
cards;
1 1 A A A
1 0 A B C
;
run;

%macro getList(ds, sufix, separator=%str( ));
  %local variables rc;
  %let rc = %sysfunc(DoSubL(%str(
   options nonotes nosource nomprint nosymbolgen nomlogic;
  proc transpose 
    data = &amp;amp;ds.(obs=0) 
    out = _(where=(_Name_ like '%'||"&amp;amp;sufix." escape "\"));
  var _all_;
  run;
  proc sql noprint;
    select _name_ into :variables separated by "&amp;amp;separator." from _;
    drop table _; 
  run;
  )));
&amp;amp;variables.
%mend getList;

%put *%getList(have, \_name,separator==)*; 

options symbolgen mlogic;
  data want;
  set have;
  flag_age=(%getList(have, \_age,separator==));
  flag_name=(%getList(have, \_name,separator==));
run;
proc print;
run;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;All the best&lt;/P&gt;
&lt;P&gt;Bart&lt;/P&gt;</description>
      <pubDate>Thu, 18 Jun 2020 07:27:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/663046#M197877</guid>
      <dc:creator>yabwon</dc:creator>
      <dc:date>2020-06-18T07:27:55Z</dc:date>
    </item>
    <item>
      <title>Re: Compare similar names in one data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/663082#M197880</link>
      <description>&lt;P&gt;Solving issues like this becomes very easy with a long dataset structure, as this enables you to write the code without referencing variables literally:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data have;
input a_age b_age m_name $ l_name $ c_name $;
id = _n_; /* create an ID */
cards;
1 1 A A A
1 0 A B C
;

/* get a list of all variables except the ID */
proc sql noprint;
select name into :vars separated by " "
from dictionary.columns
where libname = 'WORK' and memname = 'HAVE' and upcase(name) ne 'ID';
quit;

proc transpose data=have out=trans (rename=(col1=value));
by id;
var &amp;amp;vars;
run;

data long;
set trans;
name = scan(_name_,-1,'_');
drop _name_;
run;

/* the following is ALL the code you need with a long structure */
proc sql;
create table want as
  select
    id,
    name,
    (count(distinct value) = 1) as flag
  from long
  group by id, name
;
quit;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;See Maxim 19: Long Beats Wide.&lt;/P&gt;</description>
      <pubDate>Thu, 18 Jun 2020 08:02:48 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/663082#M197880</guid>
      <dc:creator>Kurt_Bremser</dc:creator>
      <dc:date>2020-06-18T08:02:48Z</dc:date>
    </item>
    <item>
      <title>Re: Compare similar names in one data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/663274#M197954</link>
      <description>Thank you. The data is very large (over 10000 obs and about 700 variables ) —so anyway that without transpose the data ?</description>
      <pubDate>Thu, 18 Jun 2020 18:56:14 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/663274#M197954</guid>
      <dc:creator>Emma8</dc:creator>
      <dc:date>2020-06-18T18:56:14Z</dc:date>
    </item>
    <item>
      <title>Re: Compare similar names in one data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/663300#M197969</link>
      <description>&lt;P&gt;The more variables, the more reasons to transpose. Wide datasets are design problems in most cases. You have data in structure.&lt;/P&gt;
&lt;P&gt;Even with all variables transposed, you end up with 7 million obs and 3 variables. That's not "large". Large starts at least 2 orders of magnitude above that.&lt;/P&gt;
&lt;P&gt;Look at my final step again and think about it.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 18 Jun 2020 19:56:57 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/663300#M197969</guid>
      <dc:creator>Kurt_Bremser</dc:creator>
      <dc:date>2020-06-18T19:56:57Z</dc:date>
    </item>
    <item>
      <title>Re: Compare similar names in one data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/663461#M198053</link>
      <description>Sas could not transpose (warning message displayed as out of resource )—try to create over 30 000 000 000</description>
      <pubDate>Fri, 19 Jun 2020 12:44:31 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/663461#M198053</guid>
      <dc:creator>Emma8</dc:creator>
      <dc:date>2020-06-19T12:44:31Z</dc:date>
    </item>
    <item>
      <title>Re: Compare similar names in one data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/663472#M198060</link>
      <description>&lt;P&gt;A transpose of 700 variables out of 10000 observations can NOT end up with more than 7 million observations.&lt;/P&gt;
&lt;P&gt;Did you use an id variable that identifies each individual observation?&lt;/P&gt;
&lt;P&gt;Please post the whole log of the transpose step.&lt;/P&gt;</description>
      <pubDate>Fri, 19 Jun 2020 13:11:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/663472#M198060</guid>
      <dc:creator>Kurt_Bremser</dc:creator>
      <dc:date>2020-06-19T13:11:54Z</dc:date>
    </item>
    <item>
      <title>Re: Compare similar names in one data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/663484#M198064</link>
      <description>Sorry, almost 30 000 obs and 1200 variables</description>
      <pubDate>Fri, 19 Jun 2020 13:40:52 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/663484#M198064</guid>
      <dc:creator>Emma8</dc:creator>
      <dc:date>2020-06-19T13:40:52Z</dc:date>
    </item>
    <item>
      <title>Re: Compare similar names in one data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/663491#M198068</link>
      <description>&lt;P&gt;So you get 36 million observations. Where is the problem?&lt;/P&gt;</description>
      <pubDate>Fri, 19 Jun 2020 13:55:37 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/663491#M198068</guid>
      <dc:creator>Kurt_Bremser</dc:creator>
      <dc:date>2020-06-19T13:55:37Z</dc:date>
    </item>
    <item>
      <title>Re: Compare similar names in one data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/663498#M198073</link>
      <description>Yeah, I do not have enough memory</description>
      <pubDate>Fri, 19 Jun 2020 14:09:51 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Compare-similar-names-in-one-data/m-p/663498#M198073</guid>
      <dc:creator>Emma8</dc:creator>
      <dc:date>2020-06-19T14:09:51Z</dc:date>
    </item>
  </channel>
</rss>

