<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: I have a big Dataset (1+ Million records) that have four columns and I want to compare all coloumns with particular value and then need to set some Flags if value matches and also if not matches? in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/I-have-a-big-Dataset-1-Million-records-that-have-four-columns/m-p/193725#M36415</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Check your hardware settings, even UE is coming with the option set on about 2-3Gb. Your desktop is coming with at least 4Gb and can be easily 12G. Mostly server sessions even far higher.&lt;/P&gt;&lt;P&gt;With those low values (128 Mb) the IO OS systemcache will do. Optimizing buffersize at eg 128kb&amp;nbsp; and there will a 1000 buffers needs. Unless you are needing split second response time looking not a real issue.&lt;BR /&gt;Your selection looks simple (not complicated) your data looks simple (not complicated) unless you are having some challenges there I do not see a lot to optimize with all sequential processing on all data (fastest was) &lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Sat, 11 Apr 2015 14:12:39 GMT</pubDate>
    <dc:creator>jakarman</dc:creator>
    <dc:date>2015-04-11T14:12:39Z</dc:date>
    <item>
      <title>I have a big Dataset (1+ Million records) that have four columns and I want to compare all coloumns with particular value and then need to set some Flags if value matches and also if not matches?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/I-have-a-big-Dataset-1-Million-records-that-have-four-columns/m-p/193721#M36411</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;dataset k;&lt;/P&gt;&lt;P&gt;input a b c d;&lt;/P&gt;&lt;P&gt;datalines;&lt;/P&gt;&lt;P&gt;1 2 4 4&lt;/P&gt;&lt;P&gt;1 3 3 4&lt;/P&gt;&lt;P&gt;2 3 4 4&lt;/P&gt;&lt;P&gt;so on ;&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;data k;&lt;/P&gt;&lt;P&gt;set k;&lt;/P&gt;&lt;P&gt;if( a=xyz and b=lmn and c=hjk and d=cde) then;&lt;/P&gt;&lt;P&gt;flag1=x1;&lt;/P&gt;&lt;P&gt;flag2=x2;&lt;/P&gt;&lt;P&gt;flag3=x3;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;else;&lt;/P&gt;&lt;P&gt;flag1=y1;&lt;/P&gt;&lt;P&gt;flag2=y2;&lt;/P&gt;&lt;P&gt;flag3=y3;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I am worried about the execution time of the above. Could anybody help me optimizing the above problem. One solution I have is to index the four columns. Please provide some useful solution to optimize it in best possible manner.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks in advance.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sat, 11 Apr 2015 11:26:01 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/I-have-a-big-Dataset-1-Million-records-that-have-four-columns/m-p/193721#M36411</guid>
      <dc:creator>Paari</dc:creator>
      <dc:date>2015-04-11T11:26:01Z</dc:date>
    </item>
    <item>
      <title>Re: I have a big Dataset (1+ Million records) that have four columns and I want to compare all coloumns with particular value and then need to set some Flags if value matches and also if not matches?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/I-have-a-big-Dataset-1-Million-records-that-have-four-columns/m-p/193722#M36412</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;1M records&amp;nbsp; 4 numeric of length 8 bytes that is 16Mbyte having 16 numeric 16Mbyte something so small&amp;nbsp; it will be cached in memory.&amp;nbsp; Nothing to bother about. Do you have specs that is more big? &lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sat, 11 Apr 2015 11:46:50 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/I-have-a-big-Dataset-1-Million-records-that-have-four-columns/m-p/193722#M36412</guid>
      <dc:creator>jakarman</dc:creator>
      <dc:date>2015-04-11T11:46:50Z</dc:date>
    </item>
    <item>
      <title>Re: I have a big Dataset (1+ Million records) that have four columns and I want to compare all coloumns with particular value and then need to set some Flags if value matches and also if not matches?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/I-have-a-big-Dataset-1-Million-records-that-have-four-columns/m-p/193723#M36413</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;&lt;A __default_attr="8872" __jive_macro_name="user" class="jive_macro jive_macro_user" data-objecttype="3" href="https://communities.sas.com/"&gt;&lt;/A&gt; Thank you for the reply. But the dataset is much bigger more 4 Millions observations and we have limited resource( Executing Capacity). So I am thinking to optimize in some way. If there is any possible way to reduce the execution time please let me know. &lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sat, 11 Apr 2015 12:59:19 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/I-have-a-big-Dataset-1-Million-records-that-have-four-columns/m-p/193723#M36413</guid>
      <dc:creator>Paari</dc:creator>
      <dc:date>2015-04-11T12:59:19Z</dc:date>
    </item>
    <item>
      <title>Re: I have a big Dataset (1+ Million records) that have four columns and I want to compare all coloumns with particular value and then need to set some Flags if value matches and also if not matches?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/I-have-a-big-Dataset-1-Million-records-that-have-four-columns/m-p/193724#M36414</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;if dataset has 4 millions row&amp;nbsp; then there will 128Mbyte data to cached and I don't know how well will SAS be able to tackle it with its cached memory. &lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sat, 11 Apr 2015 13:03:09 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/I-have-a-big-Dataset-1-Million-records-that-have-four-columns/m-p/193724#M36414</guid>
      <dc:creator>Paari</dc:creator>
      <dc:date>2015-04-11T13:03:09Z</dc:date>
    </item>
    <item>
      <title>Re: I have a big Dataset (1+ Million records) that have four columns and I want to compare all coloumns with particular value and then need to set some Flags if value matches and also if not matches?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/I-have-a-big-Dataset-1-Million-records-that-have-four-columns/m-p/193725#M36415</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Check your hardware settings, even UE is coming with the option set on about 2-3Gb. Your desktop is coming with at least 4Gb and can be easily 12G. Mostly server sessions even far higher.&lt;/P&gt;&lt;P&gt;With those low values (128 Mb) the IO OS systemcache will do. Optimizing buffersize at eg 128kb&amp;nbsp; and there will a 1000 buffers needs. Unless you are needing split second response time looking not a real issue.&lt;BR /&gt;Your selection looks simple (not complicated) your data looks simple (not complicated) unless you are having some challenges there I do not see a lot to optimize with all sequential processing on all data (fastest was) &lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sat, 11 Apr 2015 14:12:39 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/I-have-a-big-Dataset-1-Million-records-that-have-four-columns/m-p/193725#M36415</guid>
      <dc:creator>jakarman</dc:creator>
      <dc:date>2015-04-11T14:12:39Z</dc:date>
    </item>
    <item>
      <title>Re: I have a big Dataset (1+ Million records) that have four columns and I want to compare all coloumns with particular value and then need to set some Flags if value matches and also if not matches?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/I-have-a-big-Dataset-1-Million-records-that-have-four-columns/m-p/193726#M36416</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Like Jaap said, 4 million is NOT a large dataset for SAS.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I'd be more concerned about whether your code is doing what you want. You have:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="font-size: 13px; font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;if( a=xyz and b=lmn and c=hjk and d=cde) then;&lt;/P&gt;&lt;P style="font-size: 13px; font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;flag1=x1;&lt;/P&gt;&lt;P style="font-size: 13px; font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;flag2=x2;&lt;/P&gt;&lt;P style="font-size: 13px; font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;flag3=x3;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="font-size: 13px; font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;else;&lt;/P&gt;&lt;P style="font-size: 13px; font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;flag1=y1;&lt;/P&gt;&lt;P style="font-size: 13px; font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;flag2=y2;&lt;/P&gt;&lt;P style="font-size: 13px; font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;flag3=y3;&lt;/P&gt;&lt;P style="font-size: 13px; font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&lt;/P&gt;&lt;P style="font-size: 13px; font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;What do xya, lmn, hjk and cde represent? If they're not variables, the above code will fail.&lt;/P&gt;&lt;P style="font-size: 13px; font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&lt;/P&gt;&lt;P style="font-size: 13px; font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;Also, the code doesn't make sense as written. Don't you really want something like?:&lt;/P&gt;&lt;P style="font-size: 13px; font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&lt;/P&gt;&lt;P style="font-size: 13px; font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;if( a=xyz and b=lmn and c=hjk and d=cde) then do;&lt;/P&gt;&lt;P style="font-size: 13px; font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&amp;nbsp; flag1=x1;&lt;/P&gt;&lt;P style="font-size: 13px; font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&amp;nbsp; flag2=x2;&lt;/P&gt;&lt;P style="font-size: 13px; font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&amp;nbsp; flag3=x3;&lt;/P&gt;&lt;P style="font-size: 13px; font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;end;&lt;/P&gt;&lt;P style="font-size: 13px; font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;else do;&lt;/P&gt;&lt;P style="font-size: 13px; font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&amp;nbsp; flag1=y1;&lt;/P&gt;&lt;P style="font-size: 13px; font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&amp;nbsp; flag2=y2;&lt;/P&gt;&lt;P style="font-size: 13px; font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&amp;nbsp; flag3=y3;&lt;/P&gt;&lt;P style="font-size: 13px; font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;end;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sat, 11 Apr 2015 14:59:06 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/I-have-a-big-Dataset-1-Million-records-that-have-four-columns/m-p/193726#M36416</guid>
      <dc:creator>art297</dc:creator>
      <dc:date>2015-04-11T14:59:06Z</dc:date>
    </item>
    <item>
      <title>Re: I have a big Dataset (1+ Million records) that have four columns and I want to compare all coloumns with particular value and then need to set some Flags if value matches and also if not matches?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/I-have-a-big-Dataset-1-Million-records-that-have-four-columns/m-p/193727#M36417</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Even 4 million records with 4 numeric variables is EXTREMELY SMALL for SAS. I'd say that at this time, "big" starts at files well above 10 GB in size.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;If you have trouble, show the log of your data step with fullstimer enabled.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 13 Apr 2015 05:54:32 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/I-have-a-big-Dataset-1-Million-records-that-have-four-columns/m-p/193727#M36417</guid>
      <dc:creator>Kurt_Bremser</dc:creator>
      <dc:date>2015-04-13T05:54:32Z</dc:date>
    </item>
    <item>
      <title>Re: I have a big Dataset (1+ Million records) that have four columns and I want to compare all coloumns with particular value and then need to set some Flags if value matches and also if not matches?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/I-have-a-big-Dataset-1-Million-records-that-have-four-columns/m-p/193728#M36418</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Forgetting the fact that your dataset is too small for a significant performance improvement in SAS , fix your code by including where clause instead of if statements, which will give you a better performance gain theoretically.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 13 Apr 2015 06:24:30 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/I-have-a-big-Dataset-1-Million-records-that-have-four-columns/m-p/193728#M36418</guid>
      <dc:creator>ven</dc:creator>
      <dc:date>2015-04-13T06:24:30Z</dc:date>
    </item>
    <item>
      <title>Re: I have a big Dataset (1+ Million records) that have four columns and I want to compare all coloumns with particular value and then need to set some Flags if value matches and also if not matches?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/I-have-a-big-Dataset-1-Million-records-that-have-four-columns/m-p/193729#M36419</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Do all records receive the same set of flags when match occurs and similarly other set of flags when match fails? Do the flags are CONSTANT?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 13 Apr 2015 06:47:59 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/I-have-a-big-Dataset-1-Million-records-that-have-four-columns/m-p/193729#M36419</guid>
      <dc:creator>KachiM</dc:creator>
      <dc:date>2015-04-13T06:47:59Z</dc:date>
    </item>
  </channel>
</rss>

