<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: proc sql select certain distinct variables only in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/proc-sql-select-certain-distinct-variables-only/m-p/590343#M168944</link>
    <description>&lt;P&gt;Thanks for your reply. Think about it again, would it matter whether I include 'counted' for distinct variables, because my understanding is that as counted = 'Yes' for every observation, it would not affect the final result as the account_id and month variables are effectively where any potential duplicates would be identified (i.e. if two observations have the same account_id and month, the 'counted' variable would not make a difference as all of the observations are 'Yes' anyway)?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;proc sql;&lt;BR /&gt;&amp;nbsp; &amp;nbsp;create table unique_accounts as&lt;BR /&gt;&amp;nbsp; &amp;nbsp;select&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;FONT color="#FF0000"&gt;distinct account_id&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT color="#FF0000"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ,month&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;FONT color="#FF0000"&gt;,counted&lt;/FONT&gt;&lt;BR /&gt;&amp;nbsp; &amp;nbsp;from test;&lt;/P&gt;&lt;P&gt;quit;&lt;/P&gt;</description>
    <pubDate>Fri, 20 Sep 2019 11:57:28 GMT</pubDate>
    <dc:creator>jeremy4</dc:creator>
    <dc:date>2019-09-20T11:57:28Z</dc:date>
    <item>
      <title>proc sql select certain distinct variables only</title>
      <link>https://communities.sas.com/t5/SAS-Programming/proc-sql-select-certain-distinct-variables-only/m-p/590327#M168935</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have three variables in my dataset:&lt;/P&gt;&lt;P&gt;Account_id&lt;/P&gt;&lt;P&gt;Month&lt;/P&gt;&lt;P&gt;Counted (I used the retain statement, so that the Counted variable in the 'test' dataset has a value of 'Yes' for all of the observations in the dataset).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If I only want &lt;FONT color="#FF0000"&gt;distinct values of account_id and month only&lt;/FONT&gt; (i.e. &lt;FONT color="#FF0000"&gt;keep only one of the observations&lt;/FONT&gt; when there is BOTH the same account_ID AND month) but exclude the Counted variable from being used for distinct (as all of the observations will have a value of 'Yes' in the dataset), is the code below correct? Thanks!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;proc sql;&lt;BR /&gt;&amp;nbsp; &amp;nbsp;create table unique_accounts as&lt;BR /&gt;&amp;nbsp; &amp;nbsp;select &lt;FONT color="#FF0000"&gt;distinct account_id&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT color="#FF0000"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ,month&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ,&lt;FONT color="#FF0000"&gt;Counted&lt;/FONT&gt;&lt;BR /&gt;&amp;nbsp; &amp;nbsp;from test;&lt;/P&gt;&lt;P&gt;quit;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;U&gt;Partial example&amp;nbsp;&lt;/U&gt;&lt;/P&gt;&lt;P&gt;Account ID&amp;nbsp; &amp;nbsp; &amp;nbsp; Month&amp;nbsp; &amp;nbsp; &amp;nbsp; Counted&lt;/P&gt;&lt;P&gt;1&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201801&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/P&gt;&lt;P&gt;2&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201807&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/P&gt;&lt;P&gt;3&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201804&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/P&gt;&lt;P&gt;4&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201809&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/P&gt;&lt;P&gt;2&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201807&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Code should only keep one observation (account ID = 2, month=201807, Counted = Yes).&amp;nbsp;&lt;/P&gt;&lt;P&gt;Account ID&amp;nbsp; &amp;nbsp; &amp;nbsp; Month&amp;nbsp; &amp;nbsp; &amp;nbsp; Counted&lt;/P&gt;&lt;P&gt;1&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201801&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/P&gt;&lt;P&gt;2&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201807&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/P&gt;&lt;P&gt;3&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201804&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/P&gt;&lt;P&gt;4&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201809&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/P&gt;&lt;P&gt;&lt;STRIKE&gt;2&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201807&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/STRIKE&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I used the following code below (which works), but I also want the 'unique_accounts' table to include the variable 'Counted' (though 'Counted' should not be part of the distinct').&lt;/P&gt;&lt;P&gt;proc sql;&lt;BR /&gt;&amp;nbsp; &amp;nbsp;create table unique_accounts as&lt;BR /&gt;&amp;nbsp; &amp;nbsp;select &lt;FONT color="#FF0000"&gt;distinct account_id&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT color="#FF0000"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ,month&lt;/FONT&gt;&lt;BR /&gt;&amp;nbsp; &amp;nbsp;from test;&lt;/P&gt;&lt;P&gt;quit;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 20 Sep 2019 11:29:21 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/proc-sql-select-certain-distinct-variables-only/m-p/590327#M168935</guid>
      <dc:creator>jeremy4</dc:creator>
      <dc:date>2019-09-20T11:29:21Z</dc:date>
    </item>
    <item>
      <title>Re: proc sql select certain distinct variables only</title>
      <link>https://communities.sas.com/t5/SAS-Programming/proc-sql-select-certain-distinct-variables-only/m-p/590334#M168939</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/266226"&gt;@jeremy4&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;
&lt;P&gt;I have three variables in my dataset:&lt;/P&gt;
&lt;P&gt;Account_id&lt;/P&gt;
&lt;P&gt;Month&lt;/P&gt;
&lt;P&gt;Counted (I used the retain statement, so that the Counted variable in the 'test' dataset has a value of 'Yes' for all of the observations in the dataset).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If I only want &lt;FONT color="#FF0000"&gt;distinct values of account_id and month only&lt;/FONT&gt; (i.e. &lt;FONT color="#FF0000"&gt;keep only one of the observations&lt;/FONT&gt; when there is BOTH the same account_ID AND month) but exclude the Counted variable from being used for distinct (as all of the observations will have a value of 'Yes' in the dataset), can someone please correct the code below (I have got an error message)? Thanks!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I used the following code below (which works), but I also want the 'unique_accounts' table to include the variable 'Counted' (though 'Counted' should not be part of the distinct').&lt;/P&gt;
&lt;P&gt;proc sql;&lt;BR /&gt;&amp;nbsp; &amp;nbsp;create table unique_accounts as&lt;BR /&gt;&amp;nbsp; &amp;nbsp;select &lt;FONT color="#FF0000"&gt;distinct account_id&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT color="#FF0000"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ,month&lt;/FONT&gt;&lt;BR /&gt;&amp;nbsp; &amp;nbsp;from test;&lt;/P&gt;
&lt;P&gt;quit;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;Distinct applies to all variables in the SELECT statement. So it sounds like you need to take a different approach. If you KNOW (which you do) that COUNTED is YES for every row, then the solution is simple.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;proc sql;
   create table unique_accounts as
   select distinct account_id
        ,month
        ,"YES" as counted
   from test;
quit;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 20 Sep 2019 11:31:50 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/proc-sql-select-certain-distinct-variables-only/m-p/590334#M168939</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2019-09-20T11:31:50Z</dc:date>
    </item>
    <item>
      <title>Re: proc sql select certain distinct variables only</title>
      <link>https://communities.sas.com/t5/SAS-Programming/proc-sql-select-certain-distinct-variables-only/m-p/590342#M168943</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/266226"&gt;@jeremy4&lt;/a&gt;&amp;nbsp; &amp;nbsp;Unique counts??&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Can you post a better representative sample plz? Your current sample is confusing&lt;/P&gt;</description>
      <pubDate>Fri, 20 Sep 2019 11:54:31 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/proc-sql-select-certain-distinct-variables-only/m-p/590342#M168943</guid>
      <dc:creator>novinosrin</dc:creator>
      <dc:date>2019-09-20T11:54:31Z</dc:date>
    </item>
    <item>
      <title>Re: proc sql select certain distinct variables only</title>
      <link>https://communities.sas.com/t5/SAS-Programming/proc-sql-select-certain-distinct-variables-only/m-p/590343#M168944</link>
      <description>&lt;P&gt;Thanks for your reply. Think about it again, would it matter whether I include 'counted' for distinct variables, because my understanding is that as counted = 'Yes' for every observation, it would not affect the final result as the account_id and month variables are effectively where any potential duplicates would be identified (i.e. if two observations have the same account_id and month, the 'counted' variable would not make a difference as all of the observations are 'Yes' anyway)?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;proc sql;&lt;BR /&gt;&amp;nbsp; &amp;nbsp;create table unique_accounts as&lt;BR /&gt;&amp;nbsp; &amp;nbsp;select&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;FONT color="#FF0000"&gt;distinct account_id&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT color="#FF0000"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ,month&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;FONT color="#FF0000"&gt;,counted&lt;/FONT&gt;&lt;BR /&gt;&amp;nbsp; &amp;nbsp;from test;&lt;/P&gt;&lt;P&gt;quit;&lt;/P&gt;</description>
      <pubDate>Fri, 20 Sep 2019 11:57:28 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/proc-sql-select-certain-distinct-variables-only/m-p/590343#M168944</guid>
      <dc:creator>jeremy4</dc:creator>
      <dc:date>2019-09-20T11:57:28Z</dc:date>
    </item>
    <item>
      <title>Re: proc sql select certain distinct variables only</title>
      <link>https://communities.sas.com/t5/SAS-Programming/proc-sql-select-certain-distinct-variables-only/m-p/590345#M168945</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/266226"&gt;@jeremy4&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;
&lt;P&gt;... would it matter whether I include 'counted' for distinct variables, because my understanding is that as counted = 'Yes' for every observation, it would not affect the final result as the account_id and month variables are effectively where any potential duplicates would be identified (i.e. if two observations have the same account_id and month, the 'counted' variable would not make a difference as all of the observations are 'Yes' anyway)?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;proc sql;&lt;BR /&gt;&amp;nbsp; &amp;nbsp;create table unique_accounts as&lt;BR /&gt;&amp;nbsp; &amp;nbsp;select&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;FONT color="#FF0000"&gt;distinct account_id&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT color="#FF0000"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ,month&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;FONT color="#FF0000"&gt;,counted&lt;/FONT&gt;&lt;BR /&gt;&amp;nbsp; &amp;nbsp;from test;&lt;/P&gt;
&lt;P&gt;quit;&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;The easiest answer is to run the code and find out.&lt;/P&gt;</description>
      <pubDate>Fri, 20 Sep 2019 12:11:46 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/proc-sql-select-certain-distinct-variables-only/m-p/590345#M168945</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2019-09-20T12:11:46Z</dc:date>
    </item>
    <item>
      <title>Re: proc sql select certain distinct variables only</title>
      <link>https://communities.sas.com/t5/SAS-Programming/proc-sql-select-certain-distinct-variables-only/m-p/590346#M168946</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;There are 200,000 accounts in my dataset and I have been told that there are duplicates in the original dataset (only containing account_ID and month). I used a retain statement, so that the updated dataset now contains three variables (account_ID, month and Counted). Counted has a value of 'Yes' for all 200,000 observations.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;As there are 200,000 accounts in my dataset, I was wondering how to use 'distinct' in proc sql, so that where observations match, only one is kept.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;U&gt;Partial example&amp;nbsp;&lt;/U&gt;&lt;/P&gt;&lt;P&gt;Account ID&amp;nbsp; &amp;nbsp; &amp;nbsp; Month&amp;nbsp; &amp;nbsp; &amp;nbsp; Counted&lt;/P&gt;&lt;P&gt;1&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201801&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/P&gt;&lt;P&gt;2&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201807&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/P&gt;&lt;P&gt;3&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201804&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/P&gt;&lt;P&gt;4&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201809&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/P&gt;&lt;P&gt;2&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201807&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/P&gt;&lt;P&gt;5&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201810&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/P&gt;&lt;P&gt;3&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201804&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Desired proc sql table output (duplicates removed). Pulls in all three variables (account_id, month and counted) but when there are duplicate observations, only one is kept. Effectively, this means only looking at distinct values of account_ID and month, as if they are different or duplicated, the counted variable will not matter as all observations have a value of 'Yes' anyway.&lt;/P&gt;&lt;P&gt;Account ID&amp;nbsp; &amp;nbsp; &amp;nbsp; Month&amp;nbsp; &amp;nbsp; &amp;nbsp; Counted&lt;/P&gt;&lt;P&gt;1&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201801&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/P&gt;&lt;P&gt;2&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201807&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/P&gt;&lt;P&gt;3&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201804&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/P&gt;&lt;P&gt;4&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201809&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/P&gt;&lt;P&gt;&lt;STRIKE&gt;2&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201807&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/STRIKE&gt;&lt;/P&gt;&lt;P&gt;5&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201810&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/P&gt;&lt;P&gt;&lt;STRIKE&gt;3&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 201804&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Yes&lt;/STRIKE&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Can someone explain the difference in outcome if I used the two versions of code, and which one would be best to created the output required above?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Version 1&lt;/P&gt;&lt;P&gt;proc sql;&lt;BR /&gt;&amp;nbsp; &amp;nbsp;create table unique_accounts as&lt;BR /&gt;&amp;nbsp; &amp;nbsp;select&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;FONT color="#FF0000"&gt;distinct account_id&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT color="#FF0000"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ,month&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;FONT color="#FF0000"&gt;,counted&lt;/FONT&gt;&lt;BR /&gt;&amp;nbsp; &amp;nbsp;from test;&lt;/P&gt;&lt;P&gt;quit;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Version 2&lt;/P&gt;&lt;PRE class=" language-sas"&gt;&lt;CODE class="  language-sas"&gt;&lt;SPAN class="token procnames"&gt;proc&lt;/SPAN&gt; &lt;SPAN class="token procnames"&gt;sql&lt;/SPAN&gt;&lt;SPAN class="token punctuation"&gt;;&lt;/SPAN&gt;
   create &lt;SPAN class="token statement"&gt;table&lt;/SPAN&gt; unique_accounts as
   &lt;SPAN class="token statement"&gt;select&lt;/SPAN&gt; &lt;SPAN class="token keyword"&gt;distinct&lt;/SPAN&gt; account_id
        &lt;SPAN class="token punctuation"&gt;,&lt;/SPAN&gt;&lt;SPAN class="token function"&gt;month&lt;/SPAN&gt;
        &lt;SPAN class="token punctuation"&gt;,&lt;/SPAN&gt;&lt;SPAN class="token string"&gt;"YES"&lt;/SPAN&gt; as counted
   &lt;SPAN class="token keyword"&gt;from&lt;/SPAN&gt; test&lt;SPAN class="token punctuation"&gt;;&lt;/SPAN&gt;
&lt;SPAN class="token procnames"&gt;quit&lt;/SPAN&gt;&lt;SPAN class="token punctuation"&gt;;&lt;/SPAN&gt;&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 20 Sep 2019 12:18:38 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/proc-sql-select-certain-distinct-variables-only/m-p/590346#M168946</guid>
      <dc:creator>jeremy4</dc:creator>
      <dc:date>2019-09-20T12:18:38Z</dc:date>
    </item>
    <item>
      <title>Re: proc sql select certain distinct variables only</title>
      <link>https://communities.sas.com/t5/SAS-Programming/proc-sql-select-certain-distinct-variables-only/m-p/590350#M168947</link>
      <description>I am missing something. Since Counted is the same for all observations, what is the problem with "select distinct AccountId, Month, Counted" ?</description>
      <pubDate>Fri, 20 Sep 2019 12:19:39 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/proc-sql-select-certain-distinct-variables-only/m-p/590350#M168947</guid>
      <dc:creator>gamotte</dc:creator>
      <dc:date>2019-09-20T12:19:39Z</dc:date>
    </item>
    <item>
      <title>Re: proc sql select certain distinct variables only</title>
      <link>https://communities.sas.com/t5/SAS-Programming/proc-sql-select-certain-distinct-variables-only/m-p/590352#M168949</link>
      <description>That's my question as Counted is the same for all observations, so will it identify distinct observations and produce the required output based on version 1, or does version 2 have to be used (as suggested in a reply)?</description>
      <pubDate>Fri, 20 Sep 2019 12:25:21 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/proc-sql-select-certain-distinct-variables-only/m-p/590352#M168949</guid>
      <dc:creator>jeremy4</dc:creator>
      <dc:date>2019-09-20T12:25:21Z</dc:date>
    </item>
    <item>
      <title>Re: proc sql select certain distinct variables only</title>
      <link>https://communities.sas.com/t5/SAS-Programming/proc-sql-select-certain-distinct-variables-only/m-p/590354#M168950</link>
      <description>&lt;P&gt;Assuming you have a variable Counted in the input, dataset, your version is rather straight forward. What select distinct does is&lt;/P&gt;
&lt;P&gt;1. sort&lt;/P&gt;
&lt;P&gt;2. eliminate&lt;/P&gt;
&lt;P&gt;based upon values in respective position&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Version 2:&lt;/P&gt;
&lt;P&gt;No counted variable in input dataset&lt;/P&gt;
&lt;P&gt;So the process is&lt;/P&gt;
&lt;P&gt;1. Assignment statement "yes" as counted will execute first&lt;/P&gt;
&lt;P&gt;2. Sort&lt;/P&gt;
&lt;P&gt;3. Eliminate&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Now&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/266226"&gt;@jeremy4&lt;/a&gt;&amp;nbsp; you can choose&lt;/P&gt;</description>
      <pubDate>Fri, 20 Sep 2019 12:26:00 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/proc-sql-select-certain-distinct-variables-only/m-p/590354#M168950</guid>
      <dc:creator>novinosrin</dc:creator>
      <dc:date>2019-09-20T12:26:00Z</dc:date>
    </item>
    <item>
      <title>Re: proc sql select certain distinct variables only</title>
      <link>https://communities.sas.com/t5/SAS-Programming/proc-sql-select-certain-distinct-variables-only/m-p/590357#M168952</link>
      <description>&lt;P&gt;Perhaps Proc SQL does not have this functionality, I suggest you read this article:&lt;/P&gt;
&lt;P&gt;&lt;A href="https://communities.sas.com/t5/SAS-Programming/Difference-between-NOdup-and-NoDupkey/m-p/29490/highlight/true#M66149" target="_self"&gt;Difference between NOdup and NoDupkey..??&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 20 Sep 2019 12:27:24 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/proc-sql-select-certain-distinct-variables-only/m-p/590357#M168952</guid>
      <dc:creator>PhilC</dc:creator>
      <dc:date>2019-09-20T12:27:24Z</dc:date>
    </item>
    <item>
      <title>Re: proc sql select certain distinct variables only</title>
      <link>https://communities.sas.com/t5/SAS-Programming/proc-sql-select-certain-distinct-variables-only/m-p/590359#M168954</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/266226"&gt;@jeremy4&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;That's my question as Counted is the same for all observations, so will it identify distinct observations and produce the required output based on version 1, or does version 2 have to be used (as suggested in a reply)?&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;Why don't you try them and find out? You'll have your answer in about 10 seconds.&lt;/P&gt;</description>
      <pubDate>Fri, 20 Sep 2019 12:29:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/proc-sql-select-certain-distinct-variables-only/m-p/590359#M168954</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2019-09-20T12:29:54Z</dc:date>
    </item>
  </channel>
</rss>

