Solved: Re: Output hash dataset if not found

airchang · Posted 10-26-2016 07:52 PM

Hi,

I used one dataset as a hash, with "ID" as its key. I also have other variables in the hash that I want to keep and I included them in defineData.

I'd like to output variables in the hash, where the "ID" can not be found in the main dataset, because I'm more interested in variables that dont exist in the main dataset.

The codes I'm using now are like this: if hash.find()^=o then output .... But it seems like it outputs variables in the main datasets instead of in the hash.

I can't do it by reversing the hash and main dataset, because the main dataset is far larger than my hash. What code should I use to solve it? Any help will be appreciated!

Ksharp · Posted 10-26-2016 10:07 PM

Remove it from Hash when you find it , and at the end of data step output Hash.

data _null_;
 set main end=last;
 ...........
 if hash.check()=o then hash.remove();
 if last then hash.output(dataset:'want');
 run;

View solution in original post

Shmuel · Posted 10-26-2016 08:13 PM

In similar situation I prefer the SQL.

Let datasets be: SMALL (instead hash) and BIG with ID common in both, then

proc sql;

create table WANT as select * from BIG

where ID not in (select ID from SMALL);

quit;

airchang · Posted 10-27-2016 01:26 PM

The dataset is really large (more than 400 GB), so I was suggested to use Hash. But thanks for your help!

Ksharp · Posted 10-26-2016 10:07 PM

Remove it from Hash when you find it , and at the end of data step output Hash.

data _null_;
 set main end=last;
 ...........
 if hash.check()=o then hash.remove();
 if last then hash.output(dataset:'want');
 run;

airchang · Posted 10-27-2016 01:27 PM

This works! Thanks!

Output hash dataset if not found

Re: Output hash dataset if not found

Re: Output hash dataset if not found

Re: Output hash dataset if not found

Re: Output hash dataset if not found

Re: Output hash dataset if not found

Catch up on SAS Innovate 2026

SAS Training: Just a Click Away