strsljen
Obsidian | Level 7

Hi,

 

I have a use case where a table contains one or more rows for the same email address. There is also a "date" column.

From all rows with the same email address, only the one with the newest date should be kept in the result table, regardless of the values in the other columns (the table has about 20 columns altogether).

 

Is there a way to handle this in DI Studio without using custom-written code?

 

Thanks!

 

Best regards,

 

--
Mario
5 REPLIES
Patrick
Opal | Level 21

@strsljen

The two coding options below should both be quite simple to implement in DIS using standard transformations.

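/* sample data: one email address with several dates */
data have;
  set sashelp.class;
  email_addr='abc.efg@blah.com';
  dt=today()+_n_;
  format dt date9.;
run;

/* option 1: sort newest first, then keep the first row per email */
proc sort data=have out=want1;
  by email_addr DESCENDING dt;
run;

/* NODUPKEY keeps the first row of each BY group, i.e. the newest date */
proc sort data=want1 nodupkey;
  by email_addr;
run;

/* option 2: keep rows whose date equals the group maximum */
/* (if several rows share the newest date, all of them are kept) */
proc sql;
  create table want2 as 
    select *
    from have
    group by email_addr
    having max(dt)=dt
  ;
quit;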
AngusLooney
SAS Employee

Personally, I see this as just a bit of "SQL maths", easily done with Extract nodes.

 

First extract node:

  select email, max(date) as maxdate from table group by email into work.interim

 

Second extract node:

  join that back onto the original table, joining on email = email and date = maxdate

 

All done in extract nodes.
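
As a rough sketch, the two steps described above correspond to SQL along these lines. The sample table `have`, the column names `email_addr` and `dt`, and the output names `work.interim` and `want3` are assumptions for illustration; this is not the code DI Studio would generate.

```sas
/* hypothetical sample data: two rows for one address, one for another */
data have;
  input email_addr :$30. dt :date9. name $;
  format dt date9.;
  datalines;
a@example.com 01JAN2024 Ann
a@example.com 15MAR2024 Ann
b@example.com 10FEB2024 Bob
;
run;

proc sql;
  /* step 1: newest date per email address */
  create table work.interim as
    select email_addr, max(dt) as maxdate format=date9.
    from have
    group by email_addr;

  /* step 2: join back to pick up all other columns of the newest row */
  create table want3 as
    select h.*
    from have as h
      inner join work.interim as i
        on h.email_addr = i.email_addr
       and h.dt = i.maxdate;
quit;
```

If two rows for the same address share the newest date, the join returns both of them.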

 

Remember, DI Studio isn't supposed to be a programming tool.

strsljen
Obsidian | Level 7

Hi,

 

 

Makes sense. Thanks!

 

In the meantime, I checked the RANK transformation, and it does the trick: it gives me the ranking within the parameters I need. Then I just keep the rows with rank=1, for example.

I completely agree about DI Studio not being a programming tool. It is possible to run everything in user-written code, but we avoid that by all means.
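
For reference, a minimal sketch of the same idea in plain code, using PROC RANK and then filtering on the rank. The table and column names (`have`, `email_addr`, `dt`, `dt_rank`, `want4`) are assumptions, and the options chosen here are not necessarily the ones set in the RANK transformation.

```sas
/* hypothetical sample data */
data have;
  input email_addr :$30. dt :date9. name $;
  format dt date9.;
  datalines;
a@example.com 01JAN2024 Ann
a@example.com 15MAR2024 Ann
b@example.com 10FEB2024 Bob
;
run;

/* BY-group processing in PROC RANK needs sorted input */
proc sort data=have out=have_sorted;
  by email_addr;
run;

/* DESCENDING gives rank 1 to the newest date within each email address */
proc rank data=have_sorted out=ranked descending ties=low;
  by email_addr;
  var dt;
  ranks dt_rank;
run;

/* keep only the newest row(s) per email address */
data want4;
  set ranked;
  where dt_rank = 1;
run;
```

With `ties=low`, rows that share the newest date all receive rank 1, so duplicates on the date would need separate handling.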

 

 

Best regards,

--
Mario
Patrick
Opal | Level 21

@strsljen

Just to clarify: the two coding options I posted weren't meant to be implemented as user-written code but as logic using standard DIS transformations.

 

Option 1 can be implemented using two SORT transformations.

Option 2 can be implemented using a SQL Join transformation.

 

Option 1 should also make it easy to collect the rejected records in a second table.
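
In plain code, one way to collect the rejected rows is the DUPOUT= option of PROC SORT, sketched below. The table names `have_sorted`, `want_latest` and `rejected` are assumptions, and how this is exposed in the SORT transformation's options is not shown here.

```sas
/* hypothetical sample data */
data have;
  input email_addr :$30. dt :date9. name $;
  format dt date9.;
  datalines;
a@example.com 01JAN2024 Ann
a@example.com 15MAR2024 Ann
b@example.com 10FEB2024 Bob
;
run;

/* newest date first within each email address */
proc sort data=have out=have_sorted;
  by email_addr descending dt;
run;

/* keep the first (newest) row per address; older rows go to REJECTED */
proc sort data=have_sorted out=want_latest nodupkey dupout=rejected;
  by email_addr;
run;
```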

