BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
abqdiane
Obsidian | Level 7

Hello,

I have a list of members that can be enrolled in a club. Some members stay continuously enrolled, while other stop and start.  I'm trying to find those members who stopped  during the year and then re-enrolled and how long the gap is.  Each time a member re-enrolls, a new record is created. So some members will only have one record, but other could have 4 or 5 records.  In the example below, I have members for just one year, but the records go back for three to four years.  Any suggestions?  Would an array work?  I thought about transpose, but there are over 80,000 records.  I tried a retain function to come up with one row per member, but I couldn't get it to work.

 

From the  example below ,member 111 has 12 continuous months, but member 122 has a one month gap.  

 

I'd appreciate any help.

 ID Start_Date End_Date
111 01/01/2017 12/31/2017
122 01/01/2017 06/30/2017
122 08/01/2017 12/31/2017
133 01/01/2017 06/30/2017
133 07/01/2017 12/31/2017
144 1/1/2017 02/28/2017
144 04/01/2017 06/30/2017
144 08/01/2017 10/31/2017
144 11/01/2017 12/31/2017

 

1 ACCEPTED SOLUTION

Accepted Solutions
Astounding
PROC Star

Assuming your data set is sorted as indicated, you could use:

 

data want;

set have;

by id;

gap_size = start_date - lag(end_date);

if first.id = 0 and gap_size > 1;

run;

 

This gives you only the observations that begin after a gap.  If you want all the records, you could use a similar variation:

 

data want;

set have;

by id;

gap_size = start_time - lag(end_date);

if first.id then gap_size = 0;

run;

 

According to the examples you gave, there is no gap when gap_size is either 0 or 1.

View solution in original post

2 REPLIES 2
Astounding
PROC Star

Assuming your data set is sorted as indicated, you could use:

 

data want;

set have;

by id;

gap_size = start_date - lag(end_date);

if first.id = 0 and gap_size > 1;

run;

 

This gives you only the observations that begin after a gap.  If you want all the records, you could use a similar variation:

 

data want;

set have;

by id;

gap_size = start_time - lag(end_date);

if first.id then gap_size = 0;

run;

 

According to the examples you gave, there is no gap when gap_size is either 0 or 1.

Ready to join fellow brilliant minds for the SAS Hackathon?

Build your skills. Make connections. Enjoy creative freedom. Maybe change the world. Registration is now open through August 30th. Visit the SAS Hackathon homepage.

Register today!
Mastering the WHERE Clause in PROC SQL

SAS' Charu Shankar shares her PROC SQL expertise by showing you how to master the WHERE clause using real winter weather data.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 509 views
  • 1 like
  • 2 in conversation