BookmarkSubscribeRSS Feed
san
Calcite | Level 5 san
Calcite | Level 5

Hi there,

 

I am currently working with claims dataset. I need to get patient level claims dataset. My dataset looks like:

ClaimID  Patient ID   Rxflag  Filldate       Filldatethru

1              5                 1        08/09/10      09/08/10

2              5                 1        10/08/10      11/09/10

3              5                 0        10/08/10      12/13/10

4              6                 1        03/09/11       05/10/11

5              6                 0        06/10/12       07/13/11   

 

I need to create variables named start_date and end_date based on unique patient ID to capture the date of first and last fill. I tried using something like this:

 

 

data PATIENT_LEVEL_EVER_FLAGS (keep=pt_id drug_ever);
	set FLAGGED_CLAIMS;
	by pt_id;
	label drug_ever="Ever have drug code of interest";

	if first.pt_id then
		do;
			drug_ever=0;
		end;

	if rx_flag=1 then
		drug_ever=1;

	if drug_ever=1 then
		startdate=filldate;

	if last.pt_id then
		output;

	if last.pt_id then
		enddate=filldatethru;
run;

 

I am confused with the concept. Could someone please explain me the basics behind this?  Any suggestions on how to proceed will be of great help! Thanks.

9 REPLIES 9
SASKiwi
PROC Star

A better option might be SQL. This should be fairly close:

 

proc sql;
  create table patients as
   select patientid
             ,count(*) as patientvisits
             ,min(filldate) as firstfilldate format = date9.
             ,max(filldatethru)  as lastfilldatethru format = date9.
  from flagged_claims
  where rx_flag = 1
  group by patientid
  ;
quit;
san
Calcite | Level 5 san
Calcite | Level 5

Hi SASKiwi,

 

Thank you for your input and SQL codes. I will try using this approach to see if it works. 

PGStats
Opal | Level 21

Or, if you want to keep patients without any rx_flag=1 records 

 

proc sql;
  create table patients as
   select patientId
             ,count(*) as patientVisits
             ,min(case when rx_flag then filldate else . end) as firstfilldate format = date9.
             ,max(case when rx_flag then filldatethru else . end)  as lastfilldatethru format = date9.
  from flagged_claims
  group by patientId
  ;
quit;
PG
PGStats
Opal | Level 21

Or, if you prefer a data step (should be more efficient)

 

data patients;
do until(last.patientid);
	set flagged_claims; by patientId;
	if rx_flag then do;
		firstFillDate = min(firstFillDate, fillDate);
		lastFillDateThru = max(lastFillDateThru, fillDateThru);
		end;
	end;
keep patientId firstFillDate lastFillDateThru;
format firstFillDate lastFillDateThru yymmdd10.;
run;
PG
san
Calcite | Level 5 san
Calcite | Level 5

Hello PGStats,

 

Thank you for providing the SQL method for keeping all patient-level observations. It worked perfectly. (I now have start date for every patient who was prescribed the drug and no start date for every patient who was not prescribed the drug).

 

Thanks a lot!

san
Calcite | Level 5 san
Calcite | Level 5

Hi SASKiwi,

 

The SQL method worked great. It just keeps the observation with RX_flag=1(because of the Where clause). But, I needed to keep all the observations (those without Rx_flag=1 also), hence I modified your method using suggestions made by PGStats.

 

Thanks a ton! 

Reeza
Super User

Based on the sample dataset abouve what would you expect as output?

And what are you having issues with in your code?

 

Hint: Look into RETAIN to keep variable value across rows.

san
Calcite | Level 5 san
Calcite | Level 5

Hi Reeza,

Thanks for your inputs. I expect that the output will have all the variables given in the sample and will create two new variables 'startdate' and 'enddate'. Startdate represents when the first prescription was filled(is equal to filldate). Enddate is when the prescription was stopped(is equal to filldatethru). So, I need to keep start and end dates for each unique patient who was prescribed a drug (whose rx_flag=1). However, when I try running the code which I have mentioned, it gives me missing values for startdate even for rx_flag=1. So, I am not sure how I should be specifying that condition. Should it be

 

if rx_flag=1 then startdate=filldate?

 

I will look into the RETAIN statement that you mentioned. Thanks again!

shravs
Calcite | Level 5

Hi San, 

If I understand your question correctly, you want startdate = fill date and enddate = fillthrudate when rx_flag = 1 otherwise startdate = filldate. 

 

Try the solution below: 

 

data patientrx;
  set filldate ;
  by patient_id;
if rx_flag = 1 then 
  do ;
   startdate = filldate;
   enddate = filldatethru;
   end;
else; do;
  startdate = filldate;
  end;
  format startdate enddate mmddyy9.;
run;



 

SAS Innovate 2025: Save the Date

 SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!

Save the date!

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 9 replies
  • 1448 views
  • 0 likes
  • 5 in conversation