Solved: Re: Using retain or lag

ChuksManuel · Posted 07-04-2019 05:17 AM

Hello programmers,

I am trying to use the RETAIN statement or the LAG function to help me calculate incidence density ratios (IDR).

My IDR = (incidence density for value=1) / (incidence density for value=0), and I want to output only the last observation for each tc.

I'd be glad if anyone could give me an idea of how to go about this. I want to calculate the respective incidence density ratios for I_DPD, I2_DPY, I_AnginaPD and I2_AnginaPY.

data one;
input	tc	tn $	value	I_DPD	I2_DPY	I_AnginaPD	I2_AnginaPY;
datalines;
	1	Exhaustion	0	0.016	6.07	0.002	0.904
	1	Exhaustion	1	0.016	5.91	0.003	1.265
	2	Problemwalking	0	0.016	6.140	0.002	0.970
	2	Problemwalking	1	0.015	5.71	0.004	1.004
	3	ProblemStanding	0	0.016	6.17	0.008	0.95
	3	ProblemStanding	1	0.014	5.43	0.005	0.98
; run;
proc print; run;

Reeza · Posted 07-04-2019 04:18 PM

Rather than lag, why not do a merge?

data want;
merge one (where=(value=0))
one(where=(value=1) rename = (I_DPD = I_DPD1 I2_DPY = I2_DPY1 ....));
by tc;

array values0(*) I_dpd i2_dpy i_anginapd i2_anginapy;
array values1(*) .....;
array want(4) diff1-diff4;
do i=1 to 4;
want(i) = values1(i) / values0(i);
end;
run;

At least that's one way, not quite dynamic. If you want a fully dynamic solution, it's likely worth transposing your data to a long format.
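For reference, a filled-in version of the merge above might look like the sketch below. The names I_AnginaPD1 and I2_AnginaPY1, the IDR_ output names (borrowed from the poster's later description), the BY tc statement, the :$20. informat for tn, and the zero/missing guard are all assumptions added for illustration, not part of the original reply.

```sas
/* Sample data from the question, retyped with blanks as delimiters.    */
/* The :$20. informat is an assumption so long tn values are not cut.   */
data one;
   input tc tn :$20. value I_DPD I2_DPY I_AnginaPD I2_AnginaPY;
datalines;
1 Exhaustion 0 0.016 6.07 0.002 0.904
1 Exhaustion 1 0.016 5.91 0.003 1.265
2 Problemwalking 0 0.016 6.140 0.002 0.970
2 Problemwalking 1 0.015 5.71 0.004 1.004
3 ProblemStanding 0 0.016 6.17 0.008 0.95
3 ProblemStanding 1 0.014 5.43 0.005 0.98
;
run;

data want;
   /* Pair each value=0 row with the value=1 row of the same tc. */
   merge one(where=(value=0))
         one(where=(value=1)
             rename=(I_DPD=I_DPD1 I2_DPY=I2_DPY1
                     I_AnginaPD=I_AnginaPD1 I2_AnginaPY=I2_AnginaPY1));
   by tc;

   array values0(*) I_DPD I2_DPY I_AnginaPD I2_AnginaPY;      /* value=0 */
   array values1(*) I_DPD1 I2_DPY1 I_AnginaPD1 I2_AnginaPY1;  /* value=1 */
   array idr(*) IDR_DPD IDR_DPY IDR_AnginaPD IDR_AnginaPY;    /* ratios  */

   do i = 1 to dim(idr);
      /* Leave the ratio missing when the denominator is missing or 0. */
      if values0(i) not in (., 0) then idr(i) = values1(i) / values0(i);
   end;
   drop i;
run;

proc print data=want; run;
```

In this sketch the unsuffixed columns hold the value=0 measures and the columns ending in 1 hold the value=1 measures, so each tc ends up as a single row.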


PeterClemmensen · Posted 07-04-2019 05:24 AM

So what does your desired result from this data look like?


ChuksManuel · Posted 07-04-2019 05:42 AM

Hello,

I basically want to get the incidence density ratio (incidence in the observation with value=1 / incidence in the observation with value=0) for each 'tc'.

I want an output with the four incidence density ratios in the columns IDR_DPD, IDR_DPY, IDR_AnginaPD and IDR_AnginaPY.

I wrote this first piece of code with the LAG function to get the incidence density ratio IDR_DPD. I can do the same for the rest, but I want to know how I can do this with a RETAIN statement.

data one1;
set one;
by tc;
lag_I_dpd=lag(I_dpd);
IDR_DPD= I_dpd/lag_I_dpd;
run;
proc print; run;
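One thing to watch in the LAG step above: it computes a ratio on every row, so the value=0 row of each tc is divided by the last row of the previous tc. A minimal sketch of one way to keep only the within-group ratio follows. It assumes the data are sorted by tc with the value=0 row first, and it retypes only the three variables needed, under the hypothetical dataset name one_dpd.

```sas
/* Reduced copy of the question's data (tc, value, I_DPD only). */
data one_dpd;
   input tc value I_DPD;
datalines;
1 0 0.016
1 1 0.016
2 0 0.016
2 1 0.015
3 0 0.016
3 1 0.014
;
run;

data one1;
   set one_dpd;
   by tc;
   /* Call LAG on every row, never inside an IF, so its queue stays in step. */
   lag_I_dpd = lag(I_DPD);
   /* Only the value=1 row that follows a row of the same tc gets a ratio. */
   if value = 1 and not first.tc then do;
      IDR_DPD = I_DPD / lag_I_dpd;
      output;
   end;
   drop lag_I_dpd;
run;

proc print data=one1; run;
```

With these six rows the step writes one observation per tc, with IDR_DPD equal to 1, 0.9375 and 0.875.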

Kurt_Bremser · Posted 07-04-2019 06:10 AM

Don't describe the expected result, show it. As you can see, there seems to be some difficulty inferring your requirements from the description alone.

Maxims of Maximally Efficient SAS Programmers
How to convert datasets to data steps
The macro for direct download as ZIP
How to post code
Please vote for Provide Sequential Search Capability for Hash Objects
How to deal with locked files on UNIX

ScottBass · Posted 07-04-2019 08:01 PM

@Kurt_Bremser wrote:

Don't describe the expected result, show it. As you can see, there seems to be some difficulty inferring your requirements from the description alone.

Yes, what @Kurt_Bremser said. I don't know what "incidence density" is. I suppose I could Google it. But I consider myself a good SAS programmer, so could probably help you if I knew what your target data was.

So, provide data steps as "have" (your source data) and "want" (your target results). They should be self-contained data steps using the datalines statement, entered using the "Insert SAS code" icon (so the code does not get reformatted). The code should be something we can cut-and-paste from here into SAS and it runs without error.

Once we have that information, we can code a solution that matches your target data.

Otherwise, if you describe your data, then we will describe the code you need to write.

P.S.: Click, read, and comprehend the last three links in Kurt's signature block.


Tom · Posted 07-04-2019 10:47 AM

If I understand this correctly, you just want to divide the value of your measures on the non-ZERO rows by the value of the measure from the ZERO row.

So if you have data like:

data have ;
   input ID VALUE MEASURE ;
cards;
1 0 100
1 1 50
2 0 200
2 1 60
;

You want to get a result that looks like:

data want;
   input ID RATIO  ;
cards;
1 0.50 
2 0.30
;

Only you have more than one analysis variable.
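Since the original question asked about RETAIN, here is a minimal sketch of how the have/want pair above could be produced with a RETAIN statement. It assumes the data are sorted by ID with the VALUE=0 row first; the variable name base and the dataset name want_retain are made up for illustration.

```sas
data have;
   input ID VALUE MEASURE;
cards;
1 0 100
1 1 50
2 0 200
2 1 60
;
run;

data want_retain;
   set have;
   by ID;
   retain base;                      /* carries MEASURE from the VALUE=0 row */
   if first.ID then base = .;        /* reset so nothing leaks across IDs    */
   if VALUE = 0 then base = MEASURE; /* remember the denominator             */
   else if base not in (., 0) then do;
      RATIO = MEASURE / base;        /* non-zero row divided by the zero row */
      output;                        /* write only the ratio rows            */
   end;
   keep ID RATIO;
run;

proc print data=want_retain; run;
```

For the four rows shown this gives RATIO = 0.5 for ID 1 and 0.3 for ID 2. With several analysis variables, the same pattern could be repeated with one retained variable per measure, or with arrays.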

ChuksManuel · Posted 07-04-2019 09:17 PM

Hello Tom,

That's exactly what I want to do: find the ratio of the lower row (value=1) to the upper row (value=0).

ChuksManuel · Posted 07-04-2019 06:19 AM

Thank you for the response.

The final output would be something like this:

data finaloutput;
input	tc	tn $	value	I_DPD	I2_DPY	I_AnginaPD	I2_AnginaPY IDR_DPD IDR_DPY IDR_AnginaPD IDR_AnginaPY;
datalines;
	1	Exhaustion	1	0.016	5.91	0.003	1.265 . . . .
	2	Problemwalking	1	0.015	5.71	0.004	1.004 . . . .
	3	ProblemStanding	1	0.014	5.43	0.005	0.98 . . . .
; run;
proc print; run;

Kurt_Bremser · Posted 07-04-2019 06:31 AM

So you just apply a where condition for value = 1 and add four variables that are always missing?


