Hi Everyone,
I have a set of data in long form that contains missing data. I'd like to collapse by the ID and have 1 row per ID.
Here is what I have:
ID | InitialCount | FinalCount |
1 | 0 | . |
1 | . | . |
1 | . | . |
1 | . | . |
1 | . | 0 |
2 | 3 | . |
2 | . | . |
2 | . | . |
2 | . | . |
2 | . | 0 |
3 | 0 | . |
3 | . | . |
3 | . | . |
3 | . | 7 |
Here is what I want:
ID | InitialCount | FinalCount |
1 | 0 | 0 |
2 | 3 | 0 |
3 | 0 | 7 |
Any help would be greatly appreciated! Thanks in advance!
The easiest program might be:
proc summary data=have nway;
by id;
var InitialCount FinalCount;
output out=want (drop=_type_ _freq_) max=;
run;
You can use a more complex program that doesn't require you to specify the variable names:
data want;
update have (obs=0) have;
by id;
run;
For now, I suspect that learning more about PROC SUMMARY would be useful.
Can you be sure that your data always has only 1 nonmissing obervation for InitialCount and 1 nonmissing obervation for FinalCount for each ID?
ok. There u go 🙂
data have;
input ID InitialCount FinalCount;
datalines;
1 0 .
1 . .
1 . .
1 . .
1 . 0
2 3 .
2 . .
2 . .
2 . .
2 . 0
3 0 .
3 . .
3 . .
3 . 7
;
proc sort data = have;
by ID;
run;
data help;
set have;
by ID;
if first.ID or last.ID then output;
run;
data want(drop = lag);
set help;
by ID;
lag = lag(InitialCount);
if last.ID then do;
InitialCount = lag;
output;
end;
run;
The easiest program might be:
proc summary data=have nway;
by id;
var InitialCount FinalCount;
output out=want (drop=_type_ _freq_) max=;
run;
You can use a more complex program that doesn't require you to specify the variable names:
data want;
update have (obs=0) have;
by id;
run;
For now, I suspect that learning more about PROC SUMMARY would be useful.
Cool solutions @Astounding 🙂
proc summary data=have nway;
class id;
var initialcount finalcount;
output out=want (drop= _:) max= ;
run;
Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.