SAS Procedures

Paul_NYS · Posted 02-02-2016 12:36 PM

Hi Everyone

I would like to collapse the records below (screenshot) that share the same entity_id into one record, keeping each column's value wherever one is populated.

Is there a fairly straightforward way of doing this?

Paul

FreelanceReinh · Posted 02-02-2016 12:48 PM

Hi Paul,

Try this:

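proc summary data=have nway;        /* NWAY: output only the per-entity_id rows, no overall total */
  class entity_id;                  /* one output record per entity_id */
  var SubPh:;                       /* all variables whose names start with SubPh */
  output out=want(drop=_:) max=;    /* MAX ignores missing values; drop _TYPE_ and _FREQ_ */
run;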

Paul_NYS · Posted 02-02-2016 01:00 PM

Thanks a lot Freelance! That did it.

I have not used PROC SUMMARY before. What do the following parts of it do?

(drop=_:) max=

Paul

FreelanceReinh · Posted 02-02-2016 01:21 PM

Glad to read that it worked for you.

Maybe you have used (or heard of) PROC MEANS? PROC SUMMARY is almost the same; the main difference is that PROC MEANS writes its results to the output window by default, whereas PROC SUMMARY does not.
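
For comparison, here is a minimal sketch of the same step written with PROC MEANS, assuming the same HAVE dataset as above. The NOPRINT option suppresses the printed report, so only the output dataset is created (the name WANT_MEANS is made up for this illustration):

```sas
/* Sketch: PROC MEANS with NOPRINT behaves like PROC SUMMARY here */
proc means data=have nway noprint;
  class entity_id;
  var SubPh:;
  output out=want_means(drop=_:) max=;   /* want_means is a hypothetical name */
run;
```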

The MAX= option of the OUTPUT statement says two things:

First, for each analysis variable specified in the VAR statement* the maximum is the summary statistic to be computed.
Second, because nothing follows "MAX=", each variable in the output dataset (here: WANT) that holds a summary statistic takes the name of the corresponding analysis variable (i.e., the maximum of SubPh1 is stored in a variable named SubPh1, etc.). To use different names, list them after "MAX=".
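
To illustrate the naming rule, here is a rough sketch of two alternatives, again assuming the HAVE dataset from this thread. The variable SubPh2, the new names MaxSubPh1 and MaxSubPh2, and the dataset names WANT_NAMED and WANT_AUTO are assumptions made up for the example:

```sas
/* Variant 1: explicit output names listed after MAX=, in VAR order */
proc summary data=have nway;
  class entity_id;
  var SubPh1 SubPh2;                      /* SubPh2 is assumed to exist */
  output out=want_named(drop=_:) max=MaxSubPh1 MaxSubPh2;
run;

/* Variant 2: let SAS build the names with the AUTONAME option,
   which appends the statistic keyword (e.g. SubPh1_Max) */
proc summary data=have nway;
  class entity_id;
  var SubPh:;
  output out=want_auto(drop=_:) max= / autoname;
run;
```

In the second variant, drop=_: still removes only _TYPE_ and _FREQ_, because the generated names end with, rather than start with, an underscore-prefixed part.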

By default, the output dataset contains the variables _TYPE_ and _FREQ_, which hold additional information about the summary. In your example, _TYPE_=1 for all observations, so it is not very interesting. _FREQ_ is the number of observations summarized, i.e. 3 for entity_id=165771, 5 for entity_id=230674, and so on. Assuming that you don't need these variables, I dropped them. More precisely, (drop=_:) drops all variables whose names start with an underscore.

* Here, the VAR statement contains the list of all variables in dataset HAVE whose names start with "SubPh" (assuming that these are exactly your intended analysis variables).
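
To see the whole mechanism on a small scale, the following sketch uses made-up test data (the entity_id values 101 and 102 and the three SubPh columns are hypothetical, not taken from the screenshot) and keeps _TYPE_ and _FREQ_ so they can be inspected:

```sas
/* Hypothetical test data: each row populates only one SubPh column */
data have_demo;
  input entity_id SubPh1 SubPh2 SubPh3;
  datalines;
101 1 . .
101 . 4 .
101 . . 7
102 2 . .
102 . 5 .
;
run;

/* Same technique as above, but without drop=_: */
proc summary data=have_demo nway;
  class entity_id;
  var SubPh:;
  output out=want_demo max=;   /* _TYPE_ and _FREQ_ are kept on purpose */
run;

proc print data=want_demo noobs;
run;
```

With this data, WANT_DEMO should contain one row per entity_id: _FREQ_ is 3 for 101 and 2 for 102, and SubPh3 stays missing for 102 because no row supplies a value. Note that this approach applies to numeric variables only, and that if two rows of the same entity_id held different non-missing values in one column, MAX would keep the larger one.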
