BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
bnarang
Calcite | Level 5

Hi

I have a very simple sorted data with observation as ID and Indicator, I would like to form a cluster for each ID with increments for each group of records. It should reset for every account type. I have thought a lot about it but no success so far. Please help . CLUSTER IS Required Output

 

ID         Indicator   Cluster

A11        .               .

A11        .               .

A11        1              1

A11        1              1

A11        1              1

A11        .

A11        .

A11        1              2

A11        1              2

A11        .

A11        1             3

A11        1             3

B11        .

B11        1             1

B11        1             1

B11        1             1

B11        .

B11        1             2

B11         .

B11        .


1 ACCEPTED SOLUTION

Accepted Solutions
Tom
Super User Tom
Super User

Use BY processing with NOTSORTED option.  To zero out the CLUSTER number when indicator is missing you will need a second variable.

data have ;

input id $ indicator want;

cards;

A11 . .

A11 . .

A11 1 1

A11 1 1

A11 1 1

A11 . .

A11 . .

A11 1 2

A11 1 2

A11 . .

A11 1 3

A11 1 3

B11 . .

B11 1 1

B11 1 1

B11 1 1

B11 . .

B11 1 2

B11 . .

B11 . .

run;

data want ;

  set have ;

  by id indicator notsorted ;

  if first.id then ncluster=0;

  if first.indicator and not missing(indicator) then ncluster+1;

  if not missing(indicator) then cluster=ncluster;

  put id indicator want cluster;

  drop ncluster ;

run;

View solution in original post

2 REPLIES 2
Tom
Super User Tom
Super User

Use BY processing with NOTSORTED option.  To zero out the CLUSTER number when indicator is missing you will need a second variable.

data have ;

input id $ indicator want;

cards;

A11 . .

A11 . .

A11 1 1

A11 1 1

A11 1 1

A11 . .

A11 . .

A11 1 2

A11 1 2

A11 . .

A11 1 3

A11 1 3

B11 . .

B11 1 1

B11 1 1

B11 1 1

B11 . .

B11 1 2

B11 . .

B11 . .

run;

data want ;

  set have ;

  by id indicator notsorted ;

  if first.id then ncluster=0;

  if first.indicator and not missing(indicator) then ncluster+1;

  if not missing(indicator) then cluster=ncluster;

  put id indicator want cluster;

  drop ncluster ;

run;

bnarang
Calcite | Level 5

Many Thanks Tom. Don't know the usage of NotSorted option but will read over the internet.

Thanks again

hackathon24-white-horiz.png

2025 SAS Hackathon: There is still time!

Good news: We've extended SAS Hackathon registration until Sept. 12, so you still have time to be part of our biggest event yet – our five-year anniversary!

Register Now

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 2 replies
  • 1242 views
  • 0 likes
  • 2 in conversation