DATA Step, Macro, Functions and more

Select 3 rows for each unique ID

Reply
Frequent Contributor
Posts: 75

Select 3 rows for each unique ID

Each unique ID have many different rows of  addresses, I want a program to select for each unqiue ID any 3 rows of addressses. 

 

Thank you.

 

Example of dataset:

 

ID  Address

1    A

1    B

1    C

2    HH

2    KK

2    NN

2    MM

 

  

Super User
Posts: 5,260

Re: Select 3 rows for each unique ID

If the data is sorted as per your example, use data step with BY ID, and use a counter that is re-initiated for each first.ID.

Use explicit output when the counter is <= 3.

Data never sleeps
Super User
Super User
Posts: 7,430

Re: Select 3 rows for each unique ID

Just a slight alteration of @LinusH answer uses lag to decide if to output or not (saves a variable and line of code):

data have;
  input id  address $;
datalines;
1    A
1    B
1    C
2    HH
2    KK
2    NN
2    MM
;
run;
data want;
  set have;
  by id;
  if first.id or lag(first.id) or lag2(first.id) then output;
run;
Super User
Posts: 9,691

Re: Select 3 rows for each unique ID

data have;
  input id  address $;
datalines;
1    A
1    B
1    C
2    HH
2    KK
2    NN
2    MM
;
run;

proc surveyselect data=have out=want sampsize=3;
strata id;
run;
Ask a Question
Discussion stats
  • 3 replies
  • 116 views
  • 3 likes
  • 4 in conversation