Split sample into N subsamples with equal number of obs

Accepted Solution Solved
Reply
Super Contributor
Posts: 420
Accepted Solution

Split sample into N subsamples with equal number of obs

Hello Everyone,

I have a large sample and I want to split it into subsamples (new files) with equal number of observation to process one by one.

So say in the below sample with 10 obs, how can I ask SAS to make a 5 NEW FILEs A1 A2 A3 A4 A5 , each has 2 obs.

I can do the calculation of number obs and number of subsample manually myself.

Thank you for your help.

HHC

data a;

input id value;

datalines;

1 2

2 3

3 2

4 6

5 9

6 6

7 95

8 9

9 12

10 1

;run;


Accepted Solutions
Solution
‎11-08-2013 05:57 PM
Super User
Posts: 11,343

Re: Split sample into N subsample with equal number of obs

If you want to add a grouping variable one way would be:

groupid = mod(_n_,number of groups);

Which will mark every nth record. If your dataset is not an exact multiple of n then you will have one more record for some groups.

The default values will go 0 to n-1, if you want 1 to n then add 1:

groupid = 1+ mod(_n_,number of groups);

View solution in original post


All Replies
Solution
‎11-08-2013 05:57 PM
Super User
Posts: 11,343

Re: Split sample into N subsample with equal number of obs

If you want to add a grouping variable one way would be:

groupid = mod(_n_,number of groups);

Which will mark every nth record. If your dataset is not an exact multiple of n then you will have one more record for some groups.

The default values will go 0 to n-1, if you want 1 to n then add 1:

groupid = 1+ mod(_n_,number of groups);

🔒 This topic is solved and locked.

Need further help from the community? Please ask a new question.

Discussion stats
  • 1 reply
  • 574 views
  • 0 likes
  • 2 in conversation