Re: SAS query - Split data to different dataset

scb · Posted 10-31-2020 10:03 PM

May I know how to split dataset into diff dataset based on diff team? I could have 900 teams. Thanks.

DATA have;
INPUT ID $ Team $ Point;
DATALINES;
111 T1 100
112 T1 300
113 T1 600

111 T2 550
112 T2 770
113 T2 890

111 T3 1000
112 T3 3003
113 T3 6003
;
run;

Desired result

dataset1 name: T1

ID Team Point

111 T1 100
112 T1 300
113 T1 600

dataset2 name: T2

ID Team Point

111 T2 550
112 T2 770
113 T2 890

dataset3 name: T3

ID Team Point

111 T3 1000
112 T3 3003
113 T3 6003

novinosrin · Posted 10-31-2020 10:30 PM


DATA have;
INPUT ID $ Team $ Point;
DATALINES;
111 T1 100
112 T1 300
113 T1 600
111 T2 550
112 T2 770
113 T2 890
111 T3 1000
112 T3 3003
113 T3 6003
;
run;

data _null_;
 if _n_=1 then do;
  dcl hash h(dataset:'have(obs=0)',multidata:'y',ordered:'y');
  h.definekey('id');
  h.definedata(all:'y');
  h.definedone();
 end;
 do until(last.team);
  set have;
  by team;
  h.add();
 end;
 h.output(dataset:team);
 h.clear();
run;

Ksharp · Posted 11-01-2020 03:36 AM

If your table was not sorted and not big.

DATA have;
INPUT ID $ Team $ Point;
DATALINES;
111 T1 100
112 T1 300
113 T1 600
111 T2 550
112 T2 770
113 T2 890
111 T3 1000
112 T3 3003
113 T3 6003
;
run;
proc freq data=have noprint;
table team/out=levels nopercent;
run;
data _null_;
 set levels;
 call execute(cat('data ',team,';set have;if team="',strip(team),'";run;'));
run;

Kurt_Bremser · Posted 11-01-2020 03:37 AM

The most important question here: what for?

In 99% of cases, splitting a dataset makes further processing more complicated and less efficient. So my answer is:

DON'T DO IT!

Please provide more details if you think you can persuade me that splitting your dataset makes sense.

Maxims of Maximally Efficient SAS Programmers
How to convert datasets to data steps
The macro for direct download as ZIP
How to post code
Please vote for Provide Sequential Search Capability for Hash Objects
How to deal with locked files on UNIX

Reeza · Posted 11-02-2020 12:36 AM

I can almost guarantee there's no good reason to do this in SAS...especially 900 times.

SAS query - Split data to different dataset