DATA Step, Macro, Functions and more

splitting data

Accepted Solution Solved
Reply
Super Contributor
Posts: 420
Accepted Solution

splitting data

Hi,

I have a big file and want to split it into different smaller file with the same record and fewer column each file.

In the below sample, I will split have into b and c file.

My issue is that I want to delete the columns already used so the have file becomes smaller after each round.

Of course I can do something like " data have; set have; drop..." but it takes time to run that data step.

 

Thank you.

 


data have;
input x b1 b11 b2 b22;
datalines;
1 5 10 20 1
2 5 1 2 2
3 50 100 200 3
;run;

%macro split_data (rr=);
data data_&rr ; set have;
keep x b&rr.: ;run; 
%mend;
%split_data (rr=1);
%split_data (rr=2);
run;

Accepted Solutions
Solution
‎02-23-2017 05:51 PM
PROC Star
Posts: 7,492

Re: splitting data

Why not do it all in one data step? e.g.,

 

data have;
input x b1 b11 b2 b22;
datalines;
1 5 10 20 1
2 5 1 2 2
3 50 100 200 3
;
run;

data data1 (keep=x b1--b11)
         data2 (keep=x b2--b22);
  set have;
run;

Art, CEO, AnalystFinder.com

 

View solution in original post


All Replies
Super User
Posts: 19,870

Re: splitting data

If you're splitting your table via columns you may as well set up a fact/dim table structure using a star or snowflake schema. 

If thats an option. 

 

What makes your file 'big'?

Solution
‎02-23-2017 05:51 PM
PROC Star
Posts: 7,492

Re: splitting data

Why not do it all in one data step? e.g.,

 

data have;
input x b1 b11 b2 b22;
datalines;
1 5 10 20 1
2 5 1 2 2
3 50 100 200 3
;
run;

data data1 (keep=x b1--b11)
         data2 (keep=x b2--b22);
  set have;
run;

Art, CEO, AnalystFinder.com

 

Super Contributor
Posts: 420

Re: splitting data

Perfect, Art297.
The combine all in 1 data step save a lot of time.
HHC
☑ This topic is solved.

Need further help from the community? Please ask a new question.

Discussion stats
  • 3 replies
  • 153 views
  • 1 like
  • 3 in conversation