SAS Programming

DATA Step, Macro, Functions and more
venkatard
Calcite | Level 5

How can I split a dataset 80/20 by a common id? That is, I want the split (80% of the data in one part, 20% in the other) to be made on the basis of id.

id    score    forum
12    89       98
12    87       67
13    56       87
13    45       98
14    78       98
15    23       87
16    54       23

3 REPLIES
AncaTilea
Pyrite | Level 9

Hi,

Given the example you provided, what do you want the end result to look like?

Thanks.

DBailey
Lapis Lazuli | Level 10

Presuming that you want all of the records associated with 80% of unique IDs to be identified:

data have;
    input id score forum;
    cards;
12     89          98
12     87          67
13     56          87
13     45          98
14     78          98
15     23          87
16     54          23
;

proc sql;
    /* one row per distinct id, with a placeholder for its random value */
    create table ids as select distinct id, 0 as id_rand_val from work.have order by id;

    /* draw one uniform(0,1) value per id */
    update ids set id_rand_val=rand('uniform');

    /* every record inherits the group of its id */
    create table want as
    select
        t1.*,
        case when t2.id_rand_val <= .8 then 'Group1' else 'Group2' end as ID_Group
    from
        work.have t1
        inner join work.ids t2
            on t1.id=t2.id;
quit;
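
Note that comparing rand('uniform') with .8 puts each id in Group1 with probability 0.8, so the share of ids in Group1 is only about 80% on average and can be far off when there are few ids. If a fixed 80% of the distinct ids is needed, one possible approach is to shuffle the ids and cut the shuffled list at 80%. The following is a minimal sketch under that assumption; the names ID_LIST, ID_GROUPS and WANT_EXACT and the seed 2024 are illustrative and not part of the answer above.

```sas
/* Sketch only: ID_LIST, ID_GROUPS, WANT_EXACT and the seed 2024 are
   illustrative choices, not part of the original answer. */

/* One row per distinct id from the HAVE data set above. */
proc sql;
    create table id_list as
    select distinct id from work.have;
quit;

/* Attach a reproducible uniform(0,1) draw to each id. */
data id_list;
    set id_list;
    if _n_ = 1 then call streaminit(2024);
    u = rand('uniform');
run;

/* Shuffle the ids by sorting on the random draw. */
proc sort data=id_list;
    by u;
run;

/* The first round(0.8 * n) shuffled ids form Group1, the rest Group2. */
data id_groups;
    set id_list nobs=n_ids;
    length ID_Group $6;
    if _n_ <= round(0.8 * n_ids) then ID_Group = 'Group1';
    else ID_Group = 'Group2';
    drop u;
run;

/* Every record inherits the group of its id. */
proc sql;
    create table want_exact as
    select t1.*, t2.ID_Group
    from work.have t1
         inner join id_groups t2
             on t1.id = t2.id;
quit;
```

With the five distinct ids in the example, round(0.8 * 5) = 4, so four ids go to Group1 and one to Group2. The split is exact in terms of ids, not records, since ids can have different numbers of records.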

data_null__
Jade | Level 19

In the output, SELECTED=1 marks the RATE= sample, in this case the 20%. SELECTED=0 therefore marks the remaining 1-rate part, the 80%.

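data score;
   input id $ score forum;
   cards;
12     89          98
12     87          67
13     56          87
13     45          98
14     78          98
15     23          87
16     54          23
;;;;
run;

/* OUTALL keeps every record and adds the 0/1 flag Selected */
proc surveyselect data=score seed=2 rate=.2 outall;
   samplingunit id;   /* sample whole ids, so all records of an id stay together */
run;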
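
If two separate data sets are wanted rather than one flagged data set, the Selected flag can be used to route the records. This is a rough sketch, assuming the SCORE data set above; the output names FLAGGED, PART80 and PART20 are made up for illustration.

```sas
/* Sketch only: FLAGGED, PART80 and PART20 are illustrative names. */

/* Same sampling step as above, but with an explicit output data set. */
proc surveyselect data=score out=flagged seed=2 rate=.2 outall;
    samplingunit id;
run;

/* Route each record by its flag: Selected=1 is the 20% sample of ids,
   Selected=0 is the remaining 80%. */
data part80 part20;
    set flagged;
    if Selected = 1 then output part20;
    else output part80;
run;
```

Because the sampling unit is id, all records of a given id end up in the same data set.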
