Programming the statistical procedures from SAS

Boosting Sampling - Please Help Thank you

Frequent Contributor
Posts: 95

Hi All,

I have never used boosting sampling before. Please see the code below. Could anyone explain what this code does and what the objective of the boosting technique is?

I started with train_pot, which has 4 observations with target=1 and 34 with target=0, and I end up with train_final, which has 34 observations with target=1 and 34 with target=0 (50:50).

Your help would be much appreciated.

Many Thanks

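/* Boost the training file to a 50:50 split of target=1 vs target=0 */

/* Draw &max_cust. rows per target level, with replacement (method=urs). */
/* train_pot must already be sorted by target for the STRATA statement.  */
proc surveyselect data=train_pot
     out=train_pot
     method=urs
     sampsize=&max_cust.
     seed=0;              /* 0 = seed taken from the clock, not repeatable */
strata target;
run;

/* NumberHits = how many times each selected row was drawn */
proc freq data=train_pot;
table target*numberhits / out=check_boost;
run;

/* Write each selected row NumberHits times */
data train_final;
set train_pot;
do i = 1 to numberhits;
  output;
end;
run;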

Trusted Advisor
Posts: 1,195

Re: Boosting Sampling - Please Help Thank you

Hi,

What is the desired output you are looking for as a result of boosting sampling?

Frequent Contributor
Posts: 95

Re: Boosting Sampling - Please Help Thank you

I would like to boost the training file to a 50:50 split of target=1 vs target=0.

Thank you.


Trusted Advisor
Posts: 1,195

Re: Boosting Sampling - Please Help Thank you

Sorry for the late reply. The objective is to oversample a rare event in the model development data. But with the method above, train_final may contain the same observation multiple times for target=1, right?
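
For what it's worth, here is a minimal sketch of the same boosting step, not a definitive implementation. It assumes the OUTHITS option of PROC SURVEYSELECT is acceptable in place of the NumberHits loop, so that every draw is written as its own row. train_pot, target and &max_cust. are taken from the original post; train_sorted, train_boost and the seed value 12345 are made-up names and values for illustration only.

```sas
/* Sketch only. train_sorted, train_boost and seed=12345 are hypothetical. */

/* The STRATA statement expects the input sorted by the strata variable. */
proc sort data=train_pot out=train_sorted;
   by target;
run;

proc surveyselect data=train_sorted
                  out=train_boost
                  method=urs           /* unrestricted sampling, with replacement */
                  sampsize=&max_cust.  /* number of draws in each target level    */
                  seed=12345           /* fixed seed so the sample can be repeated */
                  outhits;             /* one output row per draw, no loop needed */
   strata target;
run;

/* Check the result: each target level should show &max_cust. rows. */
proc freq data=train_boost;
   tables target;
run;
```

If &max_cust. resolves to 34, as the counts in the question suggest, the 4 target=1 rows have to be repeated to reach 34, so duplicates in that group are unavoidable with this method. The target=0 group is also resampled with replacement, so some of its original rows can appear more than once while others are left out.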
