05-10-2013 05:45 AM
I am new to SAS but need to do the following:
My data (this is just dummy data; the full set contains 25 million obs):
|phone number|HV Customer|Region|Age|Port in|pmt method|spend bucket|
|0832122256|High Value|Gauteng|25|yes|Prepaid|R120 to R155|
|0833132424|Low Value|Gauteng|26|yes|postpaid|R0 to R50|
|0834532675|Very high Value|KZN|12|yes|postpaid|R180 to R230|
|0854217863|Mid Value|Limpopo|20|no|prepaid|R260 to R300|
Create a random sample which consists of the following:
Any idea how to go about tackling this problem?
05-10-2013 07:36 AM
Those are very specific criteria, so you may have to use more than one method. I would suggest reading up on PROC SURVEYSELECT, which is used for extracting samples. You can create a dataset that contains the stratification variables and the number of observations to return for each combination of their values. This dataset is then referenced in the SAMPSIZE= option in PROC SURVEYSELECT, together with a STRATA statement naming the same variables. There are many other options you can specify to control the output.
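As a rough sketch of that approach (not a full solution to your criteria): the dataset names work.customers, work.sample_sizes and work.sample, the variable name region, its length, and the per-stratum counts are all made up for illustration. The sample-size dataset needs the same STRATA variables as the input data, with matching type and length, plus a variable named _NSIZE_ (or SAMPSIZE) holding the count for each stratum.

```sas
/* Hypothetical names throughout: work.customers, region, work.sample_sizes. */
/* The counts below are placeholders, not the criteria from the question.    */

/* 1. One row per stratum, with the number of obs to draw in _NSIZE_.        */
/*    The length of region is assumed; it must match the input dataset.      */
data work.sample_sizes;
    length region $10;
    input region $ _NSIZE_;
    datalines;
Gauteng 100
KZN 50
Limpopo 50
;
run;

/* 2. Both datasets must be sorted by the STRATA variable(s). */
proc sort data=work.customers;    by region; run;
proc sort data=work.sample_sizes; by region; run;

/* 3. Simple random sample without replacement within each stratum. */
/*    SEED= makes the draw reproducible.                            */
proc surveyselect data=work.customers
                  out=work.sample
                  method=srs
                  sampsize=work.sample_sizes
                  seed=12345;
    strata region;
run;
```

To stratify on more than one variable (say region and pmt method), list them all in the STRATA statement and in both BY statements, and give the sample-size dataset one row per combination. As far as I know, every stratum that occurs in the input data needs a row in the sample-size dataset, and a requested count larger than the stratum itself will raise an error unless you add the SELECTALL option.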
Hope this points you in the right direction.