Is there any way of replicating proc surveyselect in CAS for bootstrapping datasets - I have found sampling.srs action but can't seem to get the code working and as a second point proc surveyselect returns the number of times a row has been selected can we do the same with sampling.srs ?
Hello,
It's important to know which Viya version you are using , because in Viya 3.5 it is difficult (but possible) to get always the same bootstrap re-samples (across memory purges). In Viya 4, it's easy as there is a "reproducibility button" (plastically expressed).
Please submit :
%put &=sysvlong4;
%put &=SYSVIYARELEASE;
%put &=SYSVIYAVERSION;
... and tell us about the results (see log-screen)!
Also, in PROC SURVEYSELECT you probably use method=URS (sampling with equal probability and with replacement) , while the sampling.srs action corresponds to method=SRS.
Go here for what you need (proper bootstrap re-sampling in CAS), but we may need to assist you more to ensure reproducibility and repeatability (first tell us about your Viya release before I go into details here).
BR, Koen
A note on the above (see my earlier reply).
I understand that eternal reproducibility and repeatability may be desired, but we should not -- in the case of bootstrapping -- exaggerate its importance either. After all, if you get completely different results with slightly different re-samples, that is a good signal that something is wrong with the bootstrapping. Then you probably need more and/or bigger re-samples. As soon as you set up reliable bootstrapping, the actual samples (almost) don't matter anymore. The results will be identical to several decimal places.
Koen
April 27 – 30 | Gaylord Texan | Grapevine, Texas
Walk in ready to learn. Walk out ready to deliver. This is the data and AI conference you can't afford to miss.
Register now and lock in 2025 pricing—just $495!
Still thinking about your presentation idea? The submission deadline has been extended to Friday, Nov. 14, at 11:59 p.m. ET.
Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.
Find more tutorials on the SAS Users YouTube channel.