About Jay_210

Jay_210 · ‎01-27-2023

I have a data set that lists departments and employees (approximately 100 departments, with 80-160 employees in each). I want to easily select the first X employees in each department, where X is specific to the department. For instance, I may want 17 from Department 1, 19 from Dept 2, 24 from Dept 3, and so on. I have a second data set that lists each department and the number of observations I want to select from it. I do not want them picked randomly; they need to be the first X listed in that department.

Department  Employee
dept_1      jim
dept_1      jon
dept_1      nancy
dept_1      eric
dept_1      tom
dept_2      debbie
dept_2      susan
dept_2      karen
dept_2      don
dept_2      nate
dept_2      chris
dept_2      brian
dept_3      valerie
dept_3      bob
dept_3      mike
dept_3      tim
dept_3      ryan
dept_3      stephanie
dept_3      tyler

Department  Obs_need
dept_1      2
dept_2      4
dept_3      5

Thanks.
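One possible approach, sketched under assumptions rather than offered as the definitive answer: merge the counts onto the employee list by department and keep a running counter within each department. The data set names work.employees and work.obs_need (and the output names) are hypothetical; the variable names Department, Employee, and Obs_need come from the example above. PROC SORT's EQUALS option is requested explicitly so the original order within each department is preserved.

```sas
/* Hypothetical input names: work.employees (Department, Employee)
   and work.obs_need (Department, Obs_need). */

/* Sort both tables by Department. EQUALS keeps the original
   relative order of rows within each department. */
proc sort data=work.employees out=work.emp_sorted equals;
  by Department;
run;

proc sort data=work.obs_need out=work.need_sorted;
  by Department;
run;

/* Match-merge: Obs_need is carried across every row of its department. */
data work.first_x;
  merge work.emp_sorted(in=in_emp) work.need_sorted;
  by Department;
  if first.Department then row_in_dept = 0;  /* reset counter per department */
  row_in_dept + 1;                           /* sum statement retains the count */
  if in_emp and row_in_dept <= Obs_need;     /* keep only the first X rows */
  drop row_in_dept;
run;
```

With the sample data above this should keep jim and jon for dept_1, the first four names for dept_2, and the first five for dept_3. A department with no row in the counts table has a missing Obs_need and so contributes no rows.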

Jay_210 · ‎01-23-2023

This is interesting. I do get the same samprate/sampsize errors that I described in my reply to ballardw, though.

Jay_210 · ‎01-23-2023

When I run this with "samprate= .1" I get an error: ERROR: The SAMPRATE= option may not be specified with METHOD=PPS. If I change it to sampsize=10 I get an error: ERROR: For METHOD=PPS, the relative size of each sampling unit must not exceed (1/SAMPSIZE). I do like your suggestion to use Sampsize=datasetname once I can figure this out.

Jay_210 · ‎01-23-2023

I have, yes. I was going to reply but I still need to look into the PPS options for better understanding. I do have a list of entrant_ids and the number of entries they have. I was under the impression that wouldn't work for PPS by strata, but if I was wrong, it might work for me.

Jay_210 · ‎01-23-2023

I need to keep the duplicates in the file that I run surveyselect on because those are entries in the drawing. It gives them a better chance of getting picked. What I'm trying to figure out is how to make it not select an entrant after they have already been selected (based on entrant_id).

Jay_210 · ‎01-20-2023

I can't remove the duplicate entrant ID values because that would eliminate their multiple entries. Each line in the data set is sort of like their "raffle ticket" so if I remove any it removes their multiple "raffle tickets." I'm already using SRS, but the "without replacement" feature is going line by line. I need it to go Entrant ID by Entrant ID. I hope that makes sense.

Jay_210 · ‎01-20-2023

I'm trying to use proc surveyselect to conduct a random drawing. Entrants can have multiple entries, and I have a table that lists each entry. The problem is I'm getting duplicate winners because of the multiple entries. Is there a way to exclude duplicate observations by a certain column (Entrant_ID in this case)?

Consider the data set below. In actuality I have 100 dept names and about 3,000 entrant IDs, most of which are duplicates; there are really about 90 unique entrant IDs per Dept Name. Entrant IDs will not duplicate across different depts. Also, I will have a different actual sampsize for each dept, but I understand there is a way to make that reference a separate table, so I will be looking to do that.

Dept_Name  Entrant_ID
Dept 1     JRTHAL
Dept 1     TLSMIT
Dept 1     JRTHAL
Dept 1     VLNEW
Dept 2     MRREL
Dept 2     MNJON
Dept 2     MRREL
Dept 2     MNJON
Dept 2     NWCON
Dept 3     JRMCC
Dept 3     ADLON
Dept 3     JRMCC
Dept 3     ADLON
Dept 3     BFPIT
Dept 3     BFPIT

Code I am using:

proc surveyselect data=work.random_draw_entries out=all_winners method=srs reps=1 sampsize=10;
  strata dept_name;
run;

Also, I'm not tied to using surveyselect, and I have many ways to manipulate the data if you have suggestions. Thanks!
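A minimal sketch of one way to get unique winners, assuming it is acceptable to swap PROC SURVEYSELECT for a DATA step shuffle: give every entry a random draw position, keep only each entrant's earliest-drawn entry, then take the first 10 entrants per department in draw order. This mimics pulling tickets one at a time and skipping anyone who has already won, so extra entries still improve an entrant's chances. The seed value and the intermediate data set names are hypothetical; the input data set, variable names, and the sample size of 10 follow the code above.

```sas
/* Attach a random draw position to every entry ("raffle ticket"). */
data work.shuffled;
  set work.random_draw_entries;
  if _n_ = 1 then call streaminit(20230120);  /* hypothetical seed, for repeatability */
  draw_order = rand('uniform');
run;

/* Group each entrant's tickets, earliest-drawn ticket first. */
proc sort data=work.shuffled out=work.by_entrant;
  by dept_name entrant_id draw_order;
run;

/* Keep one row per entrant: the ticket that was drawn first. */
data work.first_ticket;
  set work.by_entrant;
  by dept_name entrant_id;
  if first.entrant_id;
run;

/* Put the surviving tickets back into draw order within each department. */
proc sort data=work.first_ticket;
  by dept_name draw_order;
run;

/* The first 10 distinct entrants drawn in each department are the winners. */
data work.all_winners;
  set work.first_ticket;
  by dept_name;
  if first.dept_name then draw_rank = 0;
  draw_rank + 1;
  if draw_rank <= 10;
run;
```

A department with fewer than 10 unique entrants simply returns all of them. The fixed 10 could be replaced by a per-department value merged in from a separate table of sample sizes.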

Date Last Visited	‎03-07-2023 05:16 PM

selecting different # observations per strata using dataset

Re: Random Drawing using Surveyselect - no duplicate winners

Re: Random Drawing using Surveyselect - no duplicate winners

Re: Random Drawing using Surveyselect - no duplicate winners

Re: Random Drawing using Surveyselect - no duplicate winners

Re: Random Drawing using Surveyselect - no duplicate winners

Random Drawing using Surveyselect - no duplicate winners
