About ZZ_Zheng

ZZ_Zheng · ‎05-17-2019

Hi @ballardw, forgiving my late response. Thank you so much, it is what I want to handle the "overlap"!

ZZ_Zheng · ‎05-14-2019

Hi Astounding, thanks. I would try a different way!

ZZ_Zheng · ‎05-14-2019

Hi all, I made a simple sample and hope this explains my question clearly. What I want to do is calculate the proportion of days covered by medication. The data set "Have" contains end_date: the end date of the medication observation period. stat_dt: the start date of the medication observation period. fill_dt: when patients refill their drug. supply_days: supply days of each fill. days_in: I calculate this by "end_date" - "start_date", is the observations period in days between end date and start date. It is obvious each patient will have different observation length. Firstly I transposed "Have" twice to a wide format with each row representing one patient, " have_fill_dates" transposed fill date and "have_days_supply" transposed supply days, then I merged two data sets and created many dummy days representing each day in observation period. Then I got the problem, how to specific different length for each patient in array statement? I used "####" to represent the problems need fix. The first reason I create many single dummy days during the observation period rather than just sum up supply days and then divide by observation period, which is much more convenient than this array is that I will later need to calculate proportion by month and by quarter, I need these dummy days. the second reason is that there are some overlaps, such as the second observation of stuyd_id 2, he refilled his drugs on " 09/08/2016" which is within 30 days after previous refill date "08/22/2016", and I need to move "09/08/2016" to "09/22/2016" to then assume patients refill their drugs right after exhausting previous fill. My primary question is how to create different array length? "####" part in below program. And the second question is any suggestion on how to move the fill_dt right after exhausting the previous fill more efficient in the array do loop part. for example, move "09/08/2016" to "09/22/2016" for the second observation of study id 2. Thank so much! data have; informat study_id $4.; informat end_date mmddyy10.; informat fill_dt mmddyy10.; informat supply_days best8.; informat start_dt mmddyy10.; informat days_in best9.; input study_id end_date fill_dt supply_days start_dt days_in; format end_date fill_dt start_dt mmddyy10.; datalines; 1 02/13/2018 07/07/2017 30 08/27/2017 170 1 02/13/2018 08/25/2017 30 08/27/2017 170 1 02/13/2018 12/19/2017 35 08/27/2017 170 1 02/13/2018 01/23/2018 25 08/27/2017 170 1 02/13/2018 03/09/2018 35 08/27/2017 170 2 10/14/2017 08/22/2016 30 10/27/2016 352 2 10/14/2017 09/08/2016 30 10/27/2016 352 2 10/14/2017 12/13/2016 30 10/27/2016 352 2 10/14/2017 01/11/2017 30 10/27/2016 352 2 10/14/2017 02/04/2017 30 10/27/2016 352 2 10/14/2017 02/11/2017 30 10/27/2016 352 2 10/14/2017 05/01/2017 30 10/27/2016 352 2 10/14/2017 05/28/2017 30 10/27/2016 352 2 10/14/2017 08/01/2017 25 10/27/2016 352 ; run; proc sort data=have; by study_id end_date;run; proc transpose data=have out=have_fill_dates (drop=_name_) prefix=fill_dt; by study_id end_date start_dt days_in; var fill_dt; run; proc transpose data=have out=have_days_supply (drop=_name_) prefix=days_supply; by study_id end_date start_dt days_in; var supply_days; run; /*merge fill dates and days supply*/ data both_havex; merge have_fill_dates have_days_supply; by study_id;run; /*Need help*/ data both_have; set both_havex; array daydummy(days_in or ####) day1-day####; array filldates(*) fill_dt1 - fill_dt141; array days_supply(*) days_supply1 - days_supply141; do ii=1 to ####; daydummy(ii)=0; end; do ii=1 to ####; do i=1 to 141 while (filldates(i) ne .); if filldates(i) <= pre_2y_dt + ii -1 <= filldates(i)+days_supply(i)-1 then daydummy(ii)=1; end; end; drop i ii; dayscovered=sum(of day1 - day####); p_dayscovered=dayscovered/####; run;

ZZ_Zheng · ‎04-09-2019

Hi @Kurt_Bremser. Thanks!

ZZ_Zheng · ‎04-09-2019

data origin; length study_id $10; input study_id quarter; datalines; 1 1 1 2 1 3 1 4 1 5 1 6 1 7 1 8 10 1 10 2 10 3 10 4 10 5 10 6 10 7 10 8 100 1 100 2 100 3 100 4 100 5 100 6 100 7 100 8 1000 1 1000 2 1000 3 1000 4 1000 5 1000 6 1000 7 1000 8 1001 1 1001 2 1001 3 1001 4 1001 5 1001 6 1001 7 1001 8 ; run; Hi @Kurt_Bremser and @All. Sorry, I should create a simple sample data, my raw data is large and credential and I using sas enterprise under the off-line system. Thanks!

ZZ_Zheng · ‎04-09-2019

s This is the original data set used to resample, resample unit is the study_id, or resample a cluster of 8 observations of sample study_id. Thanks!!

ZZ_Zheng · ‎04-09-2019

Hi, I hope I would not confuse those without domain knowledge. I want to bootstrap 100 data sets with replacement and fit a random effect model in each bootstrap sample. Below is the data set I would use to resample, each unique "study_id" represent a subject, each subject have 8 records("quarter"). Then I use proc surveyselect and specify the sample unique = "study_id" %let NumSamples = 10; /* number of bootstrap resamples */ /* 2. Generate many bootstrap samples */ proc surveyselect data=origin seed=345 out=Bootsample1(rename=(Replicate=SampleID)) method=urs /* resample with replacement */ samprate=1 /* each bootstrap sample has N observations */ OUTHITS /* option to suppress the frequency var */ reps=&NumSamples; /* generate NumSamples bootstrap resamples 426*/ samplingunit study_id; run; After resampling, we see that study_id 10000 was selected twice, he has 2*8=16 observations. The final step is to fit a random effect model with a random intercept for each study_id, which require to specify the cluster variables, It requires study_id 10000 was account as two subjects with the same 8 observations and sample results, rather than one subjects with 16 observations. For example, treat the second 8-duplicate-records of "study_id" 10000 as another different "study_id" 10000A and keep the first "study_id" 10000 then we have two study_id 10000 and 10000A that have same observations and same results. proc glimmix data=zheng_no0 method=quad(qpoints=10); /*edcn all factors latent class risk_score*/ class study_id pred3class(ref='2') gender age_cat(ref='1') ge_12mo_flag(ref='0'); effect spl = spline(log_risk/naturalcubic degree=3 knotmethod=percentiles(3)); model pct_sum = pred3class gender age_cat spl ge_12mo_flag/link=log s dist=poisson offset=logt; random int / subject=study_id; run; Is there any way I could just change the study_id? Thanks!

ZZ_Zheng · ‎03-04-2019

Thanks! PG Stats. It works well!

ZZ_Zheng · ‎03-04-2019

Hi, I have a 6700 person sample size with 9 binary (0 or 1) variables, I want to see the most common (maybe top 20) combinations of these 2^9 possible combinations since most combinations will have 0 or 1 observation. In other words, I want to sort the proc tabulate output by "sum" or "n". I also create a variable called x, which equal 1 for every observation. Below are my code and SAS output proc tabulate data=paper order=freq; class structural_flag social_support_flag behaviors_flag bills_flag housing_flag mental_health_flag resources_flag jail_flag food_cat; var x; table structural_flag*social_support_flag*behaviors_flag*bills_flag*housing_flag*mental_health_flag*resources_flag*jail_flag*food_cat, (x)*(Sum colpctn);run;

Online Status	Offline
Date Last Visited	‎10-02-2020 03:00 PM

Re: How to create different array length for each ID

Re: How to create different array length for each ID

How to create different array length for each ID

Re: How to change duplicate rows to another unique rows

Re: How to change duplicate rows to another unique rows

Re: How to change duplicate rows to another unique rows

How to change duplicate rows to another unique rows

Re: How to sort by the most common combinations in proc tabulate.

How to sort by the most common combinations in proc tabulate.

Re: How to create different array length for each ID

Re: How to create different array length for each ID

How to create different array length for each ID

Re: How to change duplicate rows to another unique rows

Re: How to change duplicate rows to another unique rows

Re: How to change duplicate rows to another unique rows

How to change duplicate rows to another unique rows

Re: How to sort by the most common combinations in proc tabulate.

How to sort by the most common combinations in proc tabulate.