About derbygun

derbygun · ‎07-23-2022

proc logistic data = A descending; Class diabetes (ref = '0')/ param = ref; Class arthritis (ref = '0')/ param = ref; class sex (ref = '0')/ param = ref; model DrugA (EVENT = '1') = age sex diabetes arthritis prop_below_poverty prop_with_highschool; run; The above SAS codes show the simple logistic regression I did without accounting for the repeated values of " prop_below_poverty" and "prop_with_highschool"

derbygun · ‎07-23-2022

Hi all, I have a dataset with a binary outcome (The prescription of drug A Yes/No). My dataset is at patient level, meaning there are unique patients in the dataset. We wanted to consider how neighborhoods could affect the use of drug A, so we merged our data by census tracts to neighborhood-level factors (proportion in the census tract living below the poverty level, proportion with a high school degree). The dataset is now set up such that patients in the same census tracts have the same neighborhood level values (see dataset below). I want to run a logistic regression to predict the use of drug A, but I would like to account for the repeated values as a result of the census tract. How do I do this? PatientID age sex diabetes arthritis Drug A census_tract prop_below_poverty prop_with_highschool 1 47 male 0 1 1 47157002000 15.0 47.0 2 51 female 0 0 1 47157002000 15.0 47.0 3 34 female 1 1 0 47157002000 15.0 47.0 4 65 male 1 0 0 47157008500 8.6 75.0 5 27 male 1 0 1 47157008500 8.6 75.0 6 34 male 0 0 0 47157008500 8.6 75.0 7 70 female 1 1 1 47157008500 8.6 75.0 8 62 male 1 0 1 47157021136 12.1 62.0 Drug A = dependent variable /outcome. (Was determined at patient level) Diabetes ( 0 = no diabetes, 1 = has diabetes) arthritis (0 = no arthritis, 1 has arthritis) prop_below_poverty and prop_with_highschool (continuous variables calculated as percentages) Thank you.

derbygun · ‎04-29-2022

Hi All, I am a bit confused about the variable to use with the repeated statement. My analysis has a binary outcome. Some of the variables included in the model were measured at the zip code level, meaning patients in the same zipcodes would have the same values for the variables I intend to include in the model. I am confused if the repeated statement should be zipcode or person_id. proc genmod data= y descending ; class zipcode gender (ref="0") race (ref="0") cgd (ref="0") cbf (ref="0") cdd (ref="0") csd (ref="0") obesepat (ref="0") hpsa (ref="0") LT_100FPL_F4 (ref="0") LT_138FPL_F4 (ref="0")/ param=glm; model b = age gender race cgd cbf obesepat cdd csd bachdegree hpsa LT_100FPL_F4 LT_138FPL_F4 / dist=bin link=logit; repeated subject=zipcode/ type=cs; lsmeans gender race chf cvd obesepat cad ckd hpsa LT_100FPL_F4 LT_138FPL_F4/diff oddsratio cl; run; OR proc genmod data= y descending ; class person_id gender (ref="0") race (ref="0") cgd (ref="0") cbf (ref="0") cdd (ref="0") csd (ref="0") obesepat (ref="0") hpsa (ref="0") LT_100FPL_F4 (ref="0") LT_138FPL_F4 (ref="0")/ param=glm; model b = age gender race cgd cbf obesepat cdd csd bachdegree hpsa LT_100FPL_F4 LT_138FPL_F4 / dist=bin link=logit; repeated subject= person_id/ type=cs; lsmeans gender race chf cvd obesepat cad ckd hpsa LT_100FPL_F4 LT_138FPL_F4/diff oddsratio cl; run; Thank you all.

derbygun · ‎04-28-2022

Okay, thank you. I wanted to ask what the repeated measure would be in this case. I am confused if it would be the zip code or the person_id. in my dataset, patients in the same zipcodes were assigned the same values for the variables I am interested in

derbygun · ‎04-28-2022

Hi All, I am currently working on an analysis with a binary outcome. I have a dataset in which some of the variables I want to include in the model were measured at the zipcode level. Meaning patients in the same zipcodes would have the same values. How do I account for this in my logistic regression model? Thanks

derbygun · ‎03-23-2022

@ballardw Yes, the dates are actual SAS value dates (numeric). For each person _id, I did a proc sort by date to see the dates in ascending order. I have the patients' medication names, so I categorized them into the three possible different classes. When I did that, I realized some patients received multiple medications on the same date, so I can not categorically classify them into a particular drug class. My main aim is to delete patients that received multiple medications on the same date. I am attaching another data set below. PersonID Visit ID Names of Medications Start date Drug class A Drug class B Drug class C 2 3866897 dula 28-Nov-18 0 1 0 2 6139545 dula 19-Jun-19 0 1 0 2 14110036 dula 7-Oct-20 0 1 0 4 3866996 dapa-met 12-Nov-18 1 0 0 4 3866996 Insu-lixi 12-Nov-18 0 1 0 4 6048002 dula 8-Jun-20 0 1 0 4 60410643 dula 5-Oct-20 0 1 0 6 1453918 Sita 22-Mar-17 0 0 1 6 2470214 dula 21-May-18 0 1 0 6 3866906 dula 17-Dec-18 0 1 0 I would like to delete patient 4 because he received multiple medications (dapa-met, Insu-lixi )on the same date

derbygun · ‎03-23-2022

Hi All, I am working on an electronic health record database where I have selected individuals using some specific drugs. I want to use the last medication received to categorize each patient into a particular drug class. However, some received multiple drugs on the same date and I would like to completely remove them from my data. See the example table below; PersonID Visit ID Medication description Start date Drug class A Drug class B Drug class C 2 389965 Drug class A 28th Nov 2015 1 0 0 2 389965 Drug class A 1st Feb 2016 1 0 0 2 614578 Drug class A 19thJune 2019 1 0 0 2 1456893 Drug class A 10th Oct 2020 1 0 0 4 604822 Drug class B 13 May 2019 0 1 0 4 965534 Drug class B 19 Aug 2019 0 1 0 4 453398 Drug class B 01 April 2020 0 1 0 4 212234 Drug class B 13 May 2020 0 1 0 4 212234 Drug class A 13 May 2020 1 0 0 For patient 2, based on their last date they are certainly on drug class A, but for patient 4 based on their last date, he was on a multiple drug combination. How do I delete patient 4 and all similar patients from my database using just the data from their more recent records? Thank you all.

Online Status	Offline
Date Last Visited	‎08-25-2022 09:49 PM

Re: Logistic regression -accounting for clustering/repeated values

Logistic regression -accounting for clustering/repeated values

Proc Genmod / Repeated statement

Re: Accounting for clustering within my dataset in a logistic regressi...

Accounting for clustering within my dataset in a logistic regression a...

Re: Excluding those with multiple drug class claims on same index date

Excluding those with multiple drug class claims on same index date