05-01-2017 10:27 PM
I am analyzing prescription drug costs data from insurance claims. Health care data is heavily skewed with lots of 0's and a few really high cost patients (long tail).
I need to get the mean cost per patient in cohort 1 vs cohort 2. I have the total cost for each individual patient. instead of just taking the log of the cost, then proc means, transform back and be done, I have been asked to use a two-model (proc genmod with log-link). I have no idea how to do this. Any references or examples?
I have basic analytic file:
patid, cohort, covariates 1-6, drugs 1-5 count, drugs 1-5 cost. The zero values are currently just a ".".
05-02-2017 03:13 AM - edited 05-02-2017 03:21 AM
The example here is a proc genmod with a gamma distributed response variable and a log link function
05-02-2017 10:38 AM
Thanks for the response. This makes sense from a regression point of view. I guess, I am just stuck in not knowing what to do with that information. At the end of the day, i need a table that says Cohort A spent $650 on drugs, and Cohort B speng $500, and the diffeerence was significant. I am not sure how to go about getting that sort of output.