Two part model for health care costs

05-01-2017 10:27 PM

Hi,

I am analyzing prescription drug cost data from insurance claims. Health care cost data is heavily skewed, with lots of 0's and a few really high-cost patients (a long right tail).

I need to get the mean cost per patient in cohort 1 vs cohort 2. I have the total cost for each individual patient. Instead of just taking the log of the cost, running proc means, transforming back and being done, I have been asked to use a two-part model (proc genmod with a log link). I have no idea how to do this. Any references or examples?

I have a basic analytic file:

patid, cohort, covariates 1-6, drugs 1-5 count, drugs 1-5 cost. The zero values are currently just a ".".

Thanks!

Chris

05-02-2017 03:13 AM - edited 05-02-2017 03:21 AM

The example here is a proc genmod with a gamma distributed response variable and a log link function
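For readers without the linked example, here is a minimal sketch of how the two parts might be set up. The dataset name (claims), the cost variable (total_cost), and the covariate names (cov1-cov6) are assumptions for illustration, not names from the original file, and the covariates are treated as continuous. Missing costs are recoded to 0 because the original post says zeros are stored as ".".

```sas
/* Sketch only: claims, total_cost and cov1-cov6 are hypothetical names. */
data claims2;
  set claims;
  if total_cost = . then total_cost = 0;  /* "." means no spend in this file */
  any_cost = (total_cost > 0);            /* 1 if the patient had any cost   */
run;

/* Part 1: probability of having any cost (logistic model) */
proc genmod data=claims2 descending;
  class cohort;
  model any_cost = cohort cov1-cov6 / dist=binomial link=logit;
run;

/* Part 2: cost among patients with positive cost (gamma, log link) */
proc genmod data=claims2;
  where total_cost > 0;
  class cohort;
  model total_cost = cohort cov1-cov6 / dist=gamma link=log type3;
  /* ILINK reports cohort means in dollars; DIFF with EXP gives the
     ratio of mean costs between cohorts, with confidence limits */
  lsmeans cohort / ilink diff exp cl;
run;
```

The LSMEANS output from the second model describes cost among users only. It is not yet the overall mean per patient, which also depends on the first model.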

05-02-2017 10:38 AM

Thanks for the response. This makes sense from a regression point of view. I guess I am just stuck on what to do with that information. At the end of the day, I need a table that says Cohort A spent $650 on drugs, Cohort B spent $500, and the difference was significant. I am not sure how to go about getting that sort of output.

Thanks!

Chris

a week ago

I wrote a paper on this:

https://support.sas.com/resources/papers/proceedings15/3600-2015.pdf

Maybe that will help.
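
For the per-cohort dollar figures asked about earlier in the thread, the usual two-part identity may be useful. This is a sketch in generic notation, not taken from the paper: $Y$ is a patient's total cost, $X$ the covariates including cohort, and $\alpha$, $\beta$ are assumed names for the coefficients of the logistic part and the gamma (log-link) part.

```latex
\[
E[Y \mid X] \;=\; \Pr(Y > 0 \mid X)\,\cdot\, E[Y \mid Y > 0,\, X]
\]
\[
\Pr(Y > 0 \mid X) = \frac{1}{1 + e^{-X\alpha}},
\qquad
E[Y \mid Y > 0,\, X] = e^{X\beta}
\]
```

Under these assumptions, each patient's predicted cost is the product of the two fitted values, and averaging those products within a cohort gives that cohort's mean cost per patient. A significance test for the difference in these combined means does not come directly from either model alone; bootstrapping is one common way to get it.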