Hi folks! Have been working on a VA data analysis project that aims to predict the workload needed for a future inpatient based on his/her demographic, DRG, and health care characteristics. The response variable is PCRVU (Primary-Care relative value unit for all primary care visits during the year) which is continous, and we have a cross-sectional data set pulled out of, up to now, 6 different VA facilities. The are a number of independent variables, that can be grouped as 1.health related (such as Inpatient Days (LOS), CanScore (severity of the patient illness), assigned provider, etc.) 2. patient demographic attributes (such as zip code, gender, insurance status, etc.) and 3. war-related columns (such as radiation status, agent orange status, etc.). Previous attempts were using SAS E-Miner for OLS regression and CART that could not yield a reasonable R-Square. I'm thinking to use GLMM or GAM procedures but not pretty sure the way to approach the problem. Any helpful/professional comment would be appreciative. Thanks! Issac
... View more