Thank you. I changed the missing codes back to missing.
1) I tested my response (charge), and it follows a gamma distribution.
Below is my code for the model.
proc genmod data=have;
class ;
model charge=year gender race ageGroup/dist=gamma link=log lrci;
run;
I did not include marital_status in the model because its p-value was greater than 0.05 (I use alpha=0.05). When I apply the code to my real data, Pr>ChiSq for the intercept is 0.8109. I am not sure what I should do with the intercept, since its p-value is so large.
You mentioned the scoring method. I am wondering whether I should use logistic regression for my model and test all the selection methods: forward, backward, stepwise, and score?
2) I also need to predict the total charges by year. I am thinking of using linear regression and building the model in proc reg with model charge=year. Thanks for any suggestions or hints.
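To make 2) concrete, here is a rough sketch of what I have in mind. It sums charge within each year first, which is my assumption about what "total charges by year" should mean; the names yearly, total_charge, pred, and predicted_total are just placeholders I made up.

```sas
/* Sketch only: have, charge, and year are from my data;           */
/* yearly, total_charge, pred, predicted_total are placeholders.   */

/* Step 1: one row per year with the summed charge */
proc sql;
    create table yearly as
    select year, sum(charge) as total_charge
    from have
    group by year;
quit;

/* Step 2: simple linear trend of the yearly total on year */
proc reg data=yearly;
    model total_charge = year;
    output out=pred p=predicted_total;  /* fitted values per year */
run;
quit;
```

With only a few years of data this fit would rest on very few points, so I am not sure how far the predictions can be trusted.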