About StatsGirl13

StatsGirl13 · ‎11-19-2012

Steve, Thanks for the response and joining the dialogue on how to choose from competing models. Your input was helpful. Martha

StatsGirl13 · ‎11-19-2012

Susan, Thank you for the information. This is most useful. I'd like some clarification on the scaling factor in the AIC. I'm not a mathematician, and I fail to see it when looking at the formula. Martha

StatsGirl13 · ‎10-30-2012

Hello, I am running a generalized linear model with a Poisson log link function (estimating expected claim frequency using claim counts as my dependent variable). I have, say, 20 predictor variables which are all categorical in nature. Our industry (insurance) tends to use these sorts of models a lot, but I need some guidance as to 1) how best to determine model fit and 2) how to compare one run to another for which is "better." Here's what I do now: 1. Attain convergence. 2. Examine the Scaled Deviance divided by its degrees of freedom. I've heard that values close to 1.0 are desirable. What does it mean if the value is below 1.0? Overdispersion of the data? (What does that mean?) My last runs yielded values between 0.05-0.22. My typical application can have 1.25 million observations, so looking at the GENMOD model fit table doesn't tell me much--the numbers are basically off the charts (in a good direction) based on the number of observations (and thus, df) I'm modeling. 3. Look at the AICC, knowing that "smaller is better." Typically I'm trying to compare AICC's from one run to another. I have observed very subtle differences (say, Model 1 has an AICC of 72,305 and Model 2 has an AICC of 72,320). Is this a meaningful difference? My intuition says not. 4. Use of residuals. I know the classic literature on GLMs says to always examine your residuals. I once tried it but found that due to my numbers any meaningful conclusions were difficult. Would it make sense to take a random sample of the residuals and examine those? 5. Use of the ASSESS statement. Tried it. Sounded intriguing. Got lost. Couldn't understand the output. I'd appreciate any information from some of the more seasoned GENMOD users/modelers out there. Thank you so much. Marty J.

StatsGirl13 · ‎10-08-2012

Thank you! These are very helpful.

StatsGirl13 · ‎10-05-2012

We'd like to take a set of data with complete address information and assign Census block groups. We currently license the SAS Visual Data Discovery package which includes SAS/Base, SAS/Graph, SAS/Stat, SAS/Enterprise Guide, and JMP. I have PROC GEOCODE which I believe can be used to assign longitude and latitude coordinates to a given street address. Then presumably those could be matched to Census TIGER/Shape files to pick up the census block. Does it sound like I'm on the right track? What is SAS/GIS? Is it a separately licensed product, or a procedure?

Online Status	Offline
Date Last Visited	‎09-01-2015 07:11 AM

Re: Determining a Good Fit When Using GENMOD

Re: Determining a Good Fit When Using GENMOD

Determining a Good Fit When Using GENMOD

Re: Assigning Census Block Group to My Data

Assigning Census Block Group to My Data

Re: Determining a Good Fit When Using GENMOD

Re: Determining a Good Fit When Using GENMOD

Determining a Good Fit When Using GENMOD

Re: Assigning Census Block Group to My Data

Assigning Census Block Group to My Data