
GAMs in Insurance Modeling


Introduction

 

The insurance sector operates within an intricate framework of risk evaluation, policy pricing, and claims administration. Central to its operations is the imperative for reliable statistical models capable of forecasting outcomes with precision while customizing policies to meet individual requirements.

 

In response to this demand, Generalized Additive Models (GAMs) have gained prominence as a potent instrument, offering adaptability and accuracy beyond conventional methods. This post explores the significance of GAMs in insurance modeling, shedding light on their transformative effects within the industry.

 

Evolution of GAMs in Insurance Modeling

 

In the realm of insurance modeling, linear regression models have historically been favored for their straightforwardness and ease of interpretation. Nevertheless, they frequently prove inadequate in capturing the intricate, non-linear dynamics inherent in insurance datasets. The emergence of GAMs has marked a paradigm shift in this domain, enabling the incorporation of non-linear relationships and, thus, furnishing a more comprehensive insight into the complexities of risk assessment.

 

The Mechanics of GAM in Insurance Modeling

 

GAMs expand upon linear models by integrating smooth functions of predictors, thereby capturing the non-linear patterns inherent in the dataset. Unlike their linear counterparts, GAMs refrain from assuming a predetermined form for the relationship between predictors and the response variable. This characteristic renders GAMs exceptionally well-suited for addressing the multifaceted nature of insurance risks.

 

Linear regression enjoys broad usage due to its capacity to offer a straightforward, linear explanation and interpretation of relationships. Moreover, it consolidates the impact of predictor variables into a single value, specifically the predictor variable's coefficient, thereby facilitating clearer comprehension and analysis of the data. Assuming normality, the sampling distribution of the coefficient estimates can be reliably determined, simplifying the assessment of the predictor variable's significance in the model. Additionally, the predictive power of each of the predictors can be assessed and compared by analyzing the standardized coefficients.
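The comparison of standardized coefficients is easy to demonstrate. The sketch below fits an ordinary least-squares model to simulated policy-like data; the variable names (age, vehicle_value) and the coefficients are invented purely for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 500
age = rng.uniform(18, 80, n)            # hypothetical driver age
vehicle_value = rng.uniform(5, 60, n)   # hypothetical vehicle value, in $1000s
y = 0.05 * age + 2.0 * vehicle_value + rng.normal(0, 5, n)  # simulated response

X = np.column_stack([np.ones(n), age, vehicle_value])
beta, *_ = np.linalg.lstsq(X, y, rcond=None)

# Standardized coefficients: beta_j * sd(X_j) / sd(y) puts the predictors on
# a common scale so their predictive contributions can be compared directly.
std_beta = beta[1:] * X[:, 1:].std(axis=0) / y.std()
print(std_beta)
```

On this simulated data the standardized coefficient for vehicle_value dominates, even though the raw coefficients alone do not make the comparison obvious.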

 

The linear regression model can be extended in several ways. In Generalized Linear Models (GLM), we can maintain the additive linear parametric structure for the contributions of the predictors. However, instead of directly predicting the mean response of the dependent variable, we can predict a function of the mean of the dependent variable. That is,

 

g(E[Y]) = Xβ + ε

 

where g(.) is the link function. Consequently, the influence of X manifests through (Xβ + ε), allowing inference to be conducted under the statistical assumption that Y conforms to an exponential family of distributions. The GLM approach remains parametric but offers greater flexibility than linear regression.
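To make the GLM idea concrete, here is a minimal sketch that fits a Poisson GLM with a log link — a common choice for claim counts — by iteratively reweighted least squares (IRLS), the standard GLM fitting algorithm. The data and true coefficients are simulated for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 2000
x = rng.uniform(0, 2, n)
X = np.column_stack([np.ones(n), x])
true_beta = np.array([0.5, 0.8])
y = rng.poisson(np.exp(X @ true_beta))    # simulated claim counts

# IRLS for a Poisson GLM with log link: g(E[Y]) = log(mu) = X beta
beta = np.array([np.log(y.mean()), 0.0])  # crude starting value
for _ in range(25):
    eta = X @ beta
    mu = np.exp(eta)                 # inverse link
    z = eta + (y - mu) / mu          # working response
    w = mu                           # working weights for Poisson / log link
    XtW = X.T * w
    beta = np.linalg.solve(XtW @ X, XtW @ z)

print(beta)  # close to the true values [0.5, 0.8]
```

Each IRLS step is just a weighted least-squares solve; the link function only changes how the working response and weights are formed.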

 

Another way to ease the assumptions of linear regression models is to loosen the parametric structure on the right-hand side of the equation. This involves substituting the (Xβ + ε) component with a more versatile function of X, resulting in what is known as nonparametric regression. In nonparametric regression, therefore, the model is specified as:

 

Y = F(X) + ε

 

The primary objective here is to find a multivariate function F that fits the data.

 

The third approach employs a model that preserves the additive structure while allowing for more flexibility. This model does not enforce a rigid linear structure of the independent terms to capture the contribution of each variable. The influence of an independent variable Xj on Y is represented by a nonparametric function of Xj – instead of βjXj, the effect of a predictor variable is now represented by a more versatile function fj(Xj). When summed over predictor variables, this approach offers a more organized form of nonparametric regression known as the additive model. The additive model is thus specified as:

 

Y = α + Σj fj(Xj) + ε

 

The functions fj are known as smoothers, since the relationship they capture between Y and Xj is assumed to be smooth and continuous.

 

There are several different types of smoothers available. To circumvent identifiability issues arising from constant terms in fj interfering with the estimation of α, we center the fj and assume E[fj(Xj)] = 0, thereby establishing E[Y] = α. The functions fj are not predetermined, and for simplicity it is common practice to choose these smoothers as univariate functions. The individual functions fj within the additive model can be likened to the coefficients in linear regression, although they are harder to interpret than linear regression coefficients. These smoothers are then combined additively to depict the overall relationship between Y and the X variables.
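The classical algorithm for estimating an additive model is backfitting: cycle through the predictors, smoothing the partial residuals against each Xj in turn, and re-center each fj so that E[fj] = 0. The sketch below uses a crude running-mean smoother on simulated data — any univariate smoother could be substituted.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 1000
x1 = rng.uniform(-3, 3, n)
x2 = rng.uniform(-3, 3, n)
y = np.sin(x1) + 0.25 * x2**2 + rng.normal(0, 0.3, n)  # simulated additive signal

def smooth(x, r, frac=0.1):
    """Running-mean smoother: replace each partial residual by the mean
    of roughly frac*n nearest neighbours along x (via cumulative sums)."""
    k = max(int(frac * len(x)), 5)
    order = np.argsort(x)
    r_sorted = r[order]
    c = np.concatenate([[0.0], np.cumsum(r_sorted)])
    idx = np.arange(len(x))
    lo = np.clip(idx - k // 2, 0, len(x))
    hi = np.clip(idx + k // 2, 0, len(x))
    avg = (c[hi] - c[lo]) / np.maximum(hi - lo, 1)
    out = np.empty_like(r)
    out[order] = avg
    return out

# Backfitting: alpha = mean(Y); cycle over predictors, smoothing the
# partial residuals and re-centering each f_j so that E[f_j] = 0.
alpha = y.mean()
f1 = np.zeros(n)
f2 = np.zeros(n)
for _ in range(20):
    f1 = smooth(x1, y - alpha - f2)
    f1 -= f1.mean()
    f2 = smooth(x2, y - alpha - f1)
    f2 -= f2.mean()

resid = y - alpha - f1 - f2
```

After a few cycles f1 tracks sin(x1) and f2 tracks the centered quadratic in x2, and the residuals shrink toward the noise level.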

 

It is important to note that the relationship between Y and Xj may vary across the range of values of Xj. Hence, we use a spline regression approach where the estimation of fj is defined in a piecewise manner within local neighborhoods of the X values.

 

The primary considerations regarding smoothers entail selecting the type or class of smoothers, determining the size of the local neighborhoods for fitting the relationship, and deciding on the level of smoothness for the globally piecewise-patched-together function.

 

The final extension of linear regression incorporates a link function into the additive model, resulting in what is commonly referred to as the Generalized Additive Model (GAM). Similar to GLMs, GAMs generalize the distribution of the response variable, but they extend the additive sum of predictors to a more versatile specification involving the additive sum of predictor functions. Therefore, a GAM can be stated as:

 

g(E[Y]) = α + Σj fj(Xj)

 

where g(.) is the link function and Y is assumed to belong to an exponential family of distributions. With g(.) being invertible we can rewrite the GAM model as:

 

E[Y] = g⁻¹(α + Σj fj(Xj))

 

Smoothers

 

The previous section highlighted the fact that smoothers are central to estimating additive models. A typical method for solving additive models is to use piecewise polynomial smoothers. In particular, we represent the smooth terms in an additive model using splines. Rather than aiming to understand everything related to spline functions, we can grasp the essence of the theory by exploring certain characteristics of cubic splines.

 

[Figure: a cubic spline (dotted curve) built from cubic sections joined at knots]

 


 

The figure above illustrates a cubic spline: a smooth curve formed by connecting sections of cubic polynomials, joined so that the curve is continuous up to its second derivative. The spline in the figure (the dotted curve) comprises seven cubic sections. The points where these sections meet (displayed as ο), including the two endpoints, are termed knots. Each cubic section has its own coefficients, but at the knots each section agrees with its neighbors in value and in its first two derivatives.
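This knot-continuity property is easy to verify numerically. The sketch below builds a cubic spline in the truncated power basis — a global cubic plus one (x − k)₊³ term per knot, with made-up coefficients — and checks with finite differences that only the third derivative jumps at a knot.

```python
import numpy as np

# Cubic spline in the truncated power basis: a global cubic plus a
# c_j * (x - k_j)_+^3 term per knot. By construction the function and its
# first two derivatives are continuous at every knot; the third derivative
# jumps by 6 * c_j. Knot positions and coefficients here are illustrative.
knots = np.array([-1.0, 0.0, 1.0])
coefs = np.array([0.8, -1.2, 0.5])
poly = np.array([0.1, -0.3, 0.2, 0.05])   # a0 + a1 x + a2 x^2 + a3 x^3

def spline(x):
    x = np.asarray(x, dtype=float)
    out = poly[0] + poly[1] * x + poly[2] * x**2 + poly[3] * x**3
    for c, k in zip(coefs, knots):
        out = out + c * np.maximum(x - k, 0.0) ** 3
    return out

k = knots[1]
h = 1e-3
# One-sided second derivatives agree across the knot (up to O(h))...
d2_left = (spline(k) - 2 * spline(k - h) + spline(k - 2 * h)) / h**2
d2_right = (spline(k + 2 * h) - 2 * spline(k + h) + spline(k)) / h**2
# ...but the one-sided third derivatives differ by exactly 6 * coefs[1]
d3_left = (spline(k) - 3 * spline(k - h) + 3 * spline(k - 2 * h) - spline(k - 3 * h)) / h**3
d3_right = (spline(k + 3 * h) - 3 * spline(k + 2 * h) + 3 * spline(k + h) - spline(k)) / h**3
print(d3_right - d3_left)  # ~ 6 * coefs[1] = -7.2
```

Because each piece is an exact cubic, the one-sided third differences are exact up to floating-point error, making the jump at the knot easy to read off.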

 

Estimating GAMs

 

Estimating a GAM is essentially the task of determining smoothing parameters and model coefficients for a penalized likelihood maximization problem. We select smoothing bases and penalties for each function fj, resulting in model matrices Xj and corresponding smoothing penalties Sj.

 

Let's examine a GAM in which X contains d covariates and there are p smoothing functions fj. Each smoothing function is built using thin-plate regression splines, incorporating a smoothing parameter λj.

 

The thin-plate spline smoothing method estimates each smoothing function fj by minimizing the following over the n observations:

 

minimize over f:  ||y − f||² + λ Jmd(f)

 

where f = [f(x1), f(x2), …, f(xn)]ᵀ and Jmd is a penalty measuring the “wiggliness” of f. The first term assesses how closely the fitted values match the data, and the second term penalizes the fit for lack of smoothness; λ controls the tradeoff between fit and smoothness. When λ is excessively high the data are over-smoothed, while if it is too low the data are insufficiently smoothed.
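The role of λ can be seen in a few lines. The sketch below uses a truncated-power spline basis with a simple ridge penalty on everything beyond the linear part — a crude stand-in for the thin-plate penalty Jmd, not the real thing — and fits the same simulated data with a tiny and a huge λ.

```python
import numpy as np

rng = np.random.default_rng(3)
n = 300
x = np.sort(rng.uniform(0, 1, n))
y = np.sin(2 * np.pi * x) + rng.normal(0, 0.3, n)  # simulated data

# Truncated-power cubic basis: 1, x, x^2, x^3, then (x - k)_+^3 per knot
knots = np.linspace(0, 1, 22)[1:-1]
def basis(t):
    cols = [np.ones_like(t), t, t**2, t**3]
    cols += [np.maximum(t - k, 0.0) ** 3 for k in knots]
    return np.column_stack(cols)

B = basis(x)
S = np.zeros((B.shape[1], B.shape[1]))
S[2:, 2:] = np.eye(B.shape[1] - 2)   # penalize everything beyond the linear part

def fit(lam):
    # Penalized least squares: (B'B + lam * S) beta = B'y
    return np.linalg.solve(B.T @ B + lam * S, B.T @ y)

grid = np.linspace(0, 1, 400)
Bg = basis(grid)
rough = lambda beta: np.sum(np.diff(Bg @ beta, 2) ** 2)  # wiggliness proxy
rss = lambda beta: np.sum((y - B @ beta) ** 2)

# Tiny lambda tracks the data closely; huge lambda collapses toward a line.
b_small, b_large = fit(1e-6), fit(1e6)
```

The two fits illustrate both ends of the tradeoff: the small-λ fit has the lower residual sum of squares, while the large-λ fit has far lower roughness.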

 

Employing the designated penalized least squares criterion alongside a predetermined λ value, the estimate of the smooth function f can be expressed as:

 

f(x) = Σi δi ηmd(||x − xi||) + Σj θj Φj(x),  i = 1, …, n;  j = 1, …, M

 

Here, δ and θ are coefficient vectors, with δ constrained by Tᵀδ = 0, where Tij = Φj(xi) and Φ1, …, ΦM are M linearly independent polynomials.

 

By defining a penalty matrix Eij = ηmd(||xi – xj||), the thin plate spline fitting challenge transforms into:

 

minimize over δ, θ:  ||y − Eδ − Tθ||² + λ δᵀEδ,  subject to Tᵀδ = 0

 

The challenge posed by thin plate splines lies in their computational demands. Except in the single predictor scenario, the computational burden of model estimation scales cubically with the number of parameters.

 

This leads us to investigate whether a low-rank approximation could be generated to closely mimic the thin plate spline smoothing results, while avoiding excessively high computational costs. We can achieve this using thin-plate regression splines.

 

Consider E = UDUᵀ, the eigen-decomposition of E, where D is a diagonal matrix containing the eigenvalues of E arranged in descending order of absolute value, and the columns of U are the corresponding eigenvectors.
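This truncation idea is plain numpy. Below, a stand-in penalty matrix E is built for the one-dimensional case (η(r) proportional to r³ when m = 2, d = 1; constants omitted, so this is illustrative only), eigen-decomposed, and then approximated by retaining only the k leading eigenpairs.

```python
import numpy as np

rng = np.random.default_rng(4)
x = np.sort(rng.uniform(0, 1, 60))

# Stand-in penalty matrix E_ij = eta(||x_i - x_j||), with eta(r) ~ r^3
# for the m = 2, d = 1 case (constants dropped -- illustrative only)
E = np.abs(x[:, None] - x[None, :]) ** 3

# Eigen-decomposition E = U D U^T, eigenvalues ordered by |value| descending
vals, U = np.linalg.eigh(E)
order = np.argsort(-np.abs(vals))
vals, U = vals[order], U[:, order]

def low_rank(k):
    """Rank-k approximation built from the k leading eigenpairs."""
    return U[:, :k] @ np.diag(vals[:k]) @ U[:, :k].T

err = lambda k: np.linalg.norm(E - low_rank(k))
print(err(5), err(20), err(60))  # error shrinks as rank k grows; exact at full rank
```

The Frobenius error of the rank-k reconstruction is determined by the omitted eigenvalues, so ordering by absolute value makes the truncation as accurate as possible for a given k.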

 

Next, let’s denote Uk as the matrix formed by selecting the first k columns of U, and let Dk denote the upper-left k × k submatrix of D. By confining δ within the column space of Uk and representing it as δ = Ukδk, the minimization problem can be restated as:

 

minimize over δk, θ:  ||y − UkDkδk − Tθ||² + λ δkᵀDkδk,  subject to TᵀUkδk = 0

 

This constrained problem can be changed to an unconstrained problem as:

 

minimize over δ̃, θ:  ||y − UkDkZkδ̃ − Tθ||² + λ δ̃ᵀZkᵀDkZkδ̃

 

where Zk is an orthogonal column basis such that TᵀUkZk = 0, and δk = Zkδ̃.

 

Furthermore, if we define

 

X = [UkDkZk : T],  β = (δ̃ᵀ, θᵀ)ᵀ,  S = block-diag(ZkᵀDkZk, 0)

 

The optimization problem can now be simplified to:

 

minimize over β:  ||y − Xβ||² + λ βᵀSβ

 

The estimate of β can be obtained by maximizing the penalized log-likelihood function for a given set of λs:

 

ℓp(β) = ℓ(β) − ½ Σj λj βᵀSjβ

 

where ℓ is the log likelihood and Sj denotes the roughness penalty.
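In practice the λj themselves must also be estimated, commonly by minimizing a criterion such as generalized cross-validation (GCV). A minimal GCV grid search for a single λ, using a penalized least-squares fit of the kind described above on simulated data, might look like:

```python
import numpy as np

rng = np.random.default_rng(5)
n = 200
x = np.sort(rng.uniform(0, 1, n))
y = np.sin(2 * np.pi * x) + rng.normal(0, 0.2, n)  # simulated data

# Truncated-power cubic basis with a ridge penalty beyond the linear part
knots = np.linspace(0, 1, 12)[1:-1]
X = np.column_stack([np.ones(n), x, x**2, x**3] +
                    [np.maximum(x - k, 0.0) ** 3 for k in knots])
S = np.zeros((X.shape[1], X.shape[1]))
S[2:, 2:] = np.eye(X.shape[1] - 2)

def gcv(lam):
    # Hat matrix A(lam) maps y to fitted values; tr(A) is the effective
    # degrees of freedom. GCV(lam) = n * RSS / (n - tr A)^2.
    A = X @ np.linalg.solve(X.T @ X + lam * S, X.T)
    rss = np.sum((y - A @ y) ** 2)
    return n * rss / (n - np.trace(A)) ** 2

grid = 10.0 ** np.arange(-8.0, 5.0)
best = min(grid, key=gcv)
print(best, gcv(best))
```

GCV penalizes both lack of fit (through the RSS) and model complexity (through the effective degrees of freedom), so the selected λ balances the two.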

 

GAM Node in SAS Dynamic Actuarial Modeling

 

The GAM node in SAS Dynamic Actuarial Modeling solution pipelines accommodates a generalized additive model designed for a binary or interval target variable, with a specified target distribution and link function. It is available in all SAS Dynamic Actuarial Modeling advanced templates. The GAM node shields actuarial analysts from the mathematical complexities of the estimation process illustrated earlier.

 

The GAM node seamlessly incorporates all interval input variables as univariate splines. As far as class variables are concerned, we have the option to exclude them from the analysis. If included, then we can convert them to design (dummy) variables using either “GLM” or “Deviation” coding styles.
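The two coding styles differ only in how the reference level is represented. The sketch below illustrates this with a hypothetical three-level class variable (vehicle_type); it takes the last level as the reference, which matches common practice, though the node's exact conventions may differ.

```python
import numpy as np

# Hypothetical class variable with three levels
levels = ["sedan", "suv", "truck"]
obs = ["suv", "sedan", "truck", "truck"]

def glm_coding(values, levels):
    """GLM (reference) coding: one 0/1 column per non-reference level;
    the last level acts as the reference and is coded all zeros."""
    return np.array([[1.0 if v == l else 0.0 for l in levels[:-1]]
                     for v in values])

def deviation_coding(values, levels):
    """Deviation (effects) coding: like GLM coding, except the reference
    level is coded -1 in every column, so effects are measured against
    the overall mean rather than the reference level."""
    rows = []
    for v in values:
        if v == levels[-1]:
            rows.append([-1.0] * (len(levels) - 1))
        else:
            rows.append([1.0 if v == l else 0.0 for l in levels[:-1]])
    return np.array(rows)

print(glm_coding(obs, levels))
print(deviation_coding(obs, levels))
```

Both codings produce the same fitted values; they differ only in how the class-level parameters are interpreted.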

 

Additionally, it offers two model selection methods – boosting and shrinkage – to effectively manage and minimize the number of effects in the model. The boosting method selects and estimates effects using an adaptation of the gradient descent method, and the resulting model can consist of only parametric effects, only spline effects, or a combination of both.

 

On the other hand, the shrinkage method performs the selection only on the spline terms in the model. The method mandates the inclusion of at least one spline term and employs a grid search to screen the tuning parameters for the sparsity-inducing penalties.

 

15_SoumitraDas_bl02_2024_GAM04.png

 

For more information about the GAM node, see Overview of GAM in SAS Viya: Machine Learning Node Reference.

 

Challenges and limitations of using GAMs in Insurance Modeling

 

One significant challenge is the selection of smoothing parameters, which control the model’s flexibility. Excessive smoothing may result in an oversimplified model, while insufficient smoothing can lead to excessive complexity.

 

Moreover, the interpretability of GAMs poses a dilemma. The “black box” nature of these models often obscures the logic behind predictions, making it difficult for analysts to explain outcomes to stakeholders.

 

The SAS Dynamic Actuarial Modeling solution employs tools such as Variable Importance tables, Partial Dependence plots, and Individual Conditional Expectation (ICE) plots to mitigate this interpretability quandary.

 

The flexibility of GAMs, though advantageous, also raises the risk of overfitting. Techniques such as regularization and cross-validation are therefore crucial for preventing it.

 

Compliance with industry standards is often a non-negotiable aspect of insurance modeling. GAMs must adhere to regulatory requirements, which include model validation and documentation. Ethical considerations also come into play, as models must avoid unfair discrimination in predictions. Ensuring that GAMs meet these standards without compromising their predictive power is a complex challenge.

 

Additional Information

 

For more information on SAS Dynamic Actuarial Modeling visit the software information page here.

 

For more information on curated learnings paths on SAS Solutions and SAS Viya, visit the SAS Training page. You can also browse the catalog of SAS courses here.

 

 

Find more articles from SAS Global Enablement and Learning here.
