Generalized Linear Models (GLM) – Healthcare Economist

The generalized linear model (GLM) is a flexible generalization of ordinary least squares regression. OLS restricts the regression coefficients to have a constant effect on the dependent variable. GLM allows for the this effect to vary along the range of the explanatory variables.

The basic structure of GLM estimator is as follows:

g(Y) = Xβ + ε
E(Y) = μ = g^-1(Xβ)

To estimate the model, one needs three components:

Random component, specifying the conditional distribution of the
response variable, given the explanatory variables. Typically, this distribution is from the exponential family.
A linear predictor which is a linear function of the regressors: η = β₀ + β₁X₁ +…+ β_kX_k = Xβ
A link function which transforms the expectation of the response to the linear predictor. In other words, the link function describes the relationship between the linear predictor and the mean of the distribution function. The link function must be invertible.

The table below lists commonly used link functions and their inverse: (source)

Link	η_i=g(μ_i)	μ_i=g^-1(η_i)
Identity	μ_i	η_i
Log	ln(μ_i)	exp(η_i)
Inverse	μ_i^-1	η_i^-1
Inverse-Square	μ_i^-2	η_i^-0.5
Square Root	μ_i^0.5	η_i²
Logit	ln[μ_i/(1- μ_i)]	exp(η_i)/[1+ exp(η_i)]
Probit	Φ^-1(μ_i)	Φ(η_i)
Log-log	-ln[-ln(μ_i)]	exp[-exp(-η_i)]

To estimate the coefficients for a GLM model, most researchers use a maximum likelihood method although Bayesian approaches can also be used. In Stata, the glm command estimates the coefficients from a generalized linear model. In SAS the procedure GENMOD can be used.

1 Comment

1 Comment

Leave a Reply Cancel reply