How to analyze balanced and unbalanced panel data using SAS

pablas29 · Posted 03-16-2021 12:18 PM

I have panel data and I ran the below SAS code to get the coefficients.

PROC GLIMMIX DATA=data1 ;
CLASS pid year;
MODEL employed(event='Yes')= age / SOLUTION ;
RANDOM INTERCEPT/ SUBJECT = pid ;
RUN;

PROC PANEL DATA=data1;
MODEL employed = age /RANone ;
ID pid year;
RUN;

I tried doing a similar thing using Stata as below but the results between SAS and Stata output are different.

xtset pid year
xtlogit employed age

I am not sure which is the correct result? Also, do I need to add any option when running similar code on unbalanced panel data?

Balanced data panel example:

pid	year	age	employed
1	2001	33	yes
1	2002	34	no
1	2003	35	yes
2	2001	23	yes
2	2002	24	yes
2	2003	25	yes

Cross-posted.

StatDave · Posted 03-16-2021 12:53 PM

First, since your response is binary, you should specify DIST=BINARY or BINOMIAL in the MODEL statement in GLIMMIX. However, there are many ways to analyze repeated measures/panel data like this. The random effects model is one way. Another is the Generalized Estimating Equations (GEE) model, available in PROC GEE or PROC GENMOD. There is also the conditional logistic model available in PROC LOGISTIC with the STRATA statement. See this note regarding types of logistic models.

How to analyze balanced and unbalanced panel data using SAS

Re: How to analyze balanced and unbalanced panel data using SAS

SAS Innovate 2025: Call for Content