Convergence problems are quite common with models fit by iterative optimization methods such as maximum likelihood or GEE (as in this case) and such problems can happen in many possible ways which depend on the data and model. Convergence of models fit by iterative methods can never be guaranteed. The cause in any particular case cannot typically be determined from examining the model specification or data. A solution often must be found by experimentation. The most helpful strategy is usually to simplify the model (that is, reduce the number of model parameters) in some acceptable way such as by removing higher-order effects such as interactions, removing predictors, or by dropping or merging categories of CLASS variables. In general, the more parameters there are in the model, the more likely convergence problems become.
In categorical response models like this, sparseness of the data is common and can cause various fitting errors, though this is not the only possible cause. Generally, as model complexity increases and sample size decreases, the problem becomes more sparse and more likely to result in convergence problems. So again, model simplification is generally required. Starting with a simple model and adding variables as they can be supported is often a good strategy.
Note that GEE models can also be fit in the newer PROC GEE using similar syntax. You could try it since any variation in the fitting algorithm of an iterative method like GEE can potentially affect convergence. PROC GEE is a newer procedure specifically for fitting the GEE model and is the recommended procedure when fitting that model. It adds support for nominal multinomial response data and data that are missing at random and might perform better in some respects.
... View more