The GEE Procedure

Type 3 Analysis

A Type 3 analysis is similar to the Type 3 sums of squares used in PROC GLM, except that generalized score tests for Type 3 contrasts instead of Type 3 sums of squares are computed. Briefly, a Type 3 estimable function (contrast) for an effect is a linear function of the model parameters that involves the parameters of the effect and any interactions with that effect. A test of the hypothesis that the Type 3 contrast for a main effect is equal to 0 is intended to test the significance of the main effect in the presence of interactions. For more information about Type 3 estimable functions, see ChapterĀ 53, The GLM Procedure, and ChapterĀ 16, The Four Types of Estimable Functions. Also see Littell, Freund, and Spector (1991).

Boos (1992) and Rotnitzky and Jewell (1990) describe score tests applicable to testing bold upper L prime bold-italic beta equals bold 0 in GEEs, where bold upper L prime is a user-specified r times p contrast matrix or a contrast for a Type 3 test of hypothesis.

Let bold-italic beta overTilde be the regression parameters that result from solving the GEE under the restricted model bold upper L prime bold-italic beta equals bold 0, and let bold upper S left-parenthesis bold-italic beta overTilde right-parenthesis be the generalized estimating equation values at bold-italic beta overTilde.

The generalized score statistic is

upper T equals bold upper S left-parenthesis bold-italic beta overTilde right-parenthesis prime bold upper Sigma Subscript m Baseline bold upper L left-parenthesis bold upper L prime bold upper Sigma Subscript e Baseline bold upper L right-parenthesis Superscript negative 1 Baseline bold upper L prime bold upper Sigma Subscript m Baseline bold upper S left-parenthesis bold-italic beta overTilde right-parenthesis

where bold upper Sigma Subscript m is the model-based covariance estimate and bold upper Sigma Subscript e is the empirical covariance estimate. The p-values for T are computed based on the chi-square distribution with r degrees of freedom, where r is the rank of bold upper L.

A Type 3 analysis can consume considerable computation time because a constrained model is fitted for each effect. Wald statistics for Type 3 contrasts are computed if you specify the WALD option. Wald statistics for contrasts use less computation time than likelihood ratio statistics but might be less accurate indicators of the significance of the effect of interest. The Wald statistic for testing bold upper L prime bold-italic beta equals bold 0 is defined by

upper S equals left-parenthesis bold upper L prime ModifyingAbove bold-italic beta With caret right-parenthesis prime left-parenthesis bold upper L prime bold upper Sigma Subscript e Baseline bold upper L right-parenthesis Superscript negative 1 Baseline left-parenthesis bold upper L prime ModifyingAbove bold-italic beta With caret right-parenthesis

where bold upper L is the contrast matrix, bold-italic beta are the GEE parameter estimates, and bold upper Sigma Subscript e is the empirical covariance estimate. The asymptotic distribution of S is chi-square with r degrees of freedom, where r is the rank of bold upper L.

The results of this type of analysis do not depend on the order in which the terms are specified in the MODEL statement. Type 3 analyses that use score statistics are not supported for nominal response data or weighted GEE methods. Type 3 analyses can be conducted using the Wald statistics for all the models that the GEE procedure supports.

The preceding development for Type 3 tests assumes that the rank of the empirical covariance matrix bold upper Sigma Subscript e is not less than the row rank of the contrast matrix bold upper L. When the rank of bold upper Sigma Subscript e is less than the row rank of bold upper L, estimability of the function is not sufficient to ensure that the chi-square test statistic has a unique value no matter what kind of generalized inverse is used to compute left-parenthesis bold upper L prime bold upper Sigma Subscript e Baseline bold upper L right-parenthesis Superscript minus.

Although it is extremely rare, it is possible in practice that the uniqueness condition is not satisfied. For example, if the number of clusters is less than the number of nonsingular parameters in the model, then the matrix of coefficients for testing the overall null does not satisfy the uniqueness condition. If this condition is not satisfied, then the chi-square statistic for testing upper H colon bold upper L bold-italic beta equals bold 0 is not invariant to the choice of the g 2-inverse of bold upper L bold upper Sigma Subscript e Baseline bold upper L prime. This chi-square test is not recommended when the uniqueness condition is not satisfied. An alternative approach would be to increase the number of clusters or to find a parsimonious model so that the number of parameters is less than the number of clusters. When the rank of bold upper Sigma Subscript e is less than the row rank of bold upper L for a test, the procedure prints a note to the SAS log.

Last updated: December 09, 2022