The PHREG Procedure

Diagnostics Based on Weighted Residuals

ZPH Diagnostics

The vector of weighted Schoenfeld residuals, bold r Subscript i, is computed as

bold r Subscript i Baseline equals n Subscript e Baseline script upper I Superscript negative 1 Baseline left-parenthesis ModifyingAbove bold-italic beta With caret right-parenthesis ModifyingAbove bold upper U With caret Subscript i Baseline left-parenthesis t Subscript i Baseline right-parenthesis

where n Subscript e is the total number of events and ModifyingAbove bold upper U With caret Subscript i Baseline left-parenthesis t Subscript i Baseline right-parenthesis is the vector of Schoenfeld residuals at event time t Subscript i. The components of bold r Subscript i are output to the WTRESSCH= variables in the OUTPUT statement.

The weighted Schoenfeld residuals are useful in assessing the proportional hazards assumption. The idea is that most of the common alternatives to the proportional hazards can be cast in terms of a time-varying coefficient model,

lamda left-parenthesis t comma bold upper Z right-parenthesis equals lamda 0 left-parenthesis t right-parenthesis exp left-parenthesis beta 1 left-parenthesis t right-parenthesis upper Z 1 plus beta 2 left-parenthesis t right-parenthesis upper Z 2 plus midline-horizontal-ellipsis right-parenthesis

where lamda left-parenthesis t comma bold upper Z right-parenthesis and lamda 0 left-parenthesis t right-parenthesis are hazard rates. Let ModifyingAbove beta With caret Subscript j and r Subscript i j be the jth component of ModifyingAbove bold-italic beta With caret and bold r Subscript i, respectively. Grambsch and Therneau (1994) suggest using a smoothed plot of (ModifyingAbove beta With caret Subscript j Baseline plus r Subscript i j) versus t Subscript i to discover the functional form of the time-varying coefficient beta Subscript j Baseline left-parenthesis t right-parenthesis. A zero slope indicates that the coefficient does not vary with time.

DFBETA Diagnostics

The weighted score residuals are used more often than their unscaled counterparts in assessing local influence. Let ModifyingAbove bold-italic beta With caret Subscript left-parenthesis i right-parenthesis be the estimate of bold-italic beta when the ith subject is left out, and let delta ModifyingAbove bold-italic beta With caret Subscript i Baseline equals ModifyingAbove bold-italic beta With caret minus ModifyingAbove bold-italic beta With caret Subscript left-parenthesis i right-parenthesis. The jth component of delta ModifyingAbove bold-italic beta With caret Subscript i can be used to assess any untoward effect of the ith subject on ModifyingAbove beta With caret Subscript j. The exact computation of delta ModifyingAbove bold-italic beta With caret Subscript i involves refitting the model each time a subject is omitted. Cain and Lange (1984) derived the following approximation of bold upper Delta Subscript i as weighted score residuals:

delta ModifyingAbove bold-italic beta With caret Subscript i Baseline equals script upper I Superscript negative 1 Baseline left-parenthesis ModifyingAbove bold-italic beta With caret right-parenthesis ModifyingAbove bold upper L With caret Subscript i

Here, ModifyingAbove bold upper L With caret Subscript i is the vector of the score residuals for the ith subject. Values of delta ModifyingAbove bold-italic beta With caret Subscript i are output to the DFBETA= variables. Again, when the counting process MODEL specification is used, the DFBETA= variables contain the component script upper I Superscript negative 1 Baseline left-parenthesis ModifyingAbove bold-italic beta With caret right-parenthesis left-parenthesis bold upper L Subscript i Baseline left-parenthesis ModifyingAbove bold-italic beta With caret comma t 2 right-parenthesis minus bold upper L Subscript i Baseline left-parenthesis ModifyingAbove bold-italic beta With caret comma t 1 right-parenthesis right-parenthesis, where the score process bold upper L Subscript i Baseline left-parenthesis bold-italic beta comma t right-parenthesis is defined in the section Residuals. The vector delta ModifyingAbove bold-italic beta With caret Subscript i for the ith subject can be obtained by summing these components within the subject.

Note that these DFBETA statistics are a transform of the score residuals. In computing the robust sandwich variance estimators of Lin and Wei (1989) and Wei, Lin, and Weissfeld (1989), it is more convenient to use the DFBETA statistics than the score residuals (see Example 92.10).

Last updated: March 08, 2022