The PHREG Procedure

Influence of Observations on Overall Fit of the Model

The LD statistic approximates the likelihood displacement, which is the amount by which minus twice the log likelihood (minus 2 log upper L left-parenthesis ModifyingAbove bold-italic beta With caret right-parenthesis), under a fitted model, changes when each subject in turn is left out. When the ith subject is omitted, the likelihood displacement is

2 log upper L left-parenthesis ModifyingAbove bold-italic beta With caret right-parenthesis minus 2 log upper L left-parenthesis ModifyingAbove bold-italic beta With caret Subscript left-parenthesis i right-parenthesis Baseline right-parenthesis

where ModifyingAbove bold-italic beta With caret Subscript left-parenthesis i right-parenthesis is the vector of parameter estimates obtained by fitting the model without the ith subject. Instead of refitting the model without the ith subject, Pettitt and Bin Daud (1989) propose that the likelihood displacement for the ith subject be approximated by

normal upper L normal upper D Subscript i Baseline equals ModifyingAbove bold upper L With caret prime Subscript i Baseline script upper I Superscript negative 1 Baseline left-parenthesis ModifyingAbove bold-italic beta With caret right-parenthesis ModifyingAbove bold upper L With caret Subscript i

where ModifyingAbove bold upper L With caret Subscript i is the score residual vector of the ith subject. This approximation is output to the LD= variable.

The LMAX statistic is another global influence statistic. This statistic is based on the symmetric matrix

bold upper B equals bold upper L script upper I Superscript negative 1 Baseline left-parenthesis ModifyingAbove bold-italic beta With caret right-parenthesis bold upper L prime

where bold upper L is the matrix with rows that are the score residual vectors ModifyingAbove bold upper L With caret Subscript i. The elements of the eigenvector associated with the largest eigenvalue of the matrix bold upper B, standardized to unit length, give a measure of the sensitivity of the fit of the model to each observation in the data. The influence of the ith subject on the global fit of the model is proportional to the magnitude of zeta Subscript i, where zeta Subscript i is the ith element of the vector bold-italic zeta that satisfies

bold upper B bold-italic zeta equals lamda Subscript max Baseline bold-italic zeta normal a normal n normal d bold-italic zeta prime bold-italic zeta equals 1

with lamda Subscript max being the largest eigenvalue of bold upper B. The sign of zeta Subscript i is irrelevant, and its absolute value is output to the LMAX= variable.

When the counting process MODEL specification is used, the LD= and LMAX= variables are set to missing, because these two global influence statistics can be calculated on a per-subject basis only.

Last updated: December 09, 2022