In fitting a Cox model, the phenomenon of monotone likelihood is observed when the likelihood converges to a finite value while at least one parameter estimate diverges to infinity (Mukhopadhyay 2020; Heinze and Schemper 2001).
Firth (1993) recommended using the penalized likelihood

$$L^*(\boldsymbol{\beta}) = L(\boldsymbol{\beta})\,\big|I(\boldsymbol{\beta})\big|^{1/2}$$

to reduce the first-order bias in estimating the canonical parameters of an exponential family model, where $L(\boldsymbol{\beta})$ and $I(\boldsymbol{\beta})$ are the unpenalized likelihood and information matrix, respectively.
Heinze (1999) and Heinze and Schemper (2001) applied the idea of Firth (1993) by maximizing the penalized partial log likelihood

$$l^*(\boldsymbol{\beta}) = l(\boldsymbol{\beta}) + \frac{1}{2}\log\big|I(\boldsymbol{\beta})\big|$$

to obtain estimates of the regression parameters when a monotone likelihood is observed.
The score function $U(\boldsymbol{\beta})$ is replaced by the penalized score function

$$U^*(\boldsymbol{\beta}) = U(\boldsymbol{\beta}) + a(\boldsymbol{\beta})$$

where the $r$th element of $a(\boldsymbol{\beta})$ is

$$a_r(\boldsymbol{\beta}) = \frac{1}{2}\,\mathrm{tr}\left\{I(\boldsymbol{\beta})^{-1}\,\frac{\partial I(\boldsymbol{\beta})}{\partial \beta_r}\right\}$$

The Firth estimate is obtained iteratively as

$$\boldsymbol{\beta}^{(s+1)} = \boldsymbol{\beta}^{(s)} + I^{-1}\big(\boldsymbol{\beta}^{(s)}\big)\,U^*\big(\boldsymbol{\beta}^{(s)}\big)$$
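To make the iterative update concrete, the following is a minimal sketch of Firth's iteration for a single binomial proportion rather than the Cox partial likelihood; the model, function name, and starting value are illustrative. With canonical parameter $\beta = \mathrm{logit}(p)$, the information is $I(\beta) = m\,p(1-p)$ and the penalized score is $U^*(\beta) = (y - mp) + \tfrac{1}{2}(1 - 2p)$, so the Firth estimate has the known closed form $\hat{p} = (y + 0.5)/(m + 1)$, which the iteration can be checked against.

```python
import math

def firth_binomial(y, m, beta=0.0, tol=1e-10, max_iter=100):
    """Iterate beta <- beta + I(beta)^{-1} U*(beta) until the step is tiny.

    Illustrative one-parameter example (binomial proportion), not the
    survey-weighted Cox case described in the text.
    """
    for _ in range(max_iter):
        p = 1.0 / (1.0 + math.exp(-beta))
        info = m * p * (1.0 - p)                          # Fisher information I(beta)
        score_star = (y - m * p) + 0.5 * (1.0 - 2.0 * p)  # penalized score U*(beta)
        step = score_star / info
        beta += step
        if abs(step) < tol:
            break
    return beta

# y = 0 produces a monotone likelihood (the unpenalized MLE is beta = -infinity),
# but the Firth estimate is finite:
beta_hat = firth_binomial(y=0, m=10)
p_hat = 1.0 / (1.0 + math.exp(-beta_hat))
# p_hat matches the closed form (0 + 0.5) / (10 + 1)
```

The $y = 0$ case mirrors the monotone-likelihood situation described above: the unpenalized score never vanishes at a finite parameter value, while the penalty term pulls the estimate back to a finite point.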
Although the estimated regression parameters, $\hat{\boldsymbol{\beta}}$, are obtained by maximizing the penalized partial likelihood, the Taylor series linearized variance estimator uses the score residuals and the information matrix from the unpenalized likelihood, evaluated at $\hat{\boldsymbol{\beta}}$. For more information, see the section Taylor Series Linearization.
The replication variance estimation methods use the replicated version of the penalized score function to obtain replicate estimates, $\hat{\boldsymbol{\beta}}^{(r)}$, of the regression parameters. The replicate estimates are then used in the replication variance estimation, as described in the sections Balanced Repeated Replication (BRR) Method, Bootstrap Method, Jackknife Method, and Replicate Weights Method.
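Once the replicate estimates are in hand, the replication variance estimators share a common structure. The sketch below is a generic illustration, not the procedure's implementation: it assumes a multiplier $c$ that depends on the method (for example, $c = 1/R$ for BRR and $c = (R-1)/R$ for a delete-one jackknife), and the function name and example numbers are hypothetical.

```python
import numpy as np

def replication_variance(replicates, full_estimate, multiplier):
    """Generic replication variance: c * sum_r (b_r - b)(b_r - b)'.

    `replicates` is an (R x p) array of replicate estimates, `full_estimate`
    the length-p full-sample estimate, and `multiplier` the method-specific
    constant c (assumption: the method's standard multiplier is supplied).
    """
    replicates = np.asarray(replicates, dtype=float)
    dev = replicates - np.asarray(full_estimate, dtype=float)
    return multiplier * dev.T @ dev

# Example with 4 replicate estimates of a 2-parameter vector and a
# delete-one-jackknife-style multiplier (R - 1)/R = 3/4:
reps = [[0.9, 1.1], [1.1, 0.9], [1.0, 1.0], [1.0, 1.2]]
v = replication_variance(reps, [1.0, 1.05], multiplier=3 / 4)
```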
Mukhopadhyay (2020) recommended using normalized weights to construct the penalized log partial likelihood for weighted data. Using the notation in the sections Notation and Estimation and Partial Likelihood Function for the Cox Model, the Breslow unpenalized log partial likelihood is given by

$$l(\boldsymbol{\beta}) = \sum_{h}\sum_{i}\sum_{j} \tilde{w}_{hij}\,\Delta_{hij}\left[\mathbf{x}_{hij}'\boldsymbol{\beta} - \log\left(\sum_{(h',i',j')\in\mathcal{R}(t_{hij})} \tilde{w}_{h'i'j'}\exp\big(\mathbf{x}_{h'i'j'}'\boldsymbol{\beta}\big)\right)\right]$$

where $\tilde{w}_{hij} = n\,w_{hij} \big/ \sum_{h}\sum_{i}\sum_{j} w_{hij}$ is the normalized weight, $n$ is the number of observation units, and $w_{hij}$ is the weight for unit $j$ in PSU $i$ and stratum $h$.
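The normalization above simply rescales the sampling weights so that they sum to the number of observation units $n$. A minimal sketch (the function name and example weights are illustrative):

```python
def normalize_weights(weights):
    """Rescale weights so they sum to n, the number of observation units."""
    n = len(weights)
    total = sum(weights)
    return [n * w / total for w in weights]

w_tilde = normalize_weights([2.0, 3.0, 5.0])
# sum(w_tilde) == 3, the number of observation units
```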
Denote

$$S^{(0)}(\boldsymbol{\beta},t) = \sum_{(h,i,j)\in\mathcal{R}(t)} \tilde{w}_{hij}\exp\big(\mathbf{x}_{hij}'\boldsymbol{\beta}\big), \quad S^{(1)}(\boldsymbol{\beta},t) = \sum_{(h,i,j)\in\mathcal{R}(t)} \tilde{w}_{hij}\,\mathbf{x}_{hij}\exp\big(\mathbf{x}_{hij}'\boldsymbol{\beta}\big), \quad S^{(2)}(\boldsymbol{\beta},t) = \sum_{(h,i,j)\in\mathcal{R}(t)} \tilde{w}_{hij}\,\mathbf{x}_{hij}\mathbf{x}_{hij}'\exp\big(\mathbf{x}_{hij}'\boldsymbol{\beta}\big)$$

Then the score function is given by

$$U(\boldsymbol{\beta}) = \sum_{h}\sum_{i}\sum_{j} \tilde{w}_{hij}\,\Delta_{hij}\left\{\mathbf{x}_{hij} - \frac{S^{(1)}(\boldsymbol{\beta},t_{hij})}{S^{(0)}(\boldsymbol{\beta},t_{hij})}\right\}$$

and the Fisher information matrix is given by

$$I(\boldsymbol{\beta}) = \sum_{h}\sum_{i}\sum_{j} \tilde{w}_{hij}\,\Delta_{hij}\left\{\frac{S^{(2)}(\boldsymbol{\beta},t_{hij})}{S^{(0)}(\boldsymbol{\beta},t_{hij})} - \frac{S^{(1)}(\boldsymbol{\beta},t_{hij})\,S^{(1)}(\boldsymbol{\beta},t_{hij})'}{S^{(0)}(\boldsymbol{\beta},t_{hij})^2}\right\}$$
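The score and information sums can be sketched numerically as follows. This is an illustrative implementation under simplifying assumptions, not the procedure's own code: the stratum/PSU/unit indices are flattened into one index, ties are ignored, and the function name and example data are hypothetical.

```python
import numpy as np

def breslow_score_info(t, d, X, w, beta):
    """Weighted Breslow score U(beta) and information I(beta).

    t: event/censoring times; d: event indicators (Delta); X: (n x p)
    covariates; w: normalized weights; beta: length-p parameter vector.
    """
    t, d, w = map(np.asarray, (t, d, w))
    X = np.asarray(X, dtype=float)
    p = X.shape[1]
    U = np.zeros(p)
    I = np.zeros((p, p))
    for j in np.flatnonzero(d):            # sum over event times only
        risk = t >= t[j]                   # risk set R(t_j)
        e = w[risk] * np.exp(X[risk] @ beta)
        s0 = e.sum()                       # S^(0)(beta, t_j)
        s1 = e @ X[risk]                   # S^(1)(beta, t_j)
        s2 = X[risk].T @ (e[:, None] * X[risk])   # S^(2)(beta, t_j)
        xbar = s1 / s0
        U += w[j] * (X[j] - xbar)
        I += w[j] * (s2 / s0 - np.outer(xbar, xbar))
    return U, I

# Tiny example: one event among two units, one covariate, unit weights.
U, I = breslow_score_info(t=[1.0, 2.0], d=[1, 0], X=[[1.0], [2.0]],
                          w=[1.0, 1.0], beta=np.zeros(1))
```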
Denote