The SURVEYPHREG Procedure

Contrasts and Hypothesis Tests

For a testable hypothesis upper H 0 colon bold upper L bold-italic beta equals 0, you can request different Wald tests by using the DF= option in the MODEL statement.

Let

upper Q equals left-parenthesis bold upper L Superscript asterisk Baseline ModifyingAbove bold-italic beta With caret right-parenthesis prime left-parenthesis bold upper L Superscript asterisk Baseline ModifyingAbove bold upper V With caret bold upper L Superscript asterisk Baseline prime right-parenthesis Superscript minus Baseline left-parenthesis bold upper L Superscript asterisk Baseline ModifyingAbove bold-italic beta With caret right-parenthesis

where bold upper L is a contrast vector or matrix that you specify, bold-italic beta is the vector of regression parameters, ModifyingAbove bold-italic beta With caret is the estimated regression coefficients, ModifyingAbove bold upper V With caret is the estimated covariance matrix of ModifyingAbove bold-italic beta With caret, and bold upper L Superscript asterisk is a matrix such that the following are true:

  • bold upper L Superscript asterisk has the same number of columns as bold upper L.

  • bold upper L Superscript asterisk has full row rank.

  • The rank of bold upper L Superscript asterisk equals the rank of the bold upper L matrix.

  • All rows of bold upper L Superscript asterisk are estimable functions.

  • The Wald F statistic that is computed by using the bold upper L Superscript asterisk matrix is equivalent to the Wald F statistic computed by using the bold upper L matrix.

If bold upper L is a full-rank matrix and all rows of bold upper L are estimable functions, then bold upper L Superscript asterisk is the same as bold upper L. It is possible that such an bold upper L Superscript asterisk matrix cannot be constructed for a given set of linear contrasts, in which case the contrasts are not testable. Let r be the rank of bold upper L ModifyingAbove bold upper V With caret bold upper L prime. Table 9 describes the Wald tests available in PROC SURVEYPHREG.

Table 9: Summary of Wald Tests

Numerator Denominator
Value of DF= Test Request Test Statistic Degrees of Freedom Degrees of Freedom
NONE Chi-square Q r normal infinity
v Customized F vQ/rd r v
DESIGN Unadjusted F Q/r r d
DESIGN (v) Unadjusted F Q/r r v
PARMADJ Adjusted F (dr+1)Q/rd r dr+1
PARMADJ (v) Adjusted F (vr+1)Q/rv r vr+1
DESIGNADJ Adjusted F Q/r r d


The preceding development for Wald tests assumes that the estimated variance of ModifyingAbove bold-italic beta With caret, ModifyingAbove bold upper V With caret, is of the form bold upper X prime bold upper V Superscript negative 1 Baseline bold upper X for some estimate bold upper V of the variance of bold upper Y. In this case, estimability, bold upper L left-parenthesis bold upper X prime bold upper X right-parenthesis Superscript minus Baseline left-parenthesis bold upper X prime bold upper X right-parenthesis equals bold upper L, ensures that this F statistic has a unique value no matter which kind of generalized inverse is used to compute it. However, when a design-based variance estimator is used to estimate the variability of ModifyingAbove bold-italic beta With caret, estimability does not ensure uniqueness. In this case, the F value is invariant to the choice of the generalized inverse if and only if bold upper L is estimable and bold upper L prime left-bracket bold upper L ModifyingAbove bold upper V With caret bold upper L prime right-bracket Superscript minus Baseline left-bracket bold upper L ModifyingAbove bold upper V With caret bold upper L prime right-bracket equals bold upper L prime.

Although it is extremely rare, it is possible in practice that the preceding uniqueness condition is not satisfied. For example, if the number of clusters is less than the number of nonsingular parameters in the model, then the matrix of coefficients for testing the overall null does not satisfy the uniqueness condition. If this condition is not satisfied, then the F statistic for testing upper H colon bold upper L bold-italic beta equals bold 0 is not invariant to the choice of the g 2-inverse of bold upper L ModifyingAbove bold upper V With caret bold upper L prime. In practical applications, the test statistic is compared with an F distribution as described in Table 9, but the value of the test statistic and therefore the inference might be different when a different g 2-inverse is used. This F test is not recommended when the uniqueness condition is not satisfied. An alternative approach would be to increase the number of clusters or to find a parsimonious model so that the number of parameters is less than the number of clusters.

Last updated: March 08, 2022