The SURVEYREG Procedure

Testing

Testing Effects

For each effect in the model, PROC SURVEYREG computes an bold upper L matrix such that every element of bold upper L bold-italic beta is estimable; the bold upper L matrix has the maximum possible rank that is associated with the effect. To test the effect, the procedure uses the Wald F statistic for the hypothesis upper H 0 colon bold upper L bold-italic beta equals 0. The Wald F statistic equals

upper F Subscript Wald Baseline equals StartFraction left-parenthesis bold upper L ModifyingAbove bold-italic beta With caret right-parenthesis prime left-parenthesis bold upper L ModifyingAbove bold upper V With caret bold upper L prime right-parenthesis Superscript minus Baseline left-parenthesis bold upper L ModifyingAbove bold-italic beta With caret right-parenthesis Over rank left-parenthesis bold upper L ModifyingAbove bold upper V With caret bold upper L prime right-parenthesis EndFraction

with numerator degrees of freedom equal to normal r normal a normal n normal k left-parenthesis bold upper L ModifyingAbove bold upper V With caret bold upper L prime right-parenthesis.

In the Taylor series method, the denominator degrees of freedom is equal to the number of clusters minus the number of strata (unless you specify the denominator degrees of freedom with the DF= option in the MODEL statement). For details about denominator degrees of freedom in replication methods, see the section Denominator Degrees of Freedom. It is possible that the bold upper L matrix cannot be constructed for an effect, in which case that effect is not testable. For more information about how the matrix bold upper L is constructed, see the discussion in ChapterĀ 16, The Four Types of Estimable Functions.

You can use the TEST statement to perform F tests that test Type I, Type II, or Type III hypotheses. For details about the syntax of the TEST statement, see the section TEST Statement in ChapterĀ 20, Shared Concepts and Topics.

Contrasts

You can use the CONTRAST statement to perform custom hypothesis tests. If the hypothesis is testable in the univariate case, the Wald F statistic for upper H 0 colon bold upper L bold-italic beta equals 0 is computed as

upper F Subscript Wald Baseline equals StartFraction left-parenthesis bold upper L Subscript Full Baseline ModifyingAbove bold-italic beta With caret right-parenthesis prime left-parenthesis bold upper L Subscript Full Baseline ModifyingAbove bold upper V With caret bold upper L prime Subscript Full right-parenthesis Superscript minus Baseline left-parenthesis bold upper L Subscript Full Baseline ModifyingAbove bold-italic beta With caret right-parenthesis Over rank left-parenthesis bold upper L right-parenthesis EndFraction

where bold upper L is the contrast vector or matrix you specify, bold-italic beta is the vector of regression parameters, ModifyingAbove bold-italic beta With caret equals bold left-parenthesis bold upper X prime bold upper W bold upper X bold right-parenthesis Superscript bold minus Baseline bold upper X prime bold upper W bold upper Y, ModifyingAbove bold upper V With caret is the estimated covariance matrix of ModifyingAbove bold-italic beta With caret, rank(bold upper L) is the rank of bold upper L, and bold upper L Subscript normal upper F normal u normal l normal l is a matrix such that

  • bold upper L Subscript normal upper F normal u normal l normal l has the same number of columns as bold upper L

  • bold upper L Subscript normal upper F normal u normal l normal l has full row rank

  • the rank of bold upper L Subscript normal upper F normal u normal l normal l equals the rank of the bold upper L matrix

  • all rows of bold upper L Subscript normal upper F normal u normal l normal l are estimable functions

  • the Wald F statistic computed using the bold upper L Subscript normal upper F normal u normal l normal l matrix is equivalent to the Wald F statistic computed by using the bold upper L matrix with any row deleted that is a linear combination of previous rows

If bold upper L is a full-rank matrix and all rows of bold upper L are estimable functions, then bold upper L Subscript normal upper F normal u normal l normal l is the same as bold upper L. It is possible that bold upper L Subscript normal upper F normal u normal l normal l matrix cannot be constructed for contrasts in a CONTRAST statement, in which case the contrasts are not testable.

A Note on upper F Subscript Wald

The preceding tests that use the upper F Subscript Wald statistic assume that the estimated variance of ModifyingAbove bold-italic beta With caret, ModifyingAbove bold upper V With caret, is of the form bold upper X prime bold upper V Superscript negative 1 Baseline bold upper X for some estimate bold upper V of the variance of bold upper Y. In this case, estimability, bold upper L left-parenthesis bold upper X prime bold upper X right-parenthesis Superscript minus Baseline left-parenthesis bold upper X prime bold upper X right-parenthesis equals bold upper L, ensures that this upper F Subscript Wald statistic has a unique value no matter which kind of generalized inverse is used to compute it. However, when a design-based variance estimator is used to estimate the variability of ModifyingAbove bold-italic beta With caret, estimability does not ensure uniqueness. In this case, the upper F Subscript Wald value is invariant to the choice of the generalized inverse if and only if bold upper L is estimable and bold upper L prime left-bracket bold upper L ModifyingAbove bold upper V With caret bold upper L prime right-bracket Superscript minus Baseline left-bracket bold upper L ModifyingAbove bold upper V With caret bold upper L prime right-bracket equals bold upper L prime.

Although it is extremely rare, it is possible in practice that the preceding uniqueness condition is not satisfied. For example, if the number of PSUs is less than the number of nonsingular parameters in the model, then the matrix of coefficients for testing the overall null does not satisfy the uniqueness condition. If this condition is not satisfied, then the upper F Subscript Wald statistic for testing upper H colon bold upper L bold-italic beta equals bold 0 is not invariant to the choice of the g 2-inverse of bold upper L ModifyingAbove bold upper V With caret bold upper L prime. In practical applications, the test statistic is compared with an F distribution, but the value of the test statistic and therefore the inference might be different when a different g 2-inverse is used. Thus, this F test is not recommended when the uniqueness condition is not satisfied. An alternative approach would be to increase the number of PSUs or to find a parsimonious model so that the number of parameters is less than the number of PSUs.

Last updated: December 09, 2022