Introduction to Regression Procedures

Testing Linear Hypotheses

Testing of linear hypotheses based on estimable functions is discussed in the section Test of Hypotheses in Chapter 3, Introduction to Statistical Modeling with SAS/STAT Software, and the construction of special sets of estimable functions corresponding to Type I–Type IV hypotheses is discussed in Chapter 16, The Four Types of Estimable Functions. In linear regression models, testing of general linear hypotheses follows along the same lines. Test statistics are usually formed based on sums of squares that are associated with the hypothesis in question. Furthermore, when bold upper X is of full rank—as is the case in many regression models—the consistency of the linear hypothesis is guaranteed.

Recall from Chapter 3, Introduction to Statistical Modeling with SAS/STAT Software, that the general form of a linear hypothesis for the parameters is upper H colon bold upper L bold-italic beta equals bold d, where bold upper L is q times k, bold-italic beta is k times 1, and bold d is q times 1. To test this hypothesis, you take the linear function with respect to the parameter estimates: bold upper L ModifyingAbove bold-italic beta With caret minus bold d. This linear function in ModifyingAbove bold-italic beta With caret has variance

normal upper V normal a normal r left-bracket bold upper L ModifyingAbove bold-italic beta With caret right-bracket equals bold upper L normal upper V normal a normal r left-bracket ModifyingAbove bold-italic beta With caret right-bracket bold upper L Superscript prime Baseline equals sigma squared bold upper L left-parenthesis bold upper X prime bold upper W bold upper X right-parenthesis Superscript minus Baseline bold upper L prime

The sum of squares due to the hypothesis is a simple quadratic form:

normal upper S normal upper S left-parenthesis upper H right-parenthesis equals normal upper S normal upper S left-parenthesis bold upper L ModifyingAbove bold-italic beta With caret minus bold d right-parenthesis equals left-parenthesis bold upper L ModifyingAbove bold-italic beta With caret minus bold d right-parenthesis prime left-parenthesis bold upper L left-parenthesis bold upper X prime bold upper W bold upper X right-parenthesis Superscript minus Baseline bold upper L prime right-parenthesis Superscript negative 1 Baseline left-parenthesis bold upper L ModifyingAbove bold-italic beta With caret minus bold d right-parenthesis

If this hypothesis is testable, then normal upper S normal upper S left-parenthesis upper H right-parenthesis can be used in the numerator of an F statistic:

upper F equals StartFraction SS left-parenthesis upper H right-parenthesis slash q Over s squared EndFraction equals StartFraction SS left-parenthesis bold upper L bold b minus bold d right-parenthesis slash q Over s squared EndFraction

If ModifyingAbove bold-italic beta With caret is normally distributed, which follows as a consequence of normally distributed model errors, then this statistic follows an F distribution with q numerator degrees of freedom and n minus normal r normal a normal n normal k left-parenthesis bold upper X right-parenthesis denominator degrees of freedom. Note that it was assumed in this derivation that bold upper L is of full row rank q.

Last updated: December 09, 2022