The GLMPOWER Procedure

Contrasts in Fixed-Effect Univariate Models

The univariate linear model has the form

bold y equals bold upper X bold-italic beta plus bold-italic epsilon

where is the N 1 vector of responses, is the N k design matrix, is the k 1 vector of model parameters that correspond to the columns of , and is an N 1 vector of errors with

epsilon 1 comma ellipsis comma epsilon Subscript upper N Baseline tilde normal upper N left-parenthesis 0 comma sigma squared right-parenthesis left-parenthesis normal i normal i normal d right-parenthesis

In PROC GLMPOWER, the model parameters are not specified directly, but rather indirectly as , which represents either conjectured response means or typical response values for each design profile. The values are manifested as the dependent variable in the MODEL statement. The vector is obtained from according to the least squares equation,

Note that, in general, there is not a one-to-one mapping between and . Many different scenarios for might lead to the same . If you specify with the intention of representing cell means, keep in mind that PROC GLMPOWER allows scenarios that are not valid cell means according to the model that is specified in the MODEL statement. For example, if exhibits an interaction effect but the corresponding interaction term is left out of the model, then the cell means () that are derived from differ from . In particular, the cell means that are derived in this way are the projection of onto the model space.

It is convenient in power analysis to parameterize the design matrix in three parts, , defined as follows:

The q k essence design matrix is the collection of unique rows of . Its rows are sometimes referred to as "design profiles." Here, q N is defined simply as the number of unique rows of .
The q 1 weight vector reveals the relative proportions of design profiles, and . Row i of is to be included in the design times for every times that row j is included. The weights are assumed to be standardized (that is, they sum up to 1).
The total sample size is N. This is the number of rows in . If you gather copies of the ith row of , for , then you end up with .

The preceding quantities are derived from PROC GLMPOWER syntax as follows:

Values for , , and are specified in the exemplary data set (from using the DATA= option in the PROC GLMPOWER statement), and the corresponding variables are identified in the CLASS, MODEL, and WEIGHT statements.
N is specified in the NTOTAL= option in the POWER statement.

It is useful to express the crossproduct matrix in terms of these three parts,

bold upper X prime bold upper X equals upper N ModifyingAbove bold upper X With two-dots prime bold upper W ModifyingAbove bold upper X With two-dots

because this expression factors out the portion (N) that depends on sample size and the portion () that depends only on the design structure.

A general linear hypothesis for the univariate model has the form

StartLayout 1st Row 1st Column upper H 0 colon 2nd Column bold upper L bold-italic beta equals bold-italic theta 0 2nd Row 1st Column upper H Subscript upper A Baseline colon 2nd Column bold upper L bold-italic beta not-equals bold-italic theta 0 EndLayout

where is an l k contrast matrix with rank and is the null value (usually just a vector of zeros).

Note that model effect tests are just contrasts that use special forms of . Thus, this scheme covers both effect tests (which are specified in the MODEL statement and the EFFECTS= option in the POWER statement) and custom contrasts (which are specified in the CONTRAST statement).

The model degrees of freedom are equal to the rank of , denoted . The error degrees of freedom are equal to N – . The sample size N must be at least plus the number of covariates.