The MIXED Procedure

REPEATED Statement

  • REPEATED <repeated-effect> </ options>;

The REPEATED statement is used to specify the bold upper R matrix in the mixed model. Its syntax is different from that of the REPEATED statement in PROC GLM. If no REPEATED statement is specified, bold upper R is assumed to be equal to sigma squared bold upper I.

For many repeated measures models, no repeated effect is required in the REPEATED statement. Simply use the SUBJECT= option to define the blocks of bold upper R and the TYPE= option to define their covariance structure. In this case, the repeated measures data must be similarly ordered for each subject, and you must indicate all missing response variables with periods in the input data set unless they all fall at the end of a subject’s repeated response profile. These requirements are necessary in order to inform PROC MIXED of the proper location of the observed repeated responses.

Specifying a repeated effect is useful when you do not want to indicate missing values with periods in the input data set. The repeated effect must contain only classification variables. Make sure that the levels of the repeated effect are different for each observation within a subject; otherwise, PROC MIXED constructs identical rows in bold upper R corresponding to the observations with the same level. This results in a singular bold upper R and an infinite likelihood.

Whether you specify a REPEATED effect or not, the rows of bold upper R for each subject are constructed in the order in which they appear in the input data set.

Table 16 summarizes the options available in the REPEATED statement. All options are subsequently discussed in alphabetical order.

Table 16: Summary of REPEATED Statement Options

Option Description
Construction of Covariance Structure
GROUP= Defines an effect specifying heterogeneity in the R-side covariance structure
LDATA= Specifies data set with coefficient matrices for TYPE=LIN
LOCAL Requests that a diagonal matrix be added to bold upper R
LOCALW Specifies that only the local effects are weighted
NONLOCALW Specifies that only the nonlocal effects are weighted
SUBJECT= Identifies the subjects in the R-side model
TYPE= Specifies the R-side covariance structure
Statistical Output
HLM Produces a table of Hotelling-Lawley-McKeon statistics (McKeon 1974)
HLPS Produces a table of Hotelling-Lawley-Pillai-Samson statistics (Pillai and Samson 1959)
R Displays blocks of the estimated bold upper R matrix
RC Display the Cholesky root (lower) of blocks of the estimated bold upper R matrix
RCI Displays the inverse Cholesky root (lower) of blocks of the estimated bold upper R matrix
RCORR Displays the correlation matrix corresponding to blocks of the estimated bold upper R matrix
RI Displays the inverse of blocks of the estimated bold upper R matrix


You can specify the following options in the REPEATED statement after a slash (/).

GROUP=effect
GRP=effect

defines an effect that specifies heterogeneity in the covariance structure of bold upper R. All observations that have the same level of the GROUP effect have the same covariance parameters. Each new level of the GROUP effect produces a new set of covariance parameters with the same structure as the original group. You should exercise caution in properly defining the GROUP effect, because strange covariance patterns can result with its misuse. Also, the GROUP effect can greatly increase the number of estimated covariance parameters, which can adversely affect the optimization process.

Continuous variables are permitted as arguments to the GROUP= option. PROC MIXED does not sort by the values of the continuous variable; rather, it considers the data to be from a new subject or group whenever the value of the continuous variable changes from the previous observation. Using a continuous variable decreases execution time for models with a large number of subjects or groups and also prevents the production of a large "Class Level Information" table.

HLM

produces a table of Hotelling-Lawley-McKeon statistics (McKeon 1974) for all fixed effects whose levels change across data having the same level of the SUBJECT= effect (the within-subject fixed effects). This option applies only when you specify a REPEATED statement with the TYPE=UN option and no RANDOM statements. For balanced data, this model is equivalent to the multivariate model for repeated measures in PROC GLM.

The Hotelling-Lawley-McKeon statistic has a slightly better F approximation than the Hotelling-Lawley-Pillai-Samson statistic (see the description of the HLPS option, which follows). Both of the Hotelling-Lawley statistics can perform much better in small samples than the default F statistic (Wright 1994).

Separate tables are produced for Type 1, 2, and 3 tests, according to the ones you select. The ODS table names are HLM1, HLM2, and HLM3, respectively.

HLPS

produces a table of Hotelling-Lawley-Pillai-Samson statistics (Pillai and Samson 1959) for all fixed effects whose levels change across data having the same level of the SUBJECT= effect (the within-subject fixed effects). This option applies only when you specify a REPEATED statement with the TYPE=UN option and no RANDOM statements. For balanced data, this model is equivalent to the multivariate model for repeated measures in PROC GLM, and this statistic is the same as the Hotelling-Lawley Trace statistic produced by PROC GLM.

Separate tables are produced for Type 1, 2, and 3 tests, according to the ones you select. The ODS table names are HLPS1, HLPS2, and HLPS3, respectively.

LDATA=SAS-data-set

reads the coefficient matrices associated with the TYPE=LIN(number) option. The data set must contain the variables Parm, Row, Col1Coln or Parm, Row, Col, Value. The Parm variable denotes which of the number coefficient matrices is currently being constructed, and the Row, Col1Coln, or Row, Col, Value variables specify the matrix values, as they do with the RANDOM statement option GDATA=. Unspecified values of these matrices are set equal to 0.

LOCAL
LOCAL=EXP(<effects>)
LOCAL=POM(POM-data-set)

requests that a diagonal matrix be added to bold upper R. With just the LOCAL option, this diagonal matrix equals sigma squared bold upper I, and sigma squared becomes an additional variance parameter that PROC MIXED profiles out of the likelihood provided that you do not specify the NOPROFILE option in the PROC MIXED statement. The LOCAL option is useful if you want to add an observational error to a time series structure (Jones and Boadi-Boateng 1991) or a nugget effect to a spatial structure Cressie (1993).

The LOCAL=EXP(<effects>) option produces exponential local effects, also known as dispersion effects, in a log-linear variance model. These local effects have the form

sigma squared normal d normal i normal a normal g left-bracket normal e normal x normal p left-parenthesis bold upper U bold-italic delta right-parenthesis right-bracket

where bold upper U is the full-rank design matrix corresponding to the effects that you specify and bold-italic delta are the parameters that PROC MIXED estimates. An intercept is not included in bold upper U because it is accounted for by sigma squared. PROC MIXED constructs the full-rank bold upper U in terms of 1s and –1s for classification effects. Be sure to scale continuous effects in bold upper U sensibly.

The LOCAL=POM(POM-data-set) option specifies the power-of-the-mean structure. This structure possesses a variance of the form sigma squared StartAbsoluteValue bold x prime Subscript i Baseline bold-italic beta Superscript asterisk Baseline EndAbsoluteValue Superscript theta for the ith observation, where bold x Subscript i is the ith row of bold upper X (the design matrix of the fixed effects) and bold-italic beta Superscript asterisk is an estimate of the fixed-effects parameters that you specify in POM-data-set.

The SAS data set specified by POM-data-set contains the numeric variable Estimate (in previous releases, the variable name was required to be EST), and it has at least as many observations as there are fixed-effects parameters. The first p observations of the Estimate variable in POM-data-set are taken to be the elements of bold-italic beta Superscript asterisk, where p is the number of columns of bold upper X. You must order these observations according to the non-full-rank parameterization of the MIXED procedure. One easy way to set up POM-data-set for a bold-italic beta Superscript asterisk corresponding to ordinary least squares is illustrated by the following statements:

ods output SolutionF=sf;
proc mixed;
   class a;
   model y = a x / s;
run;

proc mixed;
   class a;
   model y = a x;
   repeated / local=pom(sf);
run;

Note that the generalized least squares estimate of the fixed-effects parameters from the second PROC MIXED step usually is not the same as your specified bold-italic beta Superscript asterisk. However, you can iterate the POM fitting until the two estimates agree. Continuing from the previous example, the statements for performing one step of this iteration are as follows:

ods output SolutionF=sf1;
proc mixed;
   class a;
   model y = a x / s;
   repeated / local=pom(sf);
run;

proc compare brief data=sf compare=sf1;
   var estimate;
run;

data sf;
   set sf1;
run;

Unfortunately, this iterative process does not always converge. For further details, see the description of pseudo-likelihood in Chapter 3 of Carroll and Ruppert (1988).

LOCALW

specifies that only the local effects and no others be weighted. By default, all effects are weighted. The LOCALW option is used in connection with the WEIGHT statement and the LOCAL option in the REPEATED statement.

NONLOCALW

specifies that only the nonlocal effects and no others be weighted. By default, all effects are weighted. The NONLOCALW option is used in connection with the WEIGHT statement and the LOCAL option in the REPEATED statement.

R<=value-list>

requests that blocks of the estimated bold upper R matrix be displayed. The first block determined by the SUBJECT= effect is the default displayed block. PROC MIXED displays blanks for value-lists that are 0.

The value-list indicates the subjects for which blocks of bold upper R are to be displayed. For example, the following statement displays block matrices for the first, third, and fifth persons:

repeated / type=cs subject=person r=1,3,5;

See the PARMS statement for the possible forms of value-list. The ODS table name is R.

RC<=value-list>

produces the Cholesky root of blocks of the estimated bold upper R matrix. The value-list specification is the same as with the R option. The ODS table name is CholR.

RCI<=value-list>

produces the inverse Cholesky root of blocks of the estimated bold upper R matrix. The value-list specification is the same as with the R option. The ODS table name is InvCholR.

RCORR<=value-list>

produces the correlation matrix corresponding to blocks of the estimated bold upper R matrix. The value-list specification is the same as with the R option. The ODS table name is RCorr.

RI<=value-list>

produces the inverse of blocks of the estimated bold upper R matrix. The value-list specification is the same as with the R option. The ODS table name is InvR.

SSCP

requests that an unstructured bold upper R matrix be estimated from the sum-of-squares-and-crossproducts matrix of the residuals. It applies only when you specify TYPE=UN and have no RANDOM statements. Also, you must have a sufficient number of subjects for the estimate to be positive definite.

This option is useful when the size of the blocks of bold upper R is large (for example, greater than 10) and you want to use or inspect an unstructured estimate that is much quicker to compute than the default REML estimate. The two estimates will agree for certain balanced data sets when you have a classification fixed effect defined across all time points within a subject.

SUBJECT=effect
SUB=effect

identifies the subjects in your mixed model. Complete independence is assumed across subjects; therefore, the SUBJECT= option produces a block-diagonal structure in bold upper R with identical blocks. When the SUBJECT= effect consists entirely of classification variables, the blocks of bold upper R correspond to observations sharing the same level of that effect. These blocks are sorted according to this effect as well.

Continuous variables are permitted as arguments to the SUBJECT= option. PROC MIXED does not sort by the values of the continuous variable; rather, it considers the data to be from a new subject or group whenever the value of the continuous variable changes from the previous observation. Using a continuous variable decreases execution time for models with a large number of subjects or groups and also prevents the production of a large "Class Level Information" table.

If you want to model nonzero covariance among all of the observations in your SAS data set, specify SUBJECT=INTERCEPT to treat the data as if they are all from one subject. However, be aware that in this case PROC MIXED manipulates an bold upper R matrix with dimensions equal to the number of observations. If no SUBJECT= effect is specified, then every observation is assumed to be from a different subject and bold upper R is assumed to be diagonal. For this reason, you usually want to use the SUBJECT= option in the REPEATED statement.

TYPE=covariance-structure

specifies the covariance structure of the bold upper R matrix. The SUBJECT= option defines the blocks of bold upper R, and the TYPE= option specifies the structure of these blocks. Valid values for covariance-structure and their descriptions are provided in Table 17 and Table 18. The default structure is VC.

Table 17: Covariance Structures

Structure Description Parms left-parenthesis i comma j right-parenthesis element
ANTE(1) Antedependence 2 t minus 1 sigma Subscript i Baseline sigma Subscript j Baseline product Underscript k equals i Overscript j minus 1 Endscripts rho Subscript k
AR(1) Autoregressive(1) 2 sigma squared rho Superscript StartAbsoluteValue i minus j EndAbsoluteValue
ARH(1) Heterogeneous AR(1) t plus 1 sigma Subscript i Baseline sigma Subscript j Baseline rho Superscript StartAbsoluteValue i minus j EndAbsoluteValue
ARMA(1,1) ARMA(1,1) 3 sigma squared left-bracket gamma rho Superscript StartAbsoluteValue i minus j EndAbsoluteValue minus 1 Baseline Baseline 1 left-parenthesis i not-equals j right-parenthesis plus 1 left-parenthesis i equals j right-parenthesis right-bracket
CS Compound symmetry 2 sigma 1 plus sigma squared 1 left-parenthesis i equals j right-parenthesis
CSH Heterogeneous CS t plus 1 sigma Subscript i Baseline sigma Subscript j Baseline left-bracket rho Baseline 1 left-parenthesis i not-equals j right-parenthesis plus 1 left-parenthesis i equals j right-parenthesis right-bracket
FA(q) Factor analytic StartFraction q Over 2 EndFraction left-parenthesis 2 t minus q plus 1 right-parenthesis plus t normal upper Sigma Subscript k equals 1 Superscript min left-parenthesis i comma j comma q right-parenthesis Baseline lamda Subscript i k Baseline lamda Subscript j k plus sigma Subscript i Superscript 2 Baseline 1 left-parenthesis i equals j right-parenthesis
FA0(q) No diagonal FA StartFraction q Over 2 EndFraction left-parenthesis 2 t minus q plus 1 right-parenthesis normal upper Sigma Subscript k equals 1 Superscript min left-parenthesis i comma j comma q right-parenthesis Baseline lamda Subscript i k Baseline lamda Subscript j k
FA1(q) Equal diagonal FA StartFraction q Over 2 EndFraction left-parenthesis 2 t minus q plus 1 right-parenthesis plus 1 normal upper Sigma Subscript k equals 1 Superscript min left-parenthesis i comma j comma q right-parenthesis Baseline lamda Subscript i k Baseline lamda Subscript j k plus sigma squared 1 left-parenthesis i equals j right-parenthesis
HF Huynh-Feldt t plus 1 left-parenthesis sigma Subscript i Superscript 2 Baseline plus sigma Subscript j Superscript 2 Baseline right-parenthesis slash 2 plus lamda Baseline 1 left-parenthesis i not-equals j right-parenthesis
LIN(q) General linear q normal upper Sigma Subscript k equals 1 Superscript q Baseline theta Subscript k Baseline bold upper A Subscript i j
TOEP Toeplitz t sigma Subscript StartAbsoluteValue i minus j EndAbsoluteValue plus 1
TOEP(q) Banded Toeplitz q sigma Subscript StartAbsoluteValue i minus j EndAbsoluteValue plus 1 Baseline 1 left-parenthesis StartAbsoluteValue i minus j EndAbsoluteValue less-than q right-parenthesis
TOEPH Heterogeneous TOEP 2 t minus 1 sigma Subscript i Baseline sigma Subscript j Baseline rho Subscript StartAbsoluteValue i minus j EndAbsoluteValue
TOEPH(q) Banded hetero TOEP t plus q minus 1 sigma Subscript i Baseline sigma Subscript j Baseline rho Subscript StartAbsoluteValue i minus j EndAbsoluteValue Baseline 1 left-parenthesis StartAbsoluteValue i minus j EndAbsoluteValue less-than q right-parenthesis
UN Unstructured t left-parenthesis t plus 1 right-parenthesis slash 2 sigma Subscript i j
UN(q) Banded StartFraction q Over 2 EndFraction left-parenthesis 2 t minus q plus 1 right-parenthesis sigma Subscript i j Baseline 1 left-parenthesis StartAbsoluteValue i minus j EndAbsoluteValue less-than q right-parenthesis
UNR Unstructured corrs t left-parenthesis t plus 1 right-parenthesis slash 2 sigma Subscript i Baseline sigma Subscript j Baseline rho Subscript max left-parenthesis i comma j right-parenthesis min left-parenthesis i comma j right-parenthesis
UNR(q) Banded correlations StartFraction q Over 2 EndFraction left-parenthesis 2 t minus q plus 1 right-parenthesis sigma Subscript i Baseline sigma Subscript j Baseline rho Subscript max left-parenthesis i comma j right-parenthesis min left-parenthesis i comma j right-parenthesis
UN@AR(1) Direct product AR(1) t 1 left-parenthesis t 1 plus 1 right-parenthesis slash 2 plus 1 sigma Subscript i 1 j 1 Baseline rho Superscript StartAbsoluteValue i 2 minus j 2 EndAbsoluteValue
UN@CS Direct product CS t 1 left-parenthesis t 1 plus 1 right-parenthesis slash 2 plus 1 StartLayout Enlarged left-brace 1st Row 1st Column sigma Subscript i 1 j 1 2nd Column i 2 equals j 2 2nd Row 1st Column sigma squared sigma Subscript i 1 j 1 2nd Column i 2 not-equals j 2 3rd Row 1st Column 0 less-than-or-equal-to sigma squared less-than-or-equal-to 1 2nd Column Blank EndLayout
UN@UN Direct product UN t 1 left-parenthesis t 1 plus 1 right-parenthesis slash 2 plus sigma Subscript 1 comma i 1 j 1 Baseline sigma Subscript 2 comma i 2 j 2
t 2 left-parenthesis t 2 plus 1 right-parenthesis slash 2 minus 1
VC Variance components q sigma Subscript k Superscript 2 Baseline 1 left-parenthesis i equals j right-parenthesis
and i corresponds to kth effect


In Table 17, "Parms" is the number of covariance parameters in the structure, t is the overall dimension of the covariance matrix, and 1 left-parenthesis upper A right-parenthesis equals 1 when A is true and 0 otherwise. For example, 1left-parenthesis i equals j right-parenthesis equals 1 when i equals j and 0 otherwise, and 1left-parenthesis StartAbsoluteValue i minus j EndAbsoluteValue less-than q right-parenthesis equals 1 when StartAbsoluteValue i minus j EndAbsoluteValue less-than q and 0 otherwise. For the TYPE=TOEPH structures, rho 0 equals 1, and for the TYPE=UNR structures, rho Subscript i i Baseline equals 1 for all i. For the direct product structures, the subscripts "1" and "2" see the first and second structure in the direct product, respectively, and i 1 equals normal i normal n normal t left-parenthesis left-parenthesis i plus t 2 minus 1 right-parenthesis slash t 2 right-parenthesis, j 1 equals normal i normal n normal t left-parenthesis left-parenthesis j plus t 2 minus 1 right-parenthesis slash t 2 right-parenthesis, i 2 equals normal m normal o normal d left-parenthesis i minus 1 comma t 2 right-parenthesis plus 1, and j 2 equals normal m normal o normal d left-parenthesis j minus 1 comma t 2 right-parenthesis plus 1.

Table 18: Spatial Covariance Structures

Structure Description Parms left-parenthesis i comma j right-parenthesis element
SP(EXP)(c-list) Exponential 2 sigma squared exp left-brace minus d Subscript i j Baseline slash theta right-brace
SP(EXPA)(c-list) Anisotropic exponential 2 c plus 1 sigma squared product Underscript k equals 1 Overscript c Endscripts exp left-brace minus theta Subscript k Baseline d left-parenthesis i comma j comma k right-parenthesis Superscript p Super Subscript k Superscript Baseline right-brace
SP(EXPGA)(sans-serif-italic c Subscript 1 Baseline sans-serif-italic c Subscript 2) 2D exponential, 4 sigma squared exp left-brace minus d Subscript i j Baseline left-parenthesis theta comma lamda right-parenthesis slash rho right-brace
geometrically anisotropic
SP(GAU)(c-list) Gaussian 2 sigma squared exp left-brace minus d Subscript i j Superscript 2 Baseline slash rho squared right-brace
SP(GAUGA)(sans-serif-italic c Subscript 1 Baseline sans-serif-italic c Subscript 2) 2D Gaussian, 4 sigma squared exp left-brace minus d Subscript i j Baseline left-parenthesis theta comma lamda right-parenthesis squared slash rho squared right-brace
geometrically anisotropic
SP(LIN)(c-list) Linear 2 sigma squared left-parenthesis 1 minus rho d Subscript i j Baseline right-parenthesis 1 left-parenthesis rho d Subscript i j Baseline less-than-or-equal-to 1 right-parenthesis
SP(LINL)(c-list) Linear log 2 sigma squared left-parenthesis 1 minus rho log left-parenthesis d Subscript i j Baseline right-parenthesis right-parenthesis
times 1 left-parenthesis rho log left-parenthesis d Subscript i j Baseline right-parenthesis less-than-or-equal-to 1 comma d Subscript i j Baseline greater-than 0 right-parenthesis
SP(LEAR)(c-list) Linear exponent autoregressive 3 sigma squared rho Superscript d Super Subscript normal m normal i normal n Superscript plus delta left-bracket left-parenthesis d Super Subscript i j Superscript minus d Super Subscript normal m normal i normal n Superscript right-parenthesis slash left-parenthesis d Super Subscript normal m normal a normal x Superscript minus d Super Subscript normal m normal i normal n Superscript right-parenthesis right-bracket
SP(MATERN)(c-list) Matérn 3 sigma squared StartFraction 1 Over normal upper Gamma left-parenthesis nu right-parenthesis EndFraction left-parenthesis StartFraction d Subscript i j Baseline Over 2 rho EndFraction right-parenthesis Superscript nu Baseline 2 upper K Subscript nu Baseline left-parenthesis d Subscript i j Baseline slash rho right-parenthesis
SP(MATHSW)(c-list) Matérn 3 sigma squared StartFraction 1 Over normal upper Gamma left-parenthesis nu right-parenthesis EndFraction left-parenthesis StartFraction d Subscript i j Baseline StartRoot nu EndRoot Over rho EndFraction right-parenthesis Superscript nu Baseline 2 upper K Subscript nu Baseline left-parenthesis StartFraction 2 d Subscript i j Baseline StartRoot nu EndRoot Over rho EndFraction right-parenthesis
(Handcock-Stein-Wallis)
SP(POW)(c-list) Power 2 sigma squared rho Superscript d Super Subscript i j
SP(POWA)(c-list) Anisotropic power c plus 1 sigma squared rho 1 Superscript d left-parenthesis i comma j comma 1 right-parenthesis Baseline rho 2 Superscript d left-parenthesis i comma j comma 2 right-parenthesis Baseline ellipsis rho Subscript c Superscript d left-parenthesis i comma j comma c right-parenthesis
SP(SPH)(c-list) Spherical 2 sigma squared left-bracket 1 minus left-parenthesis StartFraction 3 d Subscript i j Baseline Over 2 rho EndFraction right-parenthesis plus left-parenthesis StartFraction d Subscript i j Superscript 3 Baseline Over 2 rho cubed EndFraction right-parenthesis right-bracket 1 left-parenthesis d Subscript i j Baseline less-than-or-equal-to rho right-parenthesis
SP(SPHGA)(sans-serif-italic c Subscript 1 Baseline sans-serif-italic c Subscript 2) 2D spherical, 4 sigma squared left-bracket 1 minus left-parenthesis StartFraction 3 d Subscript i j Baseline left-parenthesis theta comma lamda right-parenthesis Over 2 rho EndFraction right-parenthesis plus left-parenthesis StartFraction d Subscript i j Baseline left-parenthesis theta comma lamda right-parenthesis cubed Over 2 rho cubed EndFraction right-parenthesis right-bracket
geometrically anisotropic times 1 left-parenthesis d Subscript i j Baseline left-parenthesis theta comma lamda right-parenthesis less-than-or-equal-to rho right-parenthesis


In Table 18, c-list contains the names of the numeric variables used as coordinates of the location of the observation in space, and d Subscript i j is the Euclidean distance between the ith and jth vectors of these coordinates, which correspond to the ith and jth observations in the input data set. For SP(POWA) and SP(EXPA), c is the number of coordinates, and d left-parenthesis i comma j comma k right-parenthesis is the absolute distance between the kth coordinate, k equals 1 comma ellipsis comma c, of the ith and jth observations in the input data set. For the geometrically anisotropic structures SP(EXPGA), SP(GAUGA), and SP(SPHGA), exactly two spatial coordinate variables must be specified as sans-serif-italic c Subscript 1 and sans-serif-italic c Subscript 2. Geometric anisotropy is corrected by applying a rotation theta and scaling lamda to the coordinate system, and d Subscript i j Baseline left-parenthesis theta comma lamda right-parenthesis represents the Euclidean distance between two points in the transformed space. SP(MATERN) and SP(MATHSW) represent covariance structures in a class defined by Matérn (see Matérn 1986; Handcock and Stein 1993; Handcock and Wallis 1994). The function upper K Subscript nu is the modified Bessel function of the second kind of (real) order nu greater-than 0; the parameter nu governs the smoothness of the process (see below for more details).

Table 19 lists some examples of the structures in Table 17 and Table 18.

Table 19: Covariance Structure Examples

Description Structure Example
Variance
components
VC (default) Start 4 By 4 Matrix 1st Row 1st Column sigma Subscript upper B Superscript 2 2nd Column 0 3rd Column 0 4th Column 0 2nd Row 1st Column 0 2nd Column sigma Subscript upper B Superscript 2 3rd Column 0 4th Column 0 3rd Row 1st Column 0 2nd Column 0 3rd Column sigma Subscript upper A upper B Superscript 2 4th Column 0 4th Row 1st Column 0 2nd Column 0 3rd Column 0 4th Column sigma Subscript upper A upper B Superscript 2 EndMatrix
Compound
symmetry
CS Start 4 By 4 Matrix 1st Row 1st Column sigma squared plus sigma 1 2nd Column sigma 1 3rd Column sigma 1 4th Column sigma 1 2nd Row 1st Column sigma 1 2nd Column sigma squared plus sigma 1 3rd Column sigma 1 4th Column sigma 1 3rd Row 1st Column sigma 1 2nd Column sigma 1 3rd Column sigma squared plus sigma 1 4th Column sigma 1 4th Row 1st Column sigma 1 2nd Column sigma 1 3rd Column sigma 1 4th Column sigma squared plus sigma 1 EndMatrix
Unstructured UN Start 4 By 4 Matrix 1st Row 1st Column sigma 1 squared 2nd Column sigma 21 3rd Column sigma 31 4th Column sigma 41 2nd Row 1st Column sigma 21 2nd Column sigma 2 squared 3rd Column sigma 32 4th Column sigma 42 3rd Row 1st Column sigma 31 2nd Column sigma 32 3rd Column sigma 3 squared 4th Column sigma 43 4th Row 1st Column sigma 41 2nd Column sigma 42 3rd Column sigma 43 4th Column sigma 4 squared EndMatrix
Banded main
diagonal
UN(1) Start 4 By 4 Matrix 1st Row 1st Column sigma 1 squared 2nd Column 0 3rd Column 0 4th Column 0 2nd Row 1st Column 0 2nd Column sigma 2 squared 3rd Column 0 4th Column 0 3rd Row 1st Column 0 2nd Column 0 3rd Column sigma 3 squared 4th Column 0 4th Row 1st Column 0 2nd Column 0 3rd Column 0 4th Column sigma 4 squared EndMatrix
First-order
autoregressive
AR(1) sigma squared Start 4 By 4 Matrix 1st Row 1st Column 1 2nd Column rho 3rd Column rho squared 4th Column rho cubed 2nd Row 1st Column rho 2nd Column 1 3rd Column rho 4th Column rho squared 3rd Row 1st Column rho squared 2nd Column rho 3rd Column 1 4th Column rho 4th Row 1st Column rho cubed 2nd Column rho squared 3rd Column rho 4th Column 1 EndMatrix
Toeplitz TOEP Start 4 By 4 Matrix 1st Row 1st Column sigma squared 2nd Column sigma 1 3rd Column sigma 2 4th Column sigma 3 2nd Row 1st Column sigma 1 2nd Column sigma squared 3rd Column sigma 1 4th Column sigma 2 3rd Row 1st Column sigma 2 2nd Column sigma 1 3rd Column sigma squared 4th Column sigma 1 4th Row 1st Column sigma 3 2nd Column sigma 2 3rd Column sigma 1 4th Column sigma squared EndMatrix
Toeplitz with
two bands
TOEP(2) Start 4 By 4 Matrix 1st Row 1st Column sigma squared 2nd Column sigma 1 3rd Column 0 4th Column 0 2nd Row 1st Column sigma 1 2nd Column sigma squared 3rd Column sigma 1 4th Column 0 3rd Row 1st Column 0 2nd Column sigma 1 3rd Column sigma squared 4th Column sigma 1 4th Row 1st Column 0 2nd Column 0 3rd Column sigma 1 4th Column sigma squared EndMatrix
Spatial
power
SP(POW)(c) sigma squared Start 4 By 4 Matrix 1st Row 1st Column 1 2nd Column rho Superscript d 12 3rd Column rho Superscript d 13 4th Column rho Superscript d 14 2nd Row 1st Column rho Superscript d 21 2nd Column 1 3rd Column rho Superscript d 23 4th Column rho Superscript d 24 3rd Row 1st Column rho Superscript d 31 2nd Column rho Superscript d 32 3rd Column 1 4th Column rho Superscript d 34 4th Row 1st Column rho Superscript d 41 2nd Column rho Superscript d 42 3rd Column rho Superscript d 43 4th Column 1 EndMatrix
Heterogeneous
AR(1)
ARH(1) Start 4 By 4 Matrix 1st Row 1st Column sigma 1 squared 2nd Column sigma 1 sigma 2 rho 3rd Column sigma 1 sigma 3 rho squared 4th Column sigma 1 sigma 4 rho cubed 2nd Row 1st Column sigma 2 sigma 1 rho 2nd Column sigma 2 squared 3rd Column sigma 2 sigma 3 rho 4th Column sigma 2 sigma 4 rho squared 3rd Row 1st Column sigma 3 sigma 1 rho squared 2nd Column sigma 3 sigma 2 rho 3rd Column sigma 3 squared 4th Column sigma 3 sigma 4 rho 4th Row 1st Column sigma 4 sigma 1 rho cubed 2nd Column sigma 4 sigma 2 rho squared 3rd Column sigma 4 sigma 3 rho 4th Column sigma 4 squared EndMatrix
First-order
autoregressive
moving average
ARMA(1,1) sigma squared Start 4 By 4 Matrix 1st Row 1st Column 1 2nd Column gamma 3rd Column gamma rho 4th Column gamma rho squared 2nd Row 1st Column gamma 2nd Column 1 3rd Column gamma 4th Column gamma rho 3rd Row 1st Column gamma rho 2nd Column gamma 3rd Column 1 4th Column gamma 4th Row 1st Column gamma rho squared 2nd Column gamma rho 3rd Column gamma 4th Column 1 EndMatrix
Heterogeneous
CS
CSH Start 4 By 4 Matrix 1st Row 1st Column sigma 1 squared 2nd Column sigma 1 sigma 2 rho 3rd Column sigma 1 sigma 3 rho 4th Column sigma 1 sigma 4 rho 2nd Row 1st Column sigma 2 sigma 1 rho 2nd Column sigma 2 squared 3rd Column sigma 2 sigma 3 rho 4th Column sigma 2 sigma 4 rho 3rd Row 1st Column sigma 3 sigma 1 rho 2nd Column sigma 3 sigma 2 rho 3rd Column sigma 3 squared 4th Column sigma 3 sigma 4 rho 4th Row 1st Column sigma 4 sigma 1 rho 2nd Column sigma 4 sigma 2 rho 3rd Column sigma 4 sigma 3 rho 4th Column sigma 4 squared EndMatrix
First-order
factor
analytic
FA(1) Start 4 By 4 Matrix 1st Row 1st Column lamda 1 squared plus d 1 2nd Column lamda 1 lamda 2 3rd Column lamda 1 lamda 3 4th Column lamda 1 lamda 4 2nd Row 1st Column lamda 2 lamda 1 2nd Column lamda 2 squared plus d 2 3rd Column lamda 2 lamda 3 4th Column lamda 2 lamda 4 3rd Row 1st Column lamda 3 lamda 1 2nd Column lamda 3 lamda 2 3rd Column lamda 3 squared plus d 3 4th Column lamda 3 lamda 4 4th Row 1st Column lamda 4 lamda 1 2nd Column lamda 4 lamda 2 3rd Column lamda 4 lamda 3 4th Column lamda 4 squared plus d 4 EndMatrix
Huynh-Feldt HF Start 3 By 3 Matrix 1st Row 1st Column sigma 1 squared 2nd Column StartFraction sigma 1 squared plus sigma 2 squared Over 2 EndFraction minus lamda 3rd Column StartFraction sigma 1 squared plus sigma 3 squared Over 2 EndFraction minus lamda 2nd Row 1st Column StartFraction sigma 2 squared plus sigma 1 squared Over 2 EndFraction minus lamda 2nd Column sigma 2 squared 3rd Column StartFraction sigma 2 squared plus sigma 3 squared Over 2 EndFraction minus lamda 3rd Row 1st Column StartFraction sigma 3 squared plus sigma 1 squared Over 2 EndFraction minus lamda 2nd Column StartFraction sigma 3 squared plus sigma 2 squared Over 2 EndFraction minus lamda 3rd Column sigma 3 squared EndMatrix
First-order
antedependence
ANTE(1) Start 3 By 3 Matrix 1st Row 1st Column sigma 1 squared 2nd Column sigma 1 sigma 2 rho 1 3rd Column sigma 1 sigma 3 rho 1 rho 2 2nd Row 1st Column sigma 2 sigma 1 rho 1 2nd Column sigma 2 squared 3rd Column sigma 2 sigma 3 rho 2 3rd Row 1st Column sigma 3 sigma 1 rho 2 rho 1 2nd Column sigma 3 sigma 2 rho 2 3rd Column sigma 3 squared EndMatrix
Heterogeneous
Toeplitz
TOEPH Start 4 By 4 Matrix 1st Row 1st Column sigma 1 squared 2nd Column sigma 1 sigma 2 rho 1 3rd Column sigma 1 sigma 3 rho 2 4th Column sigma 1 sigma 4 rho 3 2nd Row 1st Column sigma 2 sigma 1 rho 1 2nd Column sigma 2 squared 3rd Column sigma 2 sigma 3 rho 1 4th Column sigma 2 sigma 4 rho 2 3rd Row 1st Column sigma 3 sigma 1 rho 2 2nd Column sigma 3 sigma 2 rho 1 3rd Column sigma 3 squared 4th Column sigma 3 sigma 4 rho 1 4th Row 1st Column sigma 4 sigma 1 rho 3 2nd Column sigma 4 sigma 2 rho 2 3rd Column sigma 4 sigma 3 rho 1 4th Column sigma 4 squared EndMatrix
Unstructured
correlations
UNR Start 4 By 4 Matrix 1st Row 1st Column sigma 1 squared 2nd Column sigma 1 sigma 2 rho 21 3rd Column sigma 1 sigma 3 rho 31 4th Column sigma 1 sigma 4 rho 41 2nd Row 1st Column sigma 2 sigma 1 rho 21 2nd Column sigma 2 squared 3rd Column sigma 2 sigma 3 rho 32 4th Column sigma 2 sigma 4 rho 42 3rd Row 1st Column sigma 3 sigma 1 rho 31 2nd Column sigma 3 sigma 2 rho 32 3rd Column sigma 3 squared 4th Column sigma 3 sigma 4 rho 43 4th Row 1st Column sigma 4 sigma 1 rho 41 2nd Column sigma 4 sigma 2 rho 42 3rd Column sigma 4 sigma 3 rho 43 4th Column sigma 4 squared EndMatrix
Direct product
AR(1)
UN@AR(1) Start 2 By 2 Matrix 1st Row 1st Column sigma 1 squared 2nd Column sigma 21 2nd Row 1st Column sigma 21 2nd Column sigma 2 squared EndMatrix circled-times Start 3 By 3 Matrix 1st Row 1st Column 1 2nd Column rho 3rd Column rho squared 2nd Row 1st Column rho 2nd Column 1 3rd Column rho 3rd Row 1st Column rho squared 2nd Column rho 3rd Column 1 EndMatrix equals
Start 6 By 6 Matrix 1st Row 1st Column sigma 1 squared 2nd Column sigma 1 squared rho 3rd Column sigma 1 squared rho squared 4th Column sigma 21 5th Column sigma 21 rho 6th Column sigma 21 rho squared 2nd Row 1st Column sigma 1 squared rho 2nd Column sigma 1 squared 3rd Column sigma 1 squared rho 4th Column sigma 21 rho 5th Column sigma 21 6th Column sigma 21 rho 3rd Row 1st Column sigma 1 squared rho squared 2nd Column sigma 1 squared rho 3rd Column sigma 1 squared 4th Column sigma 21 rho squared 5th Column sigma 21 rho 6th Column sigma 21 4th Row 1st Column sigma 21 2nd Column sigma 21 rho 3rd Column sigma 21 rho squared 4th Column sigma 2 squared 5th Column sigma 2 squared rho 6th Column sigma 2 squared rho squared 5th Row 1st Column sigma 21 rho 2nd Column sigma 21 3rd Column sigma 21 rho 4th Column sigma 2 squared rho 5th Column sigma 2 squared 6th Column sigma 2 squared rho 6th Row 1st Column sigma 21 rho squared 2nd Column sigma 21 rho 3rd Column sigma 21 4th Column sigma 2 squared rho squared 5th Column sigma 2 squared rho 6th Column sigma 2 squared EndMatrix


The following provides some further information about these covariance structures:

ANTE(1)

specifies the first-order antedependence structure (see Kenward 1987; Patel 1991; Macchiavelli and Arnold 1994). In Table 17, sigma Subscript i Superscript 2 is the ith variance parameter, and rho Subscript k is the kth autocorrelation parameter satisfying StartAbsoluteValue rho Subscript k Baseline EndAbsoluteValue less-than 1.

AR(1)

specifies a first-order autoregressive structure. PROC MIXED imposes the constraint StartAbsoluteValue rho EndAbsoluteValue less-than 1 for stationarity.

ARH(1)

specifies a heterogeneous first-order autoregressive structure. As with TYPE=AR(1), PROC MIXED imposes the constraint StartAbsoluteValue rho EndAbsoluteValue less-than 1 for stationarity.

ARMA(1,1)

specifies the first-order autoregressive moving-average structure. In Table 17, rho is the autoregressive parameter, gamma models a moving-average component, and sigma squared is the residual variance. In the notation of Fuller (1976, p. 68), rho equals theta 1 and

gamma equals StartFraction left-parenthesis 1 plus b 1 theta 1 right-parenthesis left-parenthesis theta 1 plus b 1 right-parenthesis Over 1 plus b 1 squared plus 2 b 1 theta 1 EndFraction

The example in Table 19 and StartAbsoluteValue b 1 EndAbsoluteValue less-than 1 imply that

b 1 equals StartFraction beta minus StartRoot beta squared minus 4 alpha squared EndRoot Over 2 alpha EndFraction

where alpha equals gamma minus rho and beta equals 1 plus rho squared minus 2 gamma rho. PROC MIXED imposes the constraints StartAbsoluteValue rho EndAbsoluteValue less-than 1 and StartAbsoluteValue gamma EndAbsoluteValue less-than 1 for stationarity, although for some values of rho and gamma in this region the resulting covariance matrix is not positive definite. When the estimated value of rho becomes negative, the computed covariance is multiplied by cosine left-parenthesis pi d Subscript i j Baseline right-parenthesis to account for the negativity.

CS

specifies the compound-symmetry structure, which has constant variance and constant covariance.

CSH

specifies the heterogeneous compound-symmetry structure. This structure has a different variance parameter for each diagonal element, and it uses the square roots of these parameters in the off-diagonal entries. In Table 17, sigma Subscript i Superscript 2 is the ith variance parameter, and rho is the correlation parameter satisfying StartAbsoluteValue rho EndAbsoluteValue less-than 1.

FA(q)

specifies the factor-analytic structure with q factors (Jennrich and Schluchter 1986). This structure is of the form bold upper Lamda bold upper Lamda prime plus bold upper D, where bold upper Lamda is a t times q rectangular matrix and bold upper D is a t times t diagonal matrix with t different parameters. When q > 1, the elements of bold upper Lamda in its upper-right corner (that is, the elements in the ith row and jth column for j > i) are set to zero to fix the rotation of the structure.

FA0(q)

is similar to the FA(q) structure except that no diagonal matrix bold upper D is included. When q < t—that is, when the number of factors is less than the dimension of the matrix—this structure is nonnegative definite but not of full rank. In this situation, you can use it for approximating an unstructured bold upper G matrix in the RANDOM statement or for combining with the LOCAL option in the REPEATED statement. When q = t, you can use this structure to constrain bold upper G to be nonnegative definite in the RANDOM statement.

FA1(q)

is similar to the TYPE=FA(q) structure except that all of the elements in bold upper D are constrained to be equal. This offers a useful and more parsimonious alternative to the full factor-analytic structure.

HF

specifies the Huynh-Feldt covariance structure (Huynh and Feldt 1970). This structure is similar to the TYPE=CSH structure in that it has the same number of parameters and heterogeneity along the main diagonal. However, it constructs the off-diagonal elements by taking arithmetic rather than geometric means.

You can perform a likelihood ratio test of the Huynh-Feldt conditions by running PROC MIXED twice, once with TYPE=HF and once with TYPE=UN, and then subtracting their respective values of –2 times the maximized likelihood.

If PROC MIXED does not converge under your Huynh-Feldt model, you can specify your own starting values with the PARMS statement. The default MIVQUE(0) starting values can sometimes be poor for this structure. A good choice for starting values is often the parameter estimates corresponding to an initial fit that uses TYPE=CS.

LIN(q)

specifies the general linear covariance structure with q parameters. This structure consists of a linear combination of known matrices that are input with the LDATA= option. This structure is very general, and you need to make sure that the variance matrix is positive definite. By default, PROC MIXED sets the initial values of the parameters to 1. You can use the PARMS statement to specify other initial values.

LINEAR(q)

is an alias for TYPE=LIN(q).

SIMPLE

is an alias for TYPE=VC.

SP(EXPA)(c-list)

specifies the spatial anisotropic exponential structure, where c-list is a list of variables indicating the coordinates. This structure has left-parenthesis i comma j right-parenthesis element equal to

sigma squared product Underscript k equals 1 Overscript c Endscripts exp left-brace minus theta Subscript k Baseline d left-parenthesis i comma j comma k right-parenthesis Superscript p Super Subscript k Superscript Baseline right-brace

where c is the number of coordinates and d left-parenthesis i comma j comma k right-parenthesis is the absolute distance between the kth coordinate (k equals 1 comma ellipsis comma c) of the ith and jth observations in the input data set. There are 2c + 1 parameters to be estimated: theta Subscript k, p Subscript k (k equals 1 comma ellipsis comma c), and sigma squared.

You might want to constrain some of the EXPA parameters to known values. For example, suppose you have three coordinate variables C1, C2, and C3 and you want to constrain the powers p Subscript k to equal 2, as in Sacks et al. (1989). Suppose further that you want to model covariance across the entire input data set and you suspect the theta Subscript k and sigma squared estimates are close to 3, 4, 5, and 1, respectively. Then specify the following statements:

repeated / type=sp(expa)(c1 c2 c3)
   subject=intercept;
parms (3) (4) (5) (2) (2) (2) (1) /
   hold=4,5,6;
SP(EXPGA)(sans-serif-italic c Subscript 1 Baseline sans-serif-italic c Subscript 2)

specify modification of the isotropic SP(EXP) covariance structure.

SP(GAUGA)(sans-serif-italic c Subscript 1 Baseline sans-serif-italic c Subscript 2)

specify modification of the isotropic SP(GAU) covariance structure.

SP(SPHGA)(sans-serif-italic c Subscript 1 Baseline sans-serif-italic c Subscript 2)

specify modification of the isotropic SP(SPH) covariance structure.

These are structures that allow for geometric anisotropy in two dimensions. The coordinates are specified by the variables c1 and c2.

If the spatial process is geometrically anisotropic in bold c equals left-bracket c Subscript i Baseline 1 Baseline comma c Subscript i Baseline 2 Baseline right-bracket, then it is isotropic in the coordinate system

bold upper A bold c equals Start 2 By 2 Matrix 1st Row 1st Column 1 2nd Column 0 2nd Row 1st Column 0 2nd Column lamda EndMatrix Start 2 By 2 Matrix 1st Row 1st Column cosine theta 2nd Column minus sine theta 2nd Row 1st Column sine theta 2nd Column cosine theta EndMatrix bold c equals bold c Superscript asterisk

for a properly chosen angle theta and scaling factor lamda. Elliptical isocorrelation contours are thereby transformed to spherical contours, adding two parameters to the respective isotropic covariance structures. Euclidean distances (see Table 18) are expressed in terms of bold c Superscript asterisk.

The angle theta of the clockwise rotation is reported in radians, 0 less-than-or-equal-to theta less-than-or-equal-to 2 pi. The scaling parameter lamda represents the ratio of the range parameters in the direction of the major and minor axis of the correlation contours. In other words, following a rotation of the coordinate system by angle theta, isotropy is achieved by compressing or magnifying distances in one coordinate by the factor lamda.

Fixing lamda equals 1.0 reduces the models to isotropic ones for any angle of rotation. If the scaling parameter is held constant at 1.0, you should also hold constant the angle of rotation, as in the following statements:

repeated / type=sp(expga)(gxc gyc)
           subject=intercept;
parms (6) (1.0) (0.0) (1) / hold=2,3;

If lamda is fixed at any other value than 1.0, the angle of rotation can be estimated. Specifying a starting grid of angles and scaling factors can considerably improve the convergence properties of the optimization algorithm for these models. Only a single random effect with geometrically anisotropic structure is permitted.

SP(MATERN)(c-list ) | SP(MATHSW)(c-list )

specifies covariance structures in the Matérn class of covariance functions (Matérn 1986). Two observations for the same subject (block of bold upper R) that are Euclidean distance d Subscript i j apart have covariance

sigma squared StartFraction 1 Over normal upper Gamma left-parenthesis nu right-parenthesis EndFraction left-parenthesis StartFraction d Subscript i j Baseline Over 2 rho EndFraction right-parenthesis Superscript nu Baseline 2 upper K Subscript nu Baseline left-parenthesis d Subscript i j Baseline slash rho right-parenthesis nu greater-than 0 comma rho greater-than 0

where upper K Subscript nu is the modified Bessel function of the second kind of (real) order nu greater-than 0. The smoothness (continuity) of a stochastic process with covariance function in this class increases with nu. The Matérn class thus enables data-driven estimation of the smoothness properties. The covariance is identical to the exponential model for nu equals 0.5 (TYPE=SP(EXP)(c-list)), while for nu equals 1 the model advocated by Whittle (1954) results. As nu right-arrow normal infinity the model approaches the gaussian covariance structure (TYPE=SP(GAU)(c-list)).

The MATHSW structure represents the Matérn class in the parameterization of Handcock and Stein (1993) and Handcock and Wallis (1994),

sigma squared StartFraction 1 Over normal upper Gamma left-parenthesis nu right-parenthesis EndFraction left-parenthesis StartFraction d Subscript i j Baseline StartRoot nu EndRoot Over rho EndFraction right-parenthesis Superscript nu Baseline 2 upper K Subscript nu Baseline left-parenthesis StartFraction 2 d Subscript i j Baseline StartRoot nu EndRoot Over rho EndFraction right-parenthesis

Since computation of the function upper K Subscript nu and its derivatives is numerically very intensive, fitting models with Matérn covariance structures can be more time-consuming than with other spatial covariance structures. Good starting values are essential.

SP(POW)(c-list) | SP(POWA)(c-list)

specifies the spatial power structures. When the estimated value of rho becomes negative, the computed covariance is multiplied by cosine left-parenthesis pi d Subscript i j Baseline right-parenthesis to account for the negativity.

SP(LEAR)(c-list)

specifies a linear exponent autoregressive (LEAR) correlation structure as proposed by Simpson et al. (2010). For two observations with distance metric d Subscript i j, the covariance is

normal upper C normal o normal v left-bracket xi Subscript i Baseline comma xi Subscript j Baseline right-bracket equals sigma squared StartLayout Enlarged left-brace 1st Row 1st Column rho Superscript d Super Subscript normal m normal i normal n Superscript plus delta left-bracket left-parenthesis d Super Subscript i j Superscript minus d Super Subscript normal m normal i normal n Superscript right-parenthesis slash left-parenthesis d Super Subscript normal m normal a normal x Superscript minus d Super Subscript normal m normal i normal n Superscript right-parenthesis right-bracket Baseline 2nd Column i not-equals j and d Subscript normal m normal i normal n Baseline not-equals d Subscript normal m normal a normal x Baseline 2nd Row 1st Column rho Superscript d Super Subscript normal m normal i normal n Baseline 2nd Column i not-equals j and d Subscript normal m normal i normal n Baseline equals d Subscript normal m normal a normal x Baseline 3rd Row 1st Column 1 2nd Column i equals j EndLayout

where d Subscript normal m normal i normal n and d Subscript normal m normal a normal x are the smallest and largest distance between any two observations, delta greater-than-or-equal-to 0 is the decay speed, and 0 less-than-or-equal-to rho less-than 1. See TYPE=SP(EXP) for the computation of the distance d Subscript i j from the variables specified in c-list. When the estimated value of rho becomes negative, the computed covariance is multiplied by cosine left-parenthesis pi d Subscript i j Baseline right-parenthesis to account for the negativity.

For power analysis of repeated measures designs that have a LEAR correlation structure, see the section POWER Statement in Chapter 55, The GLMPOWER Procedure.

Note that TYPE=SP(LEAR) is not supported for GROUP= option in this SAS release.

TOEP<(q)>

specifies a banded Toeplitz structure. This can be viewed as a moving-average structure with order equal to sans-serif-italic q negative 1. The TYPE=TOEP option is a full Toeplitz matrix, which can be viewed as an autoregressive structure with order equal to the dimension of the matrix. The specification TYPE=TOEP(1) is the same as sigma squared upper I, where I is an identity matrix, and it can be useful for specifying the same variance component for several effects.

TOEPH<(q)>

specifies a heterogeneous banded Toeplitz structure. In Table 17, sigma Subscript i Superscript 2 is the ith variance parameter and rho Subscript j is the jth correlation parameter satisfying StartAbsoluteValue rho Subscript j Baseline EndAbsoluteValue less-than 1. If you specify the order parameter q, then PROC MIXED estimates only the first q bands of the matrix, setting all higher bands equal to 0. The option TOEPH(1) is equivalent to both the TYPE=UN(1) and TYPE=UNR(1) options.

UN<(q)>

specifies a completely general (unstructured) covariance matrix parameterized directly in terms of variances and covariances. The variances are constrained to be nonnegative, and the covariances are unconstrained. This structure is not constrained to be nonnegative definite in order to avoid nonlinear constraints; however, you can use the TYPE=FA0 structure if you want this constraint to be imposed by a Cholesky factorization. If you specify the order parameter q, then PROC MIXED estimates only the first q bands of the matrix, setting all higher bands equal to 0.

UNR<(q)>

specifies a completely general (unstructured) covariance matrix parameterized in terms of variances and correlations. This structure fits the same model as the TYPE=UN(q) option but with a different parameterization. The ith variance parameter is sigma Subscript i Superscript 2. The parameter rho Subscript j k is the correlation between the jth and kth measurements; it satisfies StartAbsoluteValue rho Subscript j k Baseline EndAbsoluteValue less-than 1. If you specify the order parameter r, then PROC MIXED estimates only the first q bands of the matrix, setting all higher bands equal to zero.

UN@AR(1) | UN@CS | UN@UN

specify direct (Kronecker) product structures designed for multivariate repeated measures (see Galecki 1994). These structures are constructed by taking the Kronecker product of an unstructured matrix (modeling covariance across the multivariate observations) with an additional covariance matrix (modeling covariance across time or another factor). The upper-left value in the second matrix is constrained to equal 1 to identify the model. See the SAS/IML User's Guide for more details about direct products.

To use these structures in the REPEATED statement, you must specify two distinct REPEATED effects, both of which must be included in the CLASS statement. The first effect indicates the multivariate observations, and the second identifies the levels of time or some additional factor. Note that the input data set must still be constructed in "univariate" format; that is, all dependent observations are still listed observation-wise in one single variable. Although this construction provides for general modeling possibilities, it forces you to construct variables indicating both dimensions of the Kronecker product.

For example, suppose your observed data consist of heights and weights of several children measured over several successive years. Your input data set should then contain variables similar to the following:

  • Y, all of the heights and weights, with a separate observation for each

  • Var, indicating whether the measurement is a height or a weight

  • Year, indicating the year of measurement

  • Child, indicating the child on which the measurement was taken

Your PROC MIXED statements for a Kronecker AR(1) structure across years would then be as follows:

proc mixed;
   class Var Year Child;
   model Y = Var Year Var*Year;
   repeated Var Year / type=un@ar(1)
                       subject=Child;
run;

You should nearly always want to model different means for the multivariate observations; hence the inclusion of Var in the MODEL statement. The preceding mean model consists of cell means for all combinations of VAR and YEAR.

VC

specifies standard variance components and is the default structure for both the RANDOM and REPEATED statements. In the RANDOM statement, a distinct variance component is assigned to each effect. In the REPEATED statement, this structure is usually used only with the GROUP= option to specify a heterogeneous variance model.

Jennrich and Schluchter (1986) provide general information about the use of covariance structures, and Wolfinger (1996) presents details about many of the heterogeneous structures. Modeling with spatial covariance structures is discussed in many sources (Marx and Thompson 1987; Zimmerman and Harville 1991; Cressie 1993; Brownie, Bowman, and Burton 1993; Stroup, Baenziger, and Mulitze 1994; Brownie and Gumpertz 1997; Gotway and Stroup 1997; Chilès and Delfiner 1999; Schabenberger and Gotway 2005; Littell et al. 2006).

Last updated: December 09, 2022